Bell.One platform designed to GCP and AWS. The platform is set of services for infrastructure support for machine learning (neuton.ai) and some other services in cloud platforms www.bell.one
REQUIREMENTS:
Experience in an Enterprise Environment by implementing, configuring and supporting monitoring environments
Ability to deploy Monitoring applications in a containerized infrastructure (ex: Prometheus, Grafana, Stackdriver, Nagios or Zabbix )
Setup/Configuring of Log collecting, handling, tuning and filtering; Including best practice scenarios. (ex: logrhythm, fluentd, graylog, elastic search and/or kibana)
Programming/Scripting knowledge in one of the two: Bash/Python
Hands-on experience, minimum one (1) year experience with at least one cloud platform: GCP, AWS, Openstack, etc
At least 3 years working experience with Linux servers in System Administration
Fluent English
JOB RESPONSIBILITIES:
Create NOC Dashboards for L1/L2 Support Teams
Aid Enterprise Engineering teams by implementing monitoring solutions in the cloud, hybrid-cloud and container-based environments
Design and implement complex alerting based upon trending/correlated events
Collaborate with the Support and Management teams to provide support and training to L1/L2 Support teams
Produce streamlined, searchable and efficient Logging solutions for Development and Security Teams