Вакансия: DevOps Engineer (удаленно)
PROJECT: Bell.One platform designed to GCP and AWS. The platform is set of services for infrastructure support for machine learning (neuton.ai) and some other services in cloud platforms www.bell.one
What we need:
- DevOps or SRE Engineer to architect, build and implement an Enterprise Monitoring solution for our new SaaS Based Solution
- The successful candidate will be chartered with Automating the detection and subsequent resolution of events within the environment
- Experienced with Linux systems administration, including solid scripting skills (Python,Go, XML) and Bash
- Someone with a wealth of knowledge and desire to work in Cloud-Based platforms and technologies such as GCP (Preferred), AWS or Azure
- Ability to deploy Monitoring applications in a containerized infrastructure (ex: Prometheus, Grafana, Stackdriver, Nagios or Zabbix )
- Setup/Configuring of Log collecting, handling, tuning, and filtering; Including best practice scenarios. (ex: logrhythm, fluentd, graylog, elastic search and/or kibana)
- Experience working in DevOps and Agile Environments
What you need
- 3-5 years of experience operating, building, automating monitoring events for SaaS Based solution
- CI/CD: 3 years (Git, Jenkins)
- At least 2 years of scripting skills (Bash, Python, Go)
- At least 2 years of experience with containerized infrastructure (Kubernetes/Docker)
- An understanding of operating systems, platforms, and infrastructure including Linux, Docker, virtualization, AWS, GCP, etc.
- Profiling, performance tuning, monitoring, alerting, and troubleshooting.
- Configuration and tuning high load Log collectors. ( ELK/EFK )
- Apply infrastructure-as-code patterns to increase the reliability and predictability of our services
You may be a fit for this role if you:
- Think about systems - edge cases, failure modes, interactions
- Are experienced in configuration management tools like Ansible, Terraform
- Have strong programming skills
- Know how to scale up an application and high availability principals
- Deployment using helm