Senior Site Reliability Engineer Performance Objectives:
Contact Charlene Theile, our Professional IT Recruiter, at
- Develop, maintain, and constantly improve on-premises infrastructure while building next-generation infrastructure as code (IaC) for our migration to the cloud
- Continuously improve the visibility of the stack with enhanced logging, metrics, tracing, and relevant statistics
- Design custom dashboards that aggregate relevant data into easy-to-digest views, and custom alerts based on relevant thresholds
- Work with Software Architects to design highly available environments in the cloud through IaC and configuration management
- Work closely with the product team to deliver cutting-edge functionality as efficiently as possible, leveraging your continually improving CI/CD pipelines and deployment utilities
- Write custom Terraform, Python, shell, and YAML scripts to automate the entire build and deployment process from staging to production
- Assist in the migration of legacy applications from monolithic architectures to service-based containers for scalability, reliability, and quicker deployments
- Share on-call responsibilities with other team members to help meet 24/7/365 SLAs
- Troubleshoot and analyze issues, and help product and customer success teams identify client pain points and resolve them as quickly as possible in a repeatable, automated manner
- Work with agile teams to bring your DevOps expertise across disparate projects and disciplines
Qualifications:
- 7+ years of Linux administration
- 4+ years of experience in site reliability engineering, DevOps engineering, and CI/CD tooling
- 2+ years working in a software operations production environment
- 2+ years of experience working with cloud technologies (AWS, GCP, Azure)
- Expertise in application monitoring, telemetry gathering, and associated utilities (Nagios, Icinga, Datadog, New Relic, Fluentd, AWS CloudWatch, GCP Cloud Logging/Stackdriver, etc.)
- Experience with centralized logging pipelines (ELK, AWS CloudWatch Logs, Stackdriver, etc.) and exposure to parsing concepts (Grok filters, regex)
- Experience with common CI/CD tools (GitLab, Jenkins, CircleCI, etc.)
- Experience working within an Agile/Scrum team
- Familiarity with MongoDB, PostgreSQL, Redis
- Experience with container technologies (Docker, Kubernetes) from building to deploying
- Exposure to configuration management utilities (Ansible, Chef, Puppet)
- Microservice exposure, conceptual understanding of application decoupling
- Strong networking fundamentals
- Cross-functional acumen
Recommended Skills
Kubernetes
Terraform
New Relic
Ansible
Docker
Datadog