Cloud Site Reliability Engineer Nesco Resource Dublin, OH Full-Time $115,000.00 - $120,000.00 / year

Kate

Administrator
Команда форума
Our client has an immediate need for a Cloud Site Reliability Engineer who will specialize in developing scalable methods for building, deploying, and supporting our Azure cloud enterprise services and systems. This is a highly collaborative role in which you will work closely with our Software Engineers to deploy and operate our solutions; automate and streamline our processes; build and maintain tools for deployment, monitor IT operations, and troubleshoot and resolve issues in our DEV, QA, UAT, and Production environments.

Primary Responsibilities
Build infrastructure & systems that provide high levels of scalability, reliability, and performance for applications, while balancing security, maintainability, and operational excellence
Interface across teams to codify and reliably test infrastructure changes using the software development lifecycle
Partner with Dev teams to provide guidance and best practices around scalability, reliability, and performance of our productions systems, infrastructure, and software
Work as a team on escalations, resolving critical issues that impact our highly available DEV, QA, UAT, and production systems
Troubleshoot priority incidents, facilitate blameless post-mortems and ensure permanent closure of incidents
Work with an innovative engineering team to continuously implement and improve reliable and rapid build environments for DEV & QA; provide timely build status updates; automate as much as possible to improve efficiency and quality
Promote innovation, implementation of cutting-edge technologies, outside-of-the-box thinking, teamwork, and self-organization
Work with SVN, GIT, Team Foundation version control, or other build tools in a CI/CD process to build and deploy to our Azure Cloud environment
Ensure traceability, observability, and retrievability of sources and deliverables
Build logging, monitoring, and alerting systems to identify bottlenecks and assist with debugging, analysis, and optimization in the Cloud environment
Improve operational efficiency through automation and deployment or development of new tools
Experiment with and recommend new technologies that simplify or improve the Cloud environment
Craft solid and clearly explained playbooks, and documentation, for consumption by teammates and the larger engineering organization
Participate in an off-hours on-call rotation, and perform periodic off-hours work during maintenance windows

Skills:

A Successful Candidate will have:
Bachelor's degree or equivalent experience in a Cloud Engineering, DevOps Engineering, Cloud infrastructure or software engineering discipline
6+ years of experience in a Cloud SRE/Infrastructure, DevOps Engineering, or Cloud related role
Proficiency in one or more technology domains, may be a cross-domain expert able to solve complex and mission critical problems within a business or across the firm
Experience with cloud-agnostic configuration management frameworks (Ansible, Terraform, etc)
Experience configuring and managing Azure cloud infrastructure (AWS, GCP)
Understanding of SSH, VPN, TCP/IP, DNS, HTTP(S), network routing, load balancers, cloud services, cloud storage, and subnetting
Experience with managing and tuning datastore clusters (Elasticsearch, RDS, SQL, and MySQL)
Experience with CI/CD pipelines such as Azure DevOps, Jenkins, etc.
System Observability experience (Azure Monitor, New Relic, Zabbix, CloudWatch, PagerDuty, Datadog, SignalFx, Graphana, etc)
Knowledge of Linux (Red Hat/CentOS) architecture, security, administration, performance monitoring/tuning, troubleshooting, and production operations
Fluent in Python and PowerShell Scripting, with experience implementing automation and monitoring using shell scripting and other related tools
Experience with containerization technologies (Docker, Kubernetes, etc)
Deep experience analyzing performance, end to end service experience and overall system health.
Excellent debugging and trouble shooting skills

Nesco Resource and affiliates (Lehigh G.I.T Inc, and Callos Resource, LLC) is an equal employment opportunity employer and does not discriminate on the basis of race, color, religion, sex, sexual orientation, gender identity, national origin, disability, age, or veteran status, or any other legally protected characteristics with respect to employment opportunities.

Recommended Skills​

Kubernetes

Terraform

Docker

Datadog

Ansible

Cloudwatch

About the company​

 
Сверху