You will be responsible for deploying and operating core services for for Zoom applications and products.
Responsibilities
Design and implement zero-downtime to accomplish highly available service (99.99%)
Design and implement disaster recovery (DR) between different region Data Centers
Troubleshoot complex production issues, including performance and function issues
Cooperating with cloud vendor and infrastructure, engineering team for security and service availability
Provide deep level of outage troubleshooting for systems and Zoom backend service
Provide the CI/CD model to deploy and configure the production system
Requirements
4 + years experience as a DevOps Engineer, 6 + years in a Engineering or Information Technology role
In depth knowledge of Linux: RedHat, CentOS, Debian, etc.
Mandarin or Cantonese communication is a strong plus
Strong analytical and troubleshooting skills
Working knowledge of Ansible and Jenkins
Experience with various monitoring, such as Zabbix, AWS Cloudwatch.
Experience in Java, JVM performance, tomcat and SQL
Experience with Nginx, ETCD in production deployment and troubleshooting
Experience working with AWS services, such as Dynamodb, RDS, S3, Route53, etc.
Experience with Source Code Management tools (Git, Gitlab) and an understanding of branching and integration processes.
Solid Bash or Python scripting experience.
Experience with ELK, Elasticsearch/ES, Kafka, Nginx with performance tuning is a strong plus
Experience with Kubernetes and Docker is a strong plus
If any issues arise with your assigned services to monitor, you'll need to be available to triage and resolve
Must be able to work some weekends for deployments. Typically these are 2x per month on Saturday at 5pm PT to 12am PT
Must be able to have some on-call time. There are approximately 13 people rotating weekly for this
Applicants must be currently authorized to work in the United States on a full-time basis
Excellent communication skills
BS degree in a related field, MS preferred
Ensuring a diverse and inclusive workplace where we learn from each other is core to Zoom’s values. We welcome people of different backgrounds, experiences, abilities and perspectives including qualified applicants with arrest and conviction records as well as any qualified applicants requiring reasonable accommodations in accordance with the law.
We believe that the unique contributions of all Zoomies is the driver of our success. To make sure that our products and culture continue to incorporate everyone's perspectives and experience we never discriminate on the basis of race, religion, national origin, gender identity or expression, sexual orientation, age, or marital, veteran, or disability status.
All your information will be kept confidential according to EEO guidelines.
Explore Zoom:
Find us on social at the links below and on
Responsibilities
Design and implement zero-downtime to accomplish highly available service (99.99%)
Design and implement disaster recovery (DR) between different region Data Centers
Troubleshoot complex production issues, including performance and function issues
Cooperating with cloud vendor and infrastructure, engineering team for security and service availability
Provide deep level of outage troubleshooting for systems and Zoom backend service
Provide the CI/CD model to deploy and configure the production system
Requirements
4 + years experience as a DevOps Engineer, 6 + years in a Engineering or Information Technology role
In depth knowledge of Linux: RedHat, CentOS, Debian, etc.
Mandarin or Cantonese communication is a strong plus
Strong analytical and troubleshooting skills
Working knowledge of Ansible and Jenkins
Experience with various monitoring, such as Zabbix, AWS Cloudwatch.
Experience in Java, JVM performance, tomcat and SQL
Experience with Nginx, ETCD in production deployment and troubleshooting
Experience working with AWS services, such as Dynamodb, RDS, S3, Route53, etc.
Experience with Source Code Management tools (Git, Gitlab) and an understanding of branching and integration processes.
Solid Bash or Python scripting experience.
Experience with ELK, Elasticsearch/ES, Kafka, Nginx with performance tuning is a strong plus
Experience with Kubernetes and Docker is a strong plus
If any issues arise with your assigned services to monitor, you'll need to be available to triage and resolve
Must be able to work some weekends for deployments. Typically these are 2x per month on Saturday at 5pm PT to 12am PT
Must be able to have some on-call time. There are approximately 13 people rotating weekly for this
Applicants must be currently authorized to work in the United States on a full-time basis
Excellent communication skills
BS degree in a related field, MS preferred
Ensuring a diverse and inclusive workplace where we learn from each other is core to Zoom’s values. We welcome people of different backgrounds, experiences, abilities and perspectives including qualified applicants with arrest and conviction records as well as any qualified applicants requiring reasonable accommodations in accordance with the law.
We believe that the unique contributions of all Zoomies is the driver of our success. To make sure that our products and culture continue to incorporate everyone's perspectives and experience we never discriminate on the basis of race, religion, national origin, gender identity or expression, sexual orientation, age, or marital, veteran, or disability status.
All your information will be kept confidential according to EEO guidelines.
Explore Zoom:
Find us on social at the links below and on