Location: REMOTE
Description:
As an SRE, you will utilize your software, systems engineering, and operations background to build and run large-scale, fault-tolerant systems.
Your role is to ensure the reliability, scalability and maximum uptime of our Cloud Platform.
Technology & Innovation Division
Working in the Technology & Innovation group, you will drive, develop, and maintain solutions for clients and colleagues.
This is an exciting time of technology advancement and innovation across the bank, particularly within our technology teams.
Responsibilities
Design, develop and implement solutions that improve stability, security, scalability and availability of clients' software platforms.
Design mechanisms for alerts and responses to identify and address reliability risks.
Support services before they go live through activities such as system design consulting, developing software platforms and frameworks, planning, and reviews.
Maintain services once they are live by measuring and monitoring availability, latency and overall system health.
Design and run performance, capacity and monitoring tests.
Create educational documentation on how-tos and best practices, and blog about use cases and architectures that relate to cloud platforms.
Liaise with the team managing our public cloud environments, including setup, management, and troubleshooting.
Create educational material such as cloud-native sample apps and starter code, and help run cloud-native educational events like hackathons and live coding sessions.
Skills:
5+ years of experience in an operational role, DevOps, SRE, or Software Engineering
5+ years of experience doing development in any of .NET, Java, Node.js, .NET Core, Python
3+ years of experience with development or administration on any cloud platforms (Cloud Foundry, Heroku, AWS, Azure, Google Cloud, IBM Cloud, Bluemix, Kubernetes, and others). The ideal candidate has significant experience with a Platform as a Service cloud such as Cloud Foundry.
Expertise in Prometheus (client library and application instrumentations, PromQL), Grafana (GraphQL, Metadata, Dashboard Skills), Dynatrace, Kubernetes, and PagerDuty with ITIL Background.
Education: Bachelor's degree in Computer Science, MIS, or a related field
Required Skills:
NETWORK MONITORING
PYTHON
SPLUNK
ELASTICSEARCH
METRICS
This job, Dev Ops Engineer IV, and many more are available through The Judge Group.
Recommended Skills
Test Automation
Software Testing
Software Quality Control
Selenium
Performance Testing
HP QuickTest Professional