Senior Site Reliability Engineer Job at Diversity Resource Staffing. Inc, Atlanta, GA

  • Diversity Resource Staffing. Inc
  • Atlanta, GA

Job Description

This is an exciting opportunity for a Senior Site Reliability Engineer on the Consumer SRE Team in the IMT division to provide secure, resilient, scalable, and maintainable services for mortgage borrowers and lenders. IMT is a division of our Atlanta-based client, which operates numerous financial and commodity marketplaces and exchanges, including the New York Stock Exchange (NYSE).


Automation is a big part of what we do: we use infrastructure-as-code within our hybrid cloud to bring stability and scalability to Windows, Linux, Docker, and serverless applications in AWS, Azure, and on-prem environments. We reduce toil by scripting and automating repetitive tasks. You will collaborate with developers to deliver robust services, build actionable alerts that detect or prevent incidents and surface performance bottlenecks, and write automation to remediate issues.



Responsibilities
  • Employ deep troubleshooting skills to improve the availability, performance, and security of Ellie Mae Services.
  • Ensure services are designed for 24/7 availability, operational readiness, and rigor
  • Implement proactive monitoring, alerting, trend analysis and self-healing systems
  • Define and measure KPIs and SLOs
  • Build automated deployments, automated tests, and operational tools
  • Participate in on-call rotation for Production support
  • Collaborate with Product and Support teams to plan and deploy product releases
  • Partner with other SREs and lead by example

Knowledge and Experience
  • 10+ years of Application/Systems engineering in 24x7 Production Services environments
  • BS in Computer Science, Computer Engineering, Math, or equivalent professional experience
  • Excellent troubleshooter, utilizing a systematic problem-solving approach
  • Demonstrated ability to lead incident response and root cause analysis (RCA)
  • Fluency with one or more current-generation scripting languages used by SRE/DevOps professionals (PowerShell, Python, Perl, PHP, Ruby), plus Java/.NET development
  • Experience running a SaaS application in a public cloud, on-prem or hybrid cloud environment

Additional credit for:

  • Proficiency in Windows and on-prem environments
  • Experience with Continuous Integration and Continuous Delivery concepts
  • Automation in RunDeck or Jenkins
  • Infrastructure-as-code or Configuration Management, utilizing tools like Terraform, CloudFormation or Chef/SaltStack/Puppet/DSC
  • Containers/Docker/Microservices
