Passer au contenu principal

Site Reliability Engineer - Intermediate

Technology

 

Site Reliability Engineering (SRE) is a discipline that combines software and systems engineering for building and running large-scale, distributed, fault-tolerant systems. SRE ensures that internal and external services meet or exceed reliability and performance expectations while adhering to Equifax engineering principles.

SRE is also an engineering approach to building and running production systems – we engineer solutions to operational problems. As SREs are responsible for overall system operation, we use a breadth of tools and approaches to solve a broad set of problems. Practices such as limiting time spent on operational work, blameless postmortems, proactive identification, and prevention of potential outages.

 

What You’ll Do

  • Responsible for blameless postmortems and proactive identification of potential outages factor into iterative improvement.

  • Participate in release cycles of our offerings, deploying code to integration, staging and production environments, integrating with continuous integration (CI) and continuous delivery (CD) tools, monitoring, and change management

  • Build Automation Work with Agile development teams to ensure smooth promotion of code, configuration and Docker images to production

  • Identify single points of failure and other high-risk architecture issues; propose and implement more resilient resolutions

  • Investigate root cause of severe and systemic outages, identify corrective actions

  • As we transition to the Public cloud (AWS or Google), build new build and deployment patterns.

 

What experience you need

  • Bachelor's Degree in Computer Science, Information Management or in “STEM” Majors

  • Experience with configuring, customizing, and extending monitoring tools (Appdynamics, Apica, Sensu, Grafana, Prometheus, Graphite, Splunk, Zabbix, Nagios etc.)

  • 5+ years’ experience in continuous integration tools (Jenkins, SonarQube, JIRA, Nexus, Confluence, GIT-BitBucket, Maven or Gradle)

  • 2+ years’ experience with configuration management and automation (Terraform, Ansible, Puppet, Chef, Salt)

  • 2+ years’ experience deploying and managing infrastructure on public clouds (AWS, GCP, or Azure or Pivotal).

  • 2+ years experience working on Kubernetes and other related applications.

  • 3+ years’ experience in Linux environments (CentOS).

 

What could set you apart

  • Hands on experience Configuring and Administering SCM(GIT, SVN), Build (CMake, Make files, Maven), Nexus, CI(Jenkins), CD Automation Tools

  • Experience managing Infrastructure as code via tools such as Terraform or CloudFormation

  • Experience with  large scale cluster management systems (Mesos, Kubernetes)

  • Experience with Docker-based containers is a plus

We offer comprehensive compensation and healthcare packages, 401k matching, paid time off, and organizational growth potential through our online learning platform with guided career tracks.

If this sounds like somewhere you want to work, don’t delay, apply today - we’re looking for you!

All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or status as a protected veteran

Travailler chez Equifax 

Nous croyons en une mentalité de croissance. Chez Equifax, cela comprend offrir à nos employés des occasions de donner le meilleur d’eux-mêmes et d’acquérir de nouvelles compétences en cours de route pour inspirer et bâtir des carrières épanouissantes.

 

Laptopv2

Joignez-vous à notre communauté de talents

En savoir plus sur les possibilités de carrière et les événements à venir chez Equifax

S’inscrire