Saltar al contenido principal

Site Reliability Engineer - Career


Site Reliability Engineering (SRE) is a discipline that combines software and systems engineering for building and running large-scale, distributed, fault-tolerant systems. SRE ensures that internal and external services meet or exceed reliability and performance expectations while adhering to Equifax engineering principles.

SRE is also an engineering approach to building and running production systems – we engineer solutions to operational problems. As SREs are responsible for overall system operation, we use a breadth of tools and approaches to solve a broad set of problems. Practices such as limiting time spent on operational work, blameless postmortems, proactive identification, and prevention of potential outages.


What You’ll Do

  • Work with teams across an organization and ensure core services reliability and keep an eye on capacity and performance.

  • Responsible for blameless postmortems and proactive identification of potential outages factor into iterative improvement.

  • Participate in release cycles of our offerings, deploying code to integration, staging and production environments, integrating with continuous integration (CI) and continuous delivery (CD) tools, monitoring, and change management

  • Build Automation Work with Agile development teams to ensure smooth promotion of code, configuration and Docker images to production

  • Identify single points of failure and other high-risk architecture issues; propose and implement more resilient resolutions

  • Investigate root cause of severe and systemic outages, identify corrective actions

  • As we transition to the Public cloud (AWS or Google), build new build and deployment patterns.


What experience you need

  • Bachelor's Degree in Computer Science, Information Management or in “STEM” Majors

  • Experience with configuring, customizing, and extending monitoring tools (Appdynamics, Apica, Grafana, Prometheus, Graphite, Splunk etc.)

  • 5+ years’ experience in continuous integration tools (Jenkins, SonarQube, JIRA, Nexus, Confluence, GIT-BitBucket, Maven or Gradle)

  • 2+ years’ experience with configuration management and automation 

  • 2+ years’ experience deploying and managing infrastructure on public clouds (AWS, GCP, or Azure or Pivotal)

  • 2+ years experience working on Kubernetes and other related applications.

  • Experience working with Nginx, Tomcat, HAProxy, Redis, Elastic Search, MongoDB, Kafka, Zookeeper.


  What could set you apart

  • Hands on experience Configuring and Administering SCM(GIT, SVN), Build (CMake, Make files, Maven), Nexus, CI(Jenkins), CD Automation Tools

  • Experience with  large scale cluster management systems (Mesos, Kubernetes)

  • Experience with Docker-based containers is a plus

  • Able to dive into any level of a modern internet service (schedulers, containers, Linux kernel, caching, object storage, distributed file systems, RDBMS, NoSQL, etc.)


We offer comprehensive compensation and healthcare packages, 401k matching, paid time off, and organizational growth potential through our online learning platform with guided career tracks.

If this sounds like somewhere you want to work, don’t delay, apply today - we’re looking for you!

All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or status as a protected veteran

El trabajo en Equifax

Creemos en una mentalidad de crecimiento. En Equifax, esto incluye brindar a nuestros empleados oportunidades para desempeñarse al máximo y aprender nuevas habilidades a lo largo del camino para inspirar y desarrollar carreras profesionales satisfactorias


Únase a nuestra comunidad de talentos

Obtenga información sobre las próximas oportunidades y eventos profesionales en Equifax