Job Description
Tickets.com , an
MLB company , delivers innovative, cutting-edge technologies to enable frictionless and unforgettable fan experiences in venues across the globe. Together with MLB, Tickets.com is changing the landscape of the live sports and entertainment industry, delivering new digital venue and ticketing experiences to millions of fans. Our Technology team builds platforms and products that provide a new smart ticketing solution and venue experience. Using cutting-edge technology, our platform and applications are consumed by fans, stadiums, and MLB teams.
We are assembling a world-class team to build on these experiences and to scale platforms and products that anticipate emerging opportunities, including dynamic pricing and offers and digital, contactless ticketing. Our mission is to provide premium, innovative live experiences for our clients and their patrons.
Tickets.com is looking for a
Site Reliability Engineer passionate about building engaging products for our fans.
The Opportunity: The
Site Reliability Engineer will join the Infrastructure Engineering team at Tickets.com, while also working alongside MLB team members and help to drive adoption of best practices across the following areas:
- Uptime, High Availability and Disaster recovery
- Incident response
- Identify SLIs and define SLOs
- Observability tooling
- Debugging running systems and providing tools to assist runtime debugging
- Optimizations for cost control
Essential Job Functions: - Work both independently with little supervision and in a team environment
- Prioritize unblocking your teammates, collaboration and knowledge sharing
- Collaborate with teams to ensure the availability, security, and integrity of services
- Help define and configure relevant system, application, and database metrics to ensure observability.
- Create and maintain dashboards and reports to visualize systems and database performance and health
- Create monitoring and alerting to detect error conditions, degradation symptoms, and outages
- Help identify automation and self-service opportunities for infrastructure and database operational tasks to enhance reliability, efficiency and reduce manual toil
- Support and debug production issues across services and all levels of the stack
- Engage in driving improvements to our incident response and participate in on-call rotations
- Continuously identify opportunities for process improvement
Requirements: - Minimum of a bachelor's degree in computer science, MIS or a related field, and five (5) years of relevant experience including software or reliability engineering, or combination of education, training, and experience.
- Strong communication skills and the ability to convey technical information about cloud, container workloads, DevOps, and SRE Principals to all levels of the organization
- Demonstrable experience in automation, alerting, and remediation with a passion for reducing toil
- Have written code in a compiled language that runs in production somewhere
- Have written code in interpreted languages
- Experience with cloud services (e.g., AWS, Google Cloud Platform)
- Experience with DevOps practices and tools (e.g. Terraform, Git, CICD)
- Experience with real-time log/event monitoring tools (e.g., DataDog, Cloud Logging, Splunk)
- Experience working in an environment running mission critical, transactional, and analytic datastores and pipelines (e.g., Oracle, Postgres, Mongo, BigQuery, Kafka, Airflow)
- Experience in Linux OS and scripting languages
- Understanding of networking and connectivity in the context of distributed systems
- Excellent problem solving and troubleshooting skills
- Ability to work non-standard shifts including nights and/or weekend on-call responsibilities
- Dedicated to continuous improvement of yourself and our SRE capabilities
Key Technical Traits - Experience with APIs and microservices: REST, Web, GraphQL
- Database Solutions: Oracle, MYSQL, MSSQL, CloudSQL, NoSQL
- Cloud Providers: Google Cloud Platform, Oracle Cloud Infrastructure, AWS
- Real-time log/event monitoring - DataDog, Google Cloud Logging, Splunk
- Programming Languages - Go, Python, Bash, Java, JavaScript
- Scripting: PL/SQL, Shell
- Software Development tools - Jira, GIT, ArgoCD, Terraform
- Compliance: PCI DSS, SSAE18/SOC 1
Salary Range is $140-160K We offer an Outstanding Benefits Package that includes: - Medical
- Dental
- Vision
- STD & LTD
- 401K Retirement Plan
- Basic Life & AD&D
- Supplemental Life Insurance
- Paid Time Off (PTO, STO, Holidays including Year-End Holiday Break)
- HSA
- Pet Insurance
- Tuition Reimbursement
- Flexible Hybrid Work Environment
- MLB Tickets
Tickets.com is an Equal Opportunity Employer. Please click here to view our CCPA.
Job Tags
Holiday work, Flexible hours, Shift work, Night shift,