As a Staff Site Reliability Engineer for the Data Engineering team, you will be responsible for maintaining and enhancing the reliability of our data infrastructure. Your work will directly impact the availability and performance of our data services, enabling the organization to make better decisions. You will collaborate closely with data engineers and software engineers to drive 100% automation and to develop best practices for deep monitoring and alerting. This role will report to our Director of Data Engineering and will be based out of our Mexico City office.
About You
- Bachelor's degree in Computer Science, Information Technology, or a related field.
- 12+ years of experience in site reliability engineering, database operations, or a related role with a focus on data platforms, data stores, and data operations.
- Extensive experience with the AWS cloud platform and its data-related services.
- Proficiency in monitoring tools (e.g., Datadog, CloudWatch, DevOps Guru, DB Performance Insights).
- Proficiency in one or more programming languages (e.g., Python, Java).
- Proficiency in automation frameworks (e.g., Terraform, CloudFormation).
- Strong understanding of performance metrics at both a high level and a low level, such as disk I/O saturation.
- Experience in identifying and eliminating system bottlenecks.
- Strong understanding of database internals such as index types, schemas, and query plans.
- Strong understanding of database systems (e.g., SQL, NoSQL) and experience in managing large-scale data infrastructures.
- Strong understanding of, and hands-on experience implementing, CI/CD pipelines and DataOps practices.
- Experience with data governance, compliance, and lifecycle management.
- Ability to own and execute projects while effectively collaborating with the team to influence and shape the vision of the data engineering organization.
#LifeAtCrunchyroll #LI-Hybrid
About our Values
We want to be everything for someone rather than something for everyone, and we do this by living and modeling our values in all that we do. We value:
- Courage. We believe that when we overcome fear, we enable our best selves.
- Curiosity. We are curious, which is the gateway to empathy, inclusion, and understanding.
- Service. We serve our community with humility, enabling joy and belonging for others.
- Kaizen. We have a growth mindset committed to constant forward progress.