Summary of Responsibilities
The role of the Data Engineer will be to maintain and further develop the modern, scalable, baseball data pipeline for the St. Louis Cardinals. This person will collaborate with the Baseball Systems group to ensure high quality data is available to scouts, coaches, players, and other baseball decision-makers. This person should be detail-oriented, enjoy collaborating with others, communicate effectively, both verbally and written, have a growth mindset, and love the game of baseball.
Essential Functions of the Job
-
Build and support components of our data pipeline that ingests raw baseball data and outputs baseball data ready for review and analytics modeling by Baseball Operations
-
Continuously extend our data pipeline to ingest additional data sources and handle increasingly dense datasets
-
Continuously improve our data pipeline by reducing latency, reducing cost, and reducing errors
-
Communicate effectively with Baseball Operations staff to ensure we are anticipating and supporting their data needs
-
Rigorously test our data pipeline to improve its quality and maintainability over time
Minimum Education and Experience
-
Bachelor's degree in a technical field, or a combination of relevant education and work experience
-
Experience identifying, triaging, and resolving data issues
-
Interest in modern data system architectures, design patterns, and best practices
-
Ability to apply creative solutions to challenging technical tasks
-
Ability to work independently in a fast-paced environment
-
Proficiency with more than one modern programming languages
-
Familiarity with data-related concepts such as data pipelines, databases, SQL, JSON, and REST APIs
Education and Experience Preferred
-
Professional experience in a software engineering, data reliability, and/or a quality assurance environment
-
Proficiency with Python, Go, TypeScript, and/or Node.js
-
Proficiency with DevOps tools including Git, CI/CD pipelines, and configuration-as-code
-
Proficiency with Cloud computing, Kubernetes, and/or container-based or serverless application deployment
-
Literacy in Spanish, Korean, or Japanese