MAXPAYLOAD is an alternative data factory company that creates tooling and systems to manufacture synthetic data for competitive intelligence decision making.
We are seeking a Senior Data Factory Automation Engineer with 3-5 years of experience. Ideal candidates will be a player/coach with hands-on experience in designing, building, and scaling data platforms and products using modern cloud technologies. This role requires an individual with expertise in Python development, cloud data platforms (e.g, AWS, Snowflake) and knowledge of modern data modeling and storage formats. Must be a self-starter that can lead complex automation projects and guide junior developers. THIS IS AN IN-PERSON POSITION.
Important Requirements
- Must be based in the Washington DC metro area
- Must be authorized to work in the US
- No visa sponsorship available. We are unable to sponsor a new applicant for employment authorization, or offer any immigration related support for this position (i.e. H1B, F-1 OPT, F-1 STEM OPT, F-1 CPT, J-1, TN, or another type of work authorization).
Responsibilities
- Design, develop, and deploy high-performance Python-based data pipelines for structured, semi-structured, and unstructured data.
- Build and deploy customized data product manufacturing tools and data factory automation.
- Implement data processing solutions with batch, real-time, and event-driven data processing frameworks. Collaborate with team members to build features that solve real customer needs.
- Create scalable data processing pipelines.
- Write clean, maintainable, and well-documented code while following best practices for software development and testing.
- Conduct code reviews and provide constructive feedback to peers, ensuring high standards of quality and performance.
- Enforce data integrity, lineage, and observability by developing resilient and maintainable code.
- Solid understanding of statistical modelling and machine learning algorithms, and experience deploying and managing models in production
- Experience with Aviation data sets is desirable.
- Very good data engineering skills, with the ability to manipulate and process large amounts of data at scale.
- Collaborate on the creation of proof-of-concept data manipulation scripts in to determine the feasibility of using our data sets to create a desired analysis.
- Collaborate on the development and application of data mining and machine learning algorithms for advanced analysis and prediction.
- Project leader and coder, able to prioritize simultaneously across several projects and to lead and coordinate larger initiatives and meet deadlines in a fast-paced environment.
- Mentor junior engineers and contribute to continual team growth.
Basic Qualifications:
- Minimum 3-5 years of experience in Python and modern data engineering solutions.
- Core Python skills: in Python programming language including experience with Python data libraries (Pandas, NumPy, PyArrow, Dask) and a proven ability to write efficient, scalable, and maintainable code.
- Modern Data Engineering skills: in Django and SQL. Strong ability to build data-ingestion and data-processing/curation frameworks; including batch, streaming, and event-driven data pipelines
- Minimum 3 years of Cloud Data Platform skills: launching cloud data products on AWS (Lambda, S3) and Snowflake
- Minimum 5+ years’ Data Management - experience related to Data Management or Data Science, including Data analysis and correlation, data management, data mining, schema design, data lakes, metadata use and statistical modeling
- Excellent problem-solving skills and the ability to work in a fast-paced, dynamic environment.
- Experience with transportation network datasets and related graphs.
- Strong communication skills, both written and verbal, to effectively collaborate with team members and stakeholders.
- Skilled in data visualization (Tableau, Power BI).
- MS Office Suite (Teams, Word, Excel, PowerPoint, Outlook, etc.)
- Bachelor's degree in Computer Science (required), Data Science, Engineering, or related field
Our Technology Stack
- Frontend: React, Next.js, TypeScript, Prisma ORM, tRPC, and Next Auth
- Backend: Python, Django, DjangoAPI, REST APIs
- Data: PostgreSQL, Elasticsearch,
- Infrastructure: AWS, Docker, Vercel, Heroku, Snowflake
Job Type: Full-time
Benefits:
- 401(k)
- Dental insurance
- Health insurance
- Life insurance
- Paid time off
Compensation Package:
- Bonus opportunities
- Yearly pay
Application Question(s):
- Are you located in the Washington, DC area?
- Do you have an Undergraduate (not a masters) degree in computer science/engineering from a US-based university?
- Are you legally authorized to work in the US without immigration related sponsorship now and in the future? That includes visas such as H1B, F-1 OPT, F-1 STEM or other similar post-graduate visas? If yes, please indicate authorization.
Work Location: In person