Senior Data Engineer - Data Pipeline Architecture, ETL Systems & AI Platform Development
Posted 2026-05-06About arenaflex
Welcome to arenaflex, where we're on a mission to build safe and beneficial artificial general intelligence that benefits all of humanity. As a leader in cutting-edge AI research and deployment, we push the boundaries of what AI systems can accomplish and strive to deliver these breakthroughs to the world through innovative products. Our work sits at the intersection of cutting-edge technology, scientific research, and real-world application, making arenaflex one of the most exciting places to build your career in tech.
At arenaflex, we believe that artificial intelligence has the potential to help people solve massive global challenges, and we're committed to ensuring the upside of AI is widely shared. Our team of world-class researchers, engineers, and business professionals work collaboratively to create AI systems that are both powerful and safe. We value diverse perspectives, voices, and experiences, because we know that the best solutions come from inclusive teams.
We're currently seeking a talented and passionate Data Engineer to join our growing data infrastructure team. This is a unique opportunity to work at the forefront of AI technology, collaborating directly with the researchers behind some of the most advanced AI models in the world. If you're passionate about data, love solving complex engineering challenges, and want to make an impact with your work, arenaflex might be the perfect place for you.
Position Overview
As a Data Engineer at arenaflex, you'll play a critical role in building and maintaining the data infrastructure that powers our AI systems and business operations. You'll design, build, and manage data pipelines that seamlessly integrate client event data into our data warehouse, creating the foundation for informed business decisions, product development, and security monitoring.
This role offers an extraordinary opportunity to collaborate closely with the researchers developing next-generation AI models and help them train new models to deliver groundbreaking capabilities to customers worldwide. As we continue our rapid growth, we rely heavily on data-driven insights, and your contributions will be instrumental in shaping our strategic direction.
Key Responsibilities
- Design, build, and manage data pipelines: Create and maintain robust data pipelines that ensure all client event data is seamlessly integrated into our data warehouse with minimal latency and maximum reliability.
- Develop canonical datasets: Build and maintain authoritative datasets that track key product metrics including customer growth, engagement, retention, and revenue streams.
- Collaborate across teams: Work collaboratively with various departments including Infrastructure, Data Science, Product, Marketing, Finance, and Research to understand their data needs and deliver tailored solutions.
- Implement fault-tolerant systems: Design and execute resilient and fault-tolerant systems for data ingestion and processing that can handle massive scale and ensure data availability.
- Drive architectural decisions: Participate in data architecture and engineering decisions, bringing strong perspectives and best practices to bear on complex technical challenges.
- Ensure data governance: Maintain the security, integrity, and compliance of all data in accordance with industry standards and company policies.
- Optimize performance: Write, debug, and optimize SQL queries and data processing scripts to ensure efficient data flow and processing.
- Mentor and guide: Provide technical guidance and mentorship to junior team members and contribute to our engineering best practices.
Essential Qualifications
To thrive in this role, you'll need:
- Educational background: Bachelor's degree in Computer Science, Engineering, Mathematics, or a related technical field. Advanced degrees are a plus but not required.
- Programming proficiency: Strong capability in at least one programming language commonly used in Data Engineering, such as Python, Scala, or Java.
- Distributed processing experience: Hands-on experience with distributed processing technologies and frameworks, such as Hadoop, Flink, Spark, and distributed storage systems (e.g., HDFS, S3).
- ETL and scheduling expertise: Proficiency with ETL schedulers like Airflow, Dagster, Prefect, or similar frameworks.
- SQL expertise: Strong understanding of SQL and the ability to write, debug, and optimize complex queries.
- Data modeling skills: Experience with dimensional modeling, data warehousing concepts, and database design.
- Problem-solving abilities: Strong analytical and problem-solving skills with attention to detail.
- Communication skills: Excellent verbal and written communication skills to effectively collaborate with cross-functional teams.
Preferred Qualifications
While not required, the following experiences would make you an even stronger candidate:
- Experience working with real-time data processing systems and stream processing technologies.
- Knowledge of cloud platforms (AWS, GCP, or Azure) and cloud-native data services.
- Familiarity with machine learning workflows and ML infrastructure.
- Experience in the AI/ML or technology industry.
- Understanding of data privacy regulations and best practices (GDPR, CCPA, etc.).
- Experience with data cataloging and governance tools.
- Background in working with large-scale data systems handling petabyte-scale data.
Skills and Competencies
Beyond technical qualifications, we look for candidates who demonstrate:
- Ownership mindset: Takes accountability for deliverables and sees projects through to completion.
- Collaboration spirit: Works effectively with cross-functional teams and values diverse perspectives.
- Innovation drive: Thrives on solving complex problems and is always looking for better solutions.
- Learning agility: Quick to learn new technologies and adapt to evolving requirements.
- Attention to detail: Maintains high standards for data quality and system reliability.
- Communication clarity: Can articulate technical concepts to both technical and non-technical stakeholders.
Career Growth and Learning Opportunities
At arenaflex, we're invested in your professional development. As a Data Engineer here, you'll have access to:
- Technical mentorship: Work alongside some of the brightest minds in AI and data engineering, learning from their expertise and experience.
- Cutting-edge projects: Tackle unprecedented technical challenges at the frontier of AI technology.
- Learning resources: Access to conferences, workshops, training programs, and internal tech talks.
- Career advancement: Clear pathways for career growth into senior technical roles, team lead positions, or specialized domains.
- Cross-functional exposure: Opportunities to work across different teams and gain broad experience in AI research, product development, and business operations.
You'll be working with massive data scales and complex engineering problems that you won't find anywhere else. The skills and experience you gain at arenaflex will position you for long-term success in the rapidly evolving field of AI and data engineering.
Work Environment and Culture
arenaflex offers a dynamic, fast-paced work environment where innovation is encouraged and excellence is expected. We believe in fostering a culture that values:
- Collaboration over competition: We work together to achieve the best outcomes for our mission.
- Safety and responsibility: We build AI systems with safety and human needs at their core.
- Diversity and inclusion: We welcome and value diverse perspectives from all backgrounds.
- Work-life balance: We support your well-being with flexible arrangements and generous time off.
- Continuous learning: We encourage experimentation, learning from failures, and constant improvement.
Our headquarters is located in San Francisco, but we embrace flexible work arrangements to support our team members. We believe in hiring the best talent regardless of location and providing the tools and environment needed for remote and hybrid work success.
Compensation and Benefits
We offer competitive compensation packages that include:
- Attractive salary: $28 per hour for this position, with opportunities for growth based on performance.
- Equity participation: Generous equity grants that allow you to share in arenaflex's success.
- Comprehensive health coverage: Medical, dental, and vision insurance for you and your family.
- Mental health support: Access to mental health resources and wellness programs to support your overall well-being.
- Retirement savings: 401(k) plan with 4% company matching to help you save for the future.
- Generous time off: Unlimited PTO and 18+ company events per year to recharge and connect with colleagues.
- Family support: Paid parental leave (20 weeks) and family-planning support.
- Professional development: Budget for conferences, courses, and learning materials.
- Home office stipend: Support for setting up your remote work environment.
Join Us in Shaping the Future
AI is a transformative technology that has the potential to help humanity tackle some of our greatest challenges—from climate change to disease to economic inequality. At arenaflex, we're building AI that is safe, beneficial, and accessible to all.
If you're passionate about working with data, excited about the possibilities of AI, and want to be part of something bigger than yourself, we want to hear from you. This is more than just a job—it's an opportunity to contribute to one of the most important technological missions of our time.
Come join us at arenaflex and help shape the future of artificial intelligence. Together, we can build technology that benefits all of humanity.
We are an equal opportunity employer and welcome applicants from all backgrounds. We celebrate diversity and are committed to creating an inclusive environment for all employees.