Experienced Staff Software Engineer – High-Performance Computing & Machine Learning Infrastructure (Remote/Hybrid)
Posted 2026-05-05Job Description:
Join arenaflex and Shape the Future of Cloud Computing
Are you ready to be part of a technological revolution? At arenaflex, we're not just building software—we're transforming how billions of people interact with data and each other. Our cutting-edge innovations power some of the most critical computing infrastructure in the world, and we need exceptional engineers like you to help us push the boundaries of what's possible.
As a Staff Software Engineer on our High-Performance Computing (HPC) and Machine Learning (ML) infrastructure team, you'll play a pivotal role in optimizing and developing next-generation solutions that enable organizations worldwide to solve their most complex computational challenges. This is a unique opportunity to work at the intersection of distributed systems, performance engineering, and cloud computing—where your expertise will directly impact how businesses leverage AI and high-performance computing at scale.
Why arenaflex?
At arenaflex, we believe that great software comes from great people. Our culture fosters innovation, collaboration, and continuous learning. We encourage our engineers to think big, take ownership, and explore new ideas across the entire technology stack. When you join arenaflex, you're not just accepting a job—you're embarking on a career where your contributions matter and your growth is our priority.
Our Cloud division is accelerating organizations' ability to transform their businesses with the best infrastructure, platforms, and industry solutions available. We deliver enterprise-grade solutions that leverage arenaflex's state-of-the-art technology—designed for organizations that demand performance, reliability, and scalability. Clients across more than 200 countries and regions trust arenaflex Cloud as their partner to enable innovation and solve their most critical business challenges.
What You'll Do
As a Staff Software Engineer specializing in HPC and ML infrastructure, you will be responsible for driving technical excellence and leading initiatives that push the boundaries of performance optimization. Your work will span the full stack, from kernel-level optimizations to user-space communication libraries and client applications.
Key Responsibilities:
- Full Stack Optimization: Optimize HPC and ML performance on arenaflex Cloud Platform (ACP) infrastructure, including kernel optimization, userspace communication libraries (such as MPI library, libfabric, and NCCL), and client HPC and ML applications to achieve maximum efficiency and throughput.
- Solution Development: Architect and develop cutting-edge HPC and ML solutions on the arenaflex Cloud Platform, working with cross-functional teams to deliver scalable, high-performance systems that meet evolving customer needs.
- Technical Leadership: Set technical direction and standards for a team of talented engineers, providing mentorship, code reviews, and guidance to ensure delivery of high-quality, maintainable solutions.
- System Design: Design, develop, test, deploy, maintain, and improve software solutions that address complex computational challenges, ensuring robustness, scalability, and performance excellence.
- Cross-Team Collaboration: Partner with product managers, other engineering teams, and stakeholders to understand requirements, translate them into technical specifications, and deliver solutions that exceed expectations.
- Innovation: Identify opportunities for innovation and continuous improvement, proposing new approaches and technologies that enhance our HPC and ML offerings.
- Performance Analysis: Conduct thorough performance analysis and profiling to identify bottlenecks and optimization opportunities across the entire system stack.
What We're Looking For
We're seeking an experienced engineer who thrives on complex technical challenges and has a proven track record of delivering high-impact solutions in distributed systems, cloud infrastructure, or high-performance computing environments.
Minimum Qualifications:
- Bachelor's degree in Computer Science, Engineering, or a related technical field, or equivalent practical experience.
- Minimum 2 years of experience in software development with strong proficiency in data structures and algorithms.
- Minimum 2 years of experience building and scaling large-scale distributed systems, wide-reaching frameworks, or networking infrastructure.
- Demonstrated ability to work collaboratively across teams and effectively communicate technical concepts to diverse audiences.
- Strong problem-solving skills with the ability to analyze complex systems and identify root causes of performance issues.
Preferred Qualifications:
- Advanced degree (Master's or PhD) in Engineering, Computer Science, or a related technical field.
- Experience with C++ programming and performance-critical software development.
- Deep expertise in Linux kernel development, including device drivers, kernel subsystems, and memory management.
- Experience with Linux device drivers, networking stacks, and operating system tuning and packaging.
- Strong background in HPC and ML communications, including MPI (Message Passing Interface), collective communication libraries, libfabric, and socket programming.
- Familiarity with Remote Direct Memory Access (RDMA) technologies and high-speed networking protocols.
- Experience optimizing performance for machine learning workloads and understanding of ML frameworks and training/inference pipelines.
- Knowledge of cloud computing platforms and architectures, particularly in HPC and AI/ML contexts.
Skills and Competencies
To excel in this role, you'll need a combination of technical expertise and soft skills that enable you to deliver complex projects while leading and inspiring your team.
- Technical Excellence: Deep understanding of computer architecture, operating systems, networking, and distributed systems principles.
- Programming Proficiency: Strong coding skills in C++ and familiarity with other languages such as Python, Go, or Java.
- Performance Optimization: Ability to profile, analyze, and optimize system performance at all levels of the stack.
- System Design: Experience designing scalable, reliable distributed systems that can handle massive workloads.
- Communication: Excellent verbal and written communication skills to articulate technical decisions and collaborate effectively.
- Leadership: Ability to mentor junior engineers, provide technical guidance, and drive projects to successful completion.
- Adaptability: Comfortable working in a fast-paced environment with evolving requirements and priorities.
- Innovation: Creative thinking and willingness to explore new technologies and approaches to solve challenging problems.
Career Growth and Learning Opportunities
At arenaflex, we invest in your professional development. As a Staff Software Engineer, you'll have access to numerous opportunities to grow your career and expand your skillset:
- Technical Leadership Paths: Advance into principal engineer, distinguished engineer, or fellow roles where you'll influence technology direction at the highest levels.
- Management Tracks: If you're interested in leading teams, we offer engineering management opportunities where you can develop and mentor larger groups.
- Continuous Learning: Access to internal training programs, conferences, certifications, and tuition reimbursement for advanced degrees.
- Cross-Functional Exposure: Opportunities to work on different projects and teams, gaining broad experience across our technology stack.
- Innovation Time: Dedicated time to explore new ideas and contribute to technical innovation initiatives.
- Mentorship Programs: Both receive mentorship from senior engineers and mentor others as you advance in your career.
Work Environment and Culture
At arenaflex, we know that great work happens when people can bring their whole selves to the job. We're committed to creating an inclusive environment where diverse perspectives are celebrated, and everyone has the opportunity to do their best work.
Our engineering culture emphasizes:
- Collaboration over Silos: We break down barriers between teams and encourage open communication and knowledge sharing.
- Data-Driven Decision Making: We use data and metrics to guide our decisions and measure success.
- Customer Focus: Everything we do is driven by understanding and meeting the needs of our customers.
- Work-Life Balance: We support flexible work arrangements, including remote and hybrid options, to help you thrive both professionally and personally.
- Psychological Safety: We foster an environment where team members feel safe to take risks, ask questions, and share ideas without fear of judgment.
Located in Sunnyvale, you'll be in the heart of Silicon Valley—a hub of innovation and technology. While we offer flexibility in work arrangements, our offices provide state-of-the-art facilities, collaborative spaces, and easy access to the vibrant tech community.
Compensation and Benefits
We offer competitive compensation packages that recognize your skills, experience, and contributions. Our comprehensive benefits package includes:
- Competitive Salary: Industry-leading compensation commensurate with experience and expertise.
- Equity: Stock options or equity grants that allow you to share in arenaflex's success.
- Health & Wellness: Comprehensive medical, dental, and vision insurance, plus wellness programs and gym memberships.
- Retirement Plans: 401(k) matching and retirement savings programs.
- Paid Time Off: Generous vacation, sick leave, and parental leave policies.
- Professional Development: Learning budgets, conference attendance, and career development resources.
- Relocation Assistance: For eligible candidates moving to the Sunnyvale area.
- Remote Work Flexibility: Options to work remotely or in a hybrid arrangement based on role requirements and personal preference.
Ready to Make an Impact?
If you're passionate about pushing the limits of what's possible in computing, if you thrive on solving hard technical problems, and if you want to work with a team of talented engineers who share your drive for excellence, then arenaflex is the place for you.
We're looking for engineers who bring new ideas from all areas, including distributed computing, large-scale system design, networking and data storage, security, artificial intelligence, natural language processing, UI design, and mobile technologies. The list continues and is growing always. As an engineer at arenaflex, you'll work on specific projects critical to our needs, with opportunities to switch teams and projects as you and our fast-moving business evolve and advance.
We want our engineers to be versatile, demonstrate leadership qualities, and be excited to take on new challenges across the full stack as we continue to push technology forward. Join us and help shape the future of cloud computing.
Don't miss this opportunity to make your mark. Apply today and take the first step toward an exciting career at arenaflex!