About the Role:
CoreWeave is seeking a passionate and innovative Software Engineer of Network Services to lead the architecture, scaling, and operations of our network services. You'll be tackling the complex challenges of building and operating world-class network services for the most demanding AI workloads on the planet. You will be focusing on achieving the highest levels of service quality, scalability and performance. You will also build comprehensive monitoring and alerting into our network services. You will work closely with your team to execute our network service roadmap, drive innovation, and ensure the reliability, security, and scalability of the CoreWeave Cloud Platform.
β
β
Key Responsibilities
- Develop, and maintain the network services infrastructure for our GPU cloud services, including networking cloud services, host networking, smart NICs, and overlay/underlay networks.
- Collaborate with cross-functional teams, including DevOps, System Administrators, and Cloud Services teams, to ensure seamless integration of network datapath solutions.
- Optimize host network performance, latency, and throughput to meet the demanding requirements of GPU-based workloads.
- Implement and maintain network security protocols and best practices to protect customer data and infrastructure.
- Participate in an on-call rotation, troubleshoot and resolve network-related issues, and provide timely support to the operations team.
- Stay up-to-date with the latest networking technologies, industry trends, and best practices to continually improve our networking infrastructure.
- Participate in code reviews, design discussions, and architecture decisions to ensure high-quality software development practices.
β
β
Qualifications
- Bachelor's or Master's degree in Computer Science, Computer Engineering, or a related field.
- Experience (2+ years) as a Software Engineer, with a focus on networking and cloud infrastructure.
- Expertise in network protocols, routing, and security (TCP/IP, BGP, kernel networking, etc.).
- Proficiency in programming languages such as Golang, Python, or C++.
- Familiarity with software-defined networking (SDN) and network virtualization technologies.
- Experience building and deploying k8s networking and infrastructure, CNIs.
- Experience with k8s operators and controllers is a plus.
- Excellent problem-solving skills and a proactive attitude towards troubleshooting and issue resolution.
- Strong communication skills and the ability to collaborate effectively with cross-functional teams.
- A passion for staying updated on emerging technologies and a commitment to continuous learning.
β