What you can expect
The AI Infra team at Zoom is dedicated to building a world-class inference infrastructure that powers all of Zoom's AI services. Our mission is to deliver high efficiency, scalability, and cost optimization across a wide range of AI applications, including large language models (LLMs), vision-language models (VLMs), automatic speech recognition (ASR), and machine translation (MT). We focus on seamless collaboration between small and large models, ensuring cost-effective, privacy-preserving, and high-quality AI services for our customers.
About the Team
As an AI Software Engineer on Zoom's AI Infra team, you will design, optimize, and scale the runtimes and services that power our AI models. Your work will directly improve efficiency, reduce latency, and lower costs across Zoom's AI stack, ensuring reliable, high-performance AI experiences for millions of users.
Responsibilities
- Develop and optimize AI runtimes for LLMs, ASR, and MT systems with a focus on performance and cost efficiency.
- Apply GPU-level optimization techniques including CUDA, kernel fusion, and memory throughput improvements.
- Implement inference optimizations such as torch.compile, graph optimization, KV caching, and continuous batching.
- Build scalable, highly available infrastructure services to support enterprise-grade AI workloads.
- Optimize models for edge devices (laptops, PCs, and mobile devices) as well as large-scale cloud deployments.
- Continuously improve latency, throughput, and efficiency across serving pipelines.
- Rapidly integrate and optimize new industry models to stay ahead in AI infrastructure.
What we're looking for
- Track record of building scalable, reliable AI infrastructure under real-world production constraints.
- Strong expertise in GPU programming and optimization (CUDA, kernel-level development).
- Deep experience with transformer-based models and inference frameworks (vLLM, TensorRT-LLM, SGLang, ONNX Runtime).
- Proficiency in Python and C++ (Java is a plus).
- Hands-on experience with PyTorch (torch.compile, graph-level optimization) and/or TensorFlow.
- Knowledge of low-level hardware concepts (GPU memory hierarchy, caching, vectorization).
- Familiarity with cloud platforms (AWS, GCP, Azure) and AI deployment tools (Docker, Kubernetes, MLflow).