📍
Mountain View, CA

Software Engineer - Lakeflow PhD Candidates

1 year experience
Technology & Digital
Software engineering
Posted:
December 29, 2025

Databricks

Data intelligence platform for businesses
77.5
Palpable Score
Apply >view company >

Databricks is radically simplifying the entire data lifecycle, from ingestion to generative AI and everything in-between. We’re doing it cross-cloud with a unified platform, serving over 10k customers, processing exabytes of data/day on 15+ million VMs, and growing exponentially.

The Lakeflow team is looking for recent PhD graduates. Lakeflow team includes products like Apache Spark™ Structured Streaming, Delta Live Tables (DLT), and Materialized Views. Apache Spark™ Structured Streaming is one the world’s most popular streaming engines. DLT makes it easy to build and manage reliable batch and streaming data pipelines that deliver high-quality data on the Databricks Lakehouse Platform. DLT helps data engineering teams simplify ETL (extract-transform-load) development and management with declarative pipeline development, automatic data testing, and deep visibility for monitoring and recovery. DLT optimizes pipeline execution by logical optimization through query transformations, and physical optimization such as instance type selection and vertical/horizontal autoscaling.

Moreover, as part of DLT, we have a new catalyst optimization layer, Eenzyme, designed specifically to speed up the ETL process and make declarative ETL computation possible by incrementally computing and materializing the intermediate results. Enzyme can create and keep up-to-date a materialization of the results of a given query stored in a Delta table. Enzyme does this by using a cost model to choose between a variety of techniques that borrow from traditional literature on the maintenance of materialized views, delta-to-delta streaming, and manual ETL patterns commonly used by our customers.

As a part of the LakeflowDLT team, there are opportunities to design and implement in many areas that leapfrog existing systems:

What We Look For:

© Databricks 2017. All rights reserved. Apache, Apache Spark, Spark and the Spark logo are trademarks of the Apache Software Foundation. Privacy Policy | Terms of Use

Pay Range Transparency

Databricks is committed to fair and equitable compensation practices. The pay range(s) for this role is listed below and represents the expected salary range for non-commissionable roles or on-target earnings for commissionable roles.  Actual compensation packages are based on several factors that are unique to each candidate, including but not limited to job-related skills, depth of experience, relevant certifications and training, and specific work location. Based on the factors above, Databricks anticipates utilizing the full width of the range. The total compensation package for this position may also include eligibility for annual performance bonus, equity, and the benefits listed above. For more information regarding which range your location is in visit our page here.

Local Pay Range

$142,200—$204,600 USD

About the company

Databricks

Company overview
Databricks builds a cloud platform used by data teams to run analytics, data engineering, and AI workloads in one place, often described as a “lakehouse” approach. The company sells to enterprises that want to unify data pipelines, BI, and machine learning on major clouds. Databricks is closely associated with Apache Spark’s origins and supports open-source projects used widely in data engineering and ML workflows. The company’s customers range from large enterprises to fast-growing tech companies building data products.

Locations and presence

Databricks is headquartered in San Francisco and lists many offices across the Americas, EMEA, and Asia-Pacific. The company describes a hybrid setup built around weekly “Team Day” in-office time, with most work happening from home, and notes that some roles are fully remote.

Palpable Score

77.5
/ 100
Databricks offers clear, recurring entry points for students and new grads, and the company backs that up with cohort-style roles and explicit mentorship language in early-career job postings. Databricks also publishes candidate-facing guidance on the hiring process and shares pay bands on at least some early-career roles, which is a practical signal of transparency. The main limiter is mixed public candidate and employee feedback on consistency, especially around responsiveness and clarity of progression, and the lack of published early-career outcomes data.
view full company profile >

Related jobs

📍
United Kingdom
Cleo
Commercial Associate
January 22, 2026
view job >
📍
Warsaw, Poland
Cleo
Product Engineer - Ruby | Poland
January 22, 2026
view job >
📍
United Kingdom
Cleo
Graduate Machine Learning Engineer
January 22, 2026
view job >
📍
United Kingdom
Cleo
Data Analyst, Fraud
January 22, 2026
view job >