What youโll do
Every record in our data warehouse is vitally important for the businesses that use Stripe, so weโre looking for people with a strong background in software engineering and data to help us scale while maintaining correct and complete data. Youโll be working with a variety of internal teams across Product, Data Science, and GTM to help them solve their data needs. Your work will provide visibility into how these stakeholders and the Data Foundations organization are performing and how we can deliver a better experience to Stripe's customers.
โ
โ
Responsibilities
- Design, develop, and own data pipelines, models, and products that power the Product, Data Science, and GTM functions
- Develop strong subject matter expertise and manage the SLAs for both data pipelines and full stack web applications that support these critical stakeholders
- Build and refine Stripe's data foundations - infrastructure, pipelines, and tools to enable various teams at Stripe - working with Scala, Spark, and Airflow
- Leverage LLM and Agents at scale to produce high-quality data on ambiguous problems
- Refine our existing data marts that help the GTM organization forecast the future potential performance of the business and reliably measure ongoing attainment toward targets
- Build data services that track key product metrics and measure the impact of different strategies employed by teams in the field
- Our tech stack is Spark, Scala, Java, SQL, and Python - and while we donโt expect everyone on the team to be an expert in all of these, you will work across all of these technologies throughout your tenure on the team
โ
โ
Who you are
Weโre looking for someone who meets the minimum requirements to be considered for the role. If you meet these requirements, you are encouraged to apply. The preferred qualifications are a bonus, not a requirement.
โ
โ
Minimum requirements
- 2 - 5 years of experience in a Software Engineering role, with a focus on building and maintaining data services, or data-intensive applications.
- A strong engineering background and are interested in data
- Prior experience with writing and debugging data pipelines using a distributed data framework (Spark / Hadoop / Pig etc)
- An inquisitive nature in diving into data inconsistencies to pinpoint issues, and resolve deep rooted data quality issues
- Knowledge of a backend development language (such as Scala, Java, or Go) and strong SQL experience
- The ability to communicate cross-functionally, derive requirements and architect shared datasets
- A strong engineering background and an interest in data
โ
โ
Preferred requirements
- Experience creating and maintaining Data Marts to power business reporting needs
- Experience working with Product or GTM (Sales/Marketing) teams
โ