This job might no longer be available.
Senior Data Engineer - Experimentation
1 year ago
Every day, tens of millions of people come to Roblox to explore, create, play, learn, and connect with friends in 3D immersive digital experiences– all created by our global community of developers and creators.
At Roblox, we’re building the tools and platform that empower our community to bring any experience that they can imagine to life. Our vision is to reimagine the way people come together, from anywhere in the world, and on any device. We’re on a mission to connect a billion people with optimism and civility, and looking for amazing talent to help us get there.
A career at Roblox means you’ll be working to shape the future of human interaction, solving unique technical challenges at scale, and helping to create safer, more civil shared experiences for everyone.
At Roblox, a deep understanding and measurement of users and creators' experience is critical to Roblox's rapid growth. The experimentation platform is enabling Roblox's success by empowering other teams at Roblox to use data and experiments to improve user acquisition, engagement, and retention.
We are building an industry leading platform and tools that allow other teams and our developers to run and automatically analyze experiments at scale. As a Data Engineer on experimentation you should be familiar with supporting Experimentation or structured ETL frameworks, and should leverage that knowledge to inform your design decisions and implementations. Our team's product will act as the interface between those defining experiments and the results of the variants across the Roblox experimentation platform. Scale is of the utmost importance as Roblox continues it's development of our world class experimentation data platform.
You Have:
- A B.Sc. equivalent in CS or sufficient experience.
- 5+ years of professional experience working with scalable ETL pipelines on industry standard ETL orchestration tools (such as. Airflow, Google Cloud Composer, AWS Step Functions or, Azure Data Factory)
- Data Engineering experience at scale (petabytes++) dealing with multiple datasets and tight SLAs
- An understanding of Data Engineering as a function deeply and have war stories to prove it
- Expertise SQL and have a deep understanding of how queries are processed by your data warehouse or database
- Expertise in one of Hive, Presto or Spark and understand how it works behind the scenes. Bonus if you are a committer to one of these projects
- Strong coding ability in at least one programming language - you are a programmer first and a data engineer next
- You have a solid understanding of compute and storage infrastructure in the cloud (AWS, GCP or Azure)
You May Have:
- Deep understanding of Airflow, data modeling at scale, Kubernetes
- Developed with Data Quality at the core of your pipelines (e.g. Great Expectations, Data Fold, etc.)
- Developed or enhanced ETL orchestrations tools
- Used statistical optimization techniques to reduce overall computation complexity
- Worked within standard GitOps workflow (branch and merge, PRs, CI / CD systems)
- Familiarity with infrastructure configuration (IaC [e.g. Terraform], cluster parameter tuning, service parameter tuning)
You Will:
- Partner with our experimentation platform team to build and scale our experimentation data processing workflow.
- Design an extensible and scalable data model to support the experimental analysis and reporting
- Design, build, and maintain efficient and reliable data pipelines in batch and streaming to fuel the core data sets
- Build ETL Frameworks to scale and extend the sheer volume of experiments that are processed
- Analyze the use cases for the data to determine appropriate SLAs
- Analyze the incoming data and upstream pipelines to determine and minimize epistemological issues.
- Determine appropriate relaxations to deterministic compute where appropriate and leverage probabilistic data structures (bloom filters, count min sketch)
- Partner with the Data Platform Team to provide approximation algorithms (approximate nearest neighbor, etc.) for high use statistics of interest.
- Determine optimal caching strategies and eviction policies to support cost effective analysis
- Drive adoption of the Core Data tables and publicize new incoming datasets to ensure consistency across the organization
For roles that are based at our headquarters in San Mateo, CA: The starting base pay for this position is as shown below. The actual base pay is dependent upon a variety of job-related factors such as professional background, training, work experience, location, business needs and market demand. Therefore, in some circumstances, the actual salary could fall outside of this expected range. This pay range is subject to change and may be modified in the future. All full-time employees are also eligible for equity compensation and for benefits.
Annual Salary Range
$233,840 — $283,780 USD
You’ll Love:
- Industry-leading compensation package
- Excellent medical, dental, and vision coverage
- A rewarding 401k program
- Flexible vacation policy
- Roflex - Flexible and supportive work policy
- Roblox Admin badge for your avatar
- At Roblox HQ:
- Free catered lunches five times a week and several fully stocked kitchens with unlimited snacks
- Onsite fitness center and fitness program credit
- Annual CalTrain Go Pass
Roblox provides equal employment opportunities to all employees and applicants for employment and prohibits discrimination and harassment of any type without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state or local laws. This policy applies to all terms and conditions of employment, including recruiting, hiring, placement, promotion, termination, layoff, recall, transfer, leaves of absence, compensation and training.
Create Your Profile — Game companies can contact you with their relevant job openings.