
Databricks Data Engineer

Capgemini
McLean, VA | Full Time
Posted on 1/28/2026 | Closed on 2/26/2026

What are the responsibilities and job description for the Databricks Data Engineer position at Capgemini?

Capgemini Government Solutions is seeking an experienced Databricks Data Engineer to lead the migration of existing Pentaho (Kettle) ETL workflows to a modern Databricks-based ETL architecture, with downstream data loads into Salesforce. This role requires strong hands-on engineering skills, migration experience, and the ability to translate legacy ETL logic into scalable, cloud-native data pipelines.

The ideal candidate is comfortable working independently, partnering with stakeholders, and delivering production-ready pipelines in a secure and governed environment.

Job Responsibilities:

  • Analyze and reverse-engineer existing Pentaho ETL jobs and transformations
  • Design and implement Databricks ETL pipelines using:
    • Apache Spark (PySpark / Spark SQL)
    • Databricks Workflows / Jobs
    • Delta Lake
  • Re-platform ETL logic including:
    • Data extraction from relational sources (PostgreSQL, SQL Server, etc.)
    • File-based ingestion (CSV, ZIP, SFTP workflows)
    • Transformations, validations, and enrichment
  • Implement data loading strategies into Salesforce, including:
    • Salesforce APIs (Bulk API, REST API)
    • Handling large volumes, retries, and error handling
    • Incremental vs full loads
  • Optimize pipelines for performance, scalability, and cost
  • Implement logging, monitoring, and data quality checks
  • Support testing, validation, and parallel runs during migration
  • Produce technical documentation for migrated pipelines
  • Collaborate with architects, platform teams, and business stakeholders

Required Qualifications:

  • U.S. citizenship is required
  • Eligible to obtain and maintain a government security clearance
  • 5 years of experience in data engineering / ETL development
  • Strong hands-on experience with Databricks
  • Proficiency in PySpark and Spark SQL
  • Prior experience migrating from legacy ETL tools (Pentaho, Informatica, Talend, SSIS, etc.)
  • Experience integrating with Salesforce as a target system
  • Solid understanding of:
    • ETL/ELT design patterns
    • Data modeling and transformation logic
    • Error handling and restartability
  • Experience working with relational databases
  • Strong troubleshooting and problem-solving skills

Preferred Qualifications:

  • Direct experience migrating Pentaho (Kettle) to Databricks
  • Experience with Salesforce data models (objects, relationships, limits)
  • Familiarity with:
    • CI/CD for data pipelines
    • Git-based source control
    • Cloud platforms (AWS preferred)
  • Experience in regulated or compliance-driven environments (FedRAMP, HIPAA, etc.)
  • Knowledge of data governance, lineage, and auditability

About Capgemini

Capgemini is a global business and technology transformation partner, helping organizations accelerate their dual transition to a digital and sustainable world while creating tangible impact for enterprises and society. It is a responsible and diverse group of 340,000 team members in more than 50 countries. With its strong heritage of more than 55 years, Capgemini is trusted by its clients to unlock the value of technology to address the entire breadth of their business needs. It delivers end-to-end services and solutions, leveraging strengths from strategy and design to engineering, all fueled by its market-leading capabilities in AI, generative AI, cloud, and data, combined with its deep industry expertise and partner ecosystem. The Group reported 2024 global revenues of €22.1 billion.

Get the future you want | www.capgemini.com

Disclaimer

All qualified applicants will be considered for employment based on their skills and merit.

Please be aware that Capgemini may capture your image (video or screenshot) during the interview process and that image may be used for verification, including during the hiring and onboarding process.

Applicants for employment in the US must have valid work authorization that does not now and/or will not in the future require sponsorship of a visa for employment authorization in the US by Capgemini.

Capgemini discloses salary range information in compliance with state and local pay transparency obligations. The disclosed range represents the lowest to highest salary we, in good faith, believe we would pay for this role at the time of this posting, although we may ultimately pay more or less than the disclosed range, and the range may be modified in the future. The disclosed range takes into account the wide range of factors that are considered in making compensation decisions, including, but not limited to, geographic location, relevant education, qualifications, certifications, experience, skills, seniority, performance, sales or revenue-based metrics, and business or organizational needs. At Capgemini, it is not typical for an individual to be hired at or near the top of the range for their role. The base salary range for the tagged location is $150,000 - $170,000.

This role may be eligible for other compensation, including variable compensation, bonus, or commission. Full-time regular employees are eligible for paid time off, medical/dental/vision insurance, 401(k), and any other benefits offered to eligible employees.

Note: No amount of pay is considered to be wages or compensation until such amount is earned, vested, and determinable. The amount and availability of any bonus, commission, or any other form of compensation allocable to a particular employee remain in the Company's sole discretion unless and until paid and may be modified at the Company's sole discretion, consistent with the law.

Equal Opportunity Employer/Protected Veterans/Individuals with Disabilities

This employer is required to notify all applicants of their rights pursuant to federal employment laws.
For further information, please review the Know Your Rights notice from the Department of Labor.

Salary: $150,000 - $170,000

This job has expired.
