What are the responsibilities and job description for the Data Enrichment Specialist position at The World Bank?
Application Deadline :25th June, 2025
To Submit Your Application:
Interested candidates should send their CV and a letter of interest to Saketh Mudiganti - smudiganti@worldbankgroup.org. The subject line of the email should be " Data Enrichment Specialist". Only shortlisted candidates will be contacted for an interview.
Background:
The organization is seeking a highly skilled and experienced Data Enrichment Specialist to design, develop, and implement a robust data enrichment layer and pipeline. This initiative aims to enhance the value and utility of existing data assets by integrating and transforming information from various sources, specifically focusing on documents stored in SharePoint Online and structured data within Databricks SQL Warehouse. The enriched data will primarily serve as embedding data within Google Cloud's Vertex AI platform, supporting advanced AI/ML applications.
Objective:
The primary objective of this role is to create a scalable and efficient data enrichment framework that transforms raw data from SharePoint Online and Databricks SQL Warehouse into high-quality, enriched datasets suitable for embedding and consumption by Vertex AI. This will involve developing automated pipelines, ensuring data quality, and optimizing the enrichment process for performance and accuracy.
Scope of Work:
• Data Source Analysis and Integration: Extract and integrate data from SharePoint Online and Databricks SQL Warehouse.
• Data Enrichment Layer Design: Develop techniques for text extraction, metadata standardization, and semantic enrichment.
• Embedding Data Pipeline Development: Automate pipelines for embedding data in Vertex AI.
• Documentation and Collaboration: Prepare technical documentation and collaborate with AI/ML teams.
Duties and Responsibilities:
• Analyze data sources and design extraction methods.
• Develop ETL/ELT pipelines using Python and data processing frameworks.
• Apply NLP techniques for text processing.
• Ensure data quality and collaborate with AI/ML teams.
Deliverables:
• Enriched datasets formatted for Vertex AI.
• Technical documentation and regular progress reports.
Qualifications and Experience:
• Experience in data engineering and enrichment.
• Proficiency in Python, SQL, and cloud platforms, especially GCP and Vertex AI.
• Strong problem-solving and communication skills.