Data Pipeline Developer
Job DescriptionJob Description
Job Title: Data Pipeline Developer
Summary:
The Data Pipeline Specialist is responsible for the end-to-end design, development, deployment, and maintenance of robust data pipelines supporting the IRS Enterprise Data Platform (EDP). This role drives the ingestion, transformation, and integration of large-scale, complex datasets from diverse IRS legacy and modern source systems into cloud-based analytic environments. Leveraging advanced expertise in ETL/ELT development, cloud- data engineering (e.g., Databricks, AWS RDS/Postgres, Redshift, MongoDB, DynamoDB), and scripting , the Data Pipeline Specialist ensures data accuracy, security, and continuous availability across transactional, analytical, and API data stores. The Specialist works closely with data engineers, analysts, and project stakeholders to convert business requirements and legacy code (such as PL/SQL, Greenplum, Oracle) into efficient, scalable, and compliant pipelines. This position also supports ongoing operations and maintenance, including troubleshooting, enhancements, and performance tuning, and plays a vital role in enabling advanced analytics, reporting, and real-time data access to empower IRS's IT modernization and analytics goals. The Data Pipeline Specialist ensures all solutions are developed in alignment with IRS standards, OneSDLC, and compliance mandates such as FISMA, FedRAMP, Section 508.
Responsibilities:
- Develop robust, scalable data pipelines for the ingestion, transformation, and load (ETL/ELT) of IRS source system data into the EDP platform (with focus on Databricks, AWS RDS/Postgres, Redshift, MongoDB, DynamoDB, etc.).
- Convert and optimize legacy ETL processes-including complex PL/SQL, Greenplum, and Oracle stored procedures-to modern, scalable cloud- data processing using Databricks and Informatica ETL tools.
- Implement data integration and orchestration workflows ensuring data accuracy, completeness, and integrity across transactional, analytical, and API data stores.
- Develop and document pipeline designs, deployment scripts, test plans, and system interfaces in accordance with EDP and Solution Engineering standards.
- Partner with data engineering and analytics teams to deliver high-quality, production-ready data products supporting advanced analytics and reporting use cases.
- Support the creation of API-based data access services, enabling secure and efficient integration with internal and external IRS programs.
- Perform all required testing (system, integration, performance, Section 508, security), remediate defects
- Participate in operations and maintenance support for pipeline issue resolution and enhancement
Requirements:
- Must have demonstrated development experience with at least 2 project similar in size and scope to transform and load data into Databricks.
- Hold both an active IRS and an active or non-active MBI clearance
Qualifications:
- Prior experience working with the IRS as a contractor or employee