About
Highly experienced Database Engineer with over 5 years in designing, developing, and administering robust ETL processes for complex data warehouses. A strong IT professional with a solid mathematical background, adept at optimizing data pipelines, enhancing data quality, and driving significant improvements in data accessibility and performance.
Work
→
Summary
Spearheading critical data migration and ETL optimization initiatives to enhance data infrastructure and performance.
Highlights
Led the migration of existing ETL processes from Teradata to Databricks, ensuring seamless data transition and system compatibility.
Developed and optimized complex ETL processes, significantly enhancing data flow efficiency and reliability.
Achieved substantial reductions in query numbers and data waiting times, resulting in decreased data handling costs and reduced operational complexity.
Conducted qualitative optimization of data content, minimizing errors and anomalies to improve data accuracy and integrity.
Implemented efficient data processing methods, including filtering, formatting, and delivery, to significantly enhance data accessibility and utilization for stakeholders.
→
Summary
Contributed to data orchestration and database management, focusing on comprehensive data schema implementation and performance optimization.
Highlights
Utilized Python and Apache Airflow as a data orchestrator, and Microsoft SQL as the primary database, for complex data projects.
Implemented a comprehensive data schema covering all stages of data acquisition, extraction, cleansing, transformation, integration, loading, and replication, enabling a client's store network to optimize goods arrangement.
Developed efficient data refinement and delivery processes, significantly improving client access to sales statistics and enhancing usability for faster insights.
Engineered robust SQL code, stored procedures, and triggers that substantially contributed to the high performance and data integrity of the project.
Languages
Russian
Native
English
Proficient
Skills
Data Platforms
DataBricks, Greenplum, Oracle, MS SQL.
Programming Languages
Python, PL/SQL, SQL.
ETL & Data Processing
ETL, PySpark, Apache Airflow, SAS DIS, Data Quality.
DevOps & Version Control
Docker, Git, Jenkins.