Больше информации по резюме будет доступно после регистрации

Зарегистрироваться
Was more than two weeks ago

Male, 53 years, born on 15 October 1972

Moscow, metro station Belyaevo, готов работать удалённо, not prepared for business trips

Data Engineer

Specializations:
  • Programmer, developer

Employment type: full time

Work experience 28 years 9 months

September 2022June 2025
2 years 10 months
Softline

Russia, www.softline.com

IT, System Integration, Internet... Show more

Data Engineer
- Implemented feature preparation and cleaning pipelines using Apache Kafka, Apache Spark (Python, Scala) and Apache Flink (Java) to ensure high-throughput, low-latency input data for ML models. - Developed backend services and REST APIs in Java/Spring and Python/FastAPI to operationalize ML models (Python, MLFlow) and streaming analytics (Prometheus, Grafana) in production environments. - Designed and implemented 20+ prototypes and PoCs for presales, showcasing innovative solutions in Big Data, ML, streaming, and distributed backend architectures. - Created Data Lake(Hadoop) or ETL over MPP (GreenPlum) using Spark (Python, Scala), Hadoop, Kafka with orchestration in Airflow and data layers in Hive, Clickhouse and PostgreSQL. - Integrated ML model training, tracking, and deployment workflows using MLflow, Jupyter notebooks, Python/scipy/pandas/fastapi , supporting versioned experimentation and reproducible model lifecycle management. - Deployed and managed containerized workloads with Docker, Kubernetes, Helm, and Terraform across hybrid (on-premise and Yandex Cloud) infrastructures. Domains: Financial, Industrial Tech stack: Java, Python, Scala, Spark, Flink, Kafka, Airflow, MLflow, Prometheus, Grafana, PostgreSQL, ClickHouse, Greenplum, Docker, Kubernetes, Terraform, Yandex Cloud
January 2020September 2022
2 years 9 months
GNIVC

gnivc.ru

IT, System Integration, Internet... Show more

Data Engineer
- Architected and developed a distributed data processing system for financial analytics, enabling large-scale tax data aggregation, transformation, and reporting. - Optimized Spark/Hadoop batch workflows and Kafka streaming pipelines, improving processing performance by 2–10×. - Designed and implemented backend services, batch and streaming pipelines, integrating Java, Scala, and Python jobs for high-throughput financial data processing. - Built data storage for structured and unstructured data with Hadoop, Apache Hive, Oracle, PostgreSQL, ElasticSearch, and HBase for scalable query performance. - Worked over technical architecture decisions and collaborated with cross-functional teams on system optimization, governance, and production readiness. Tech stack: Java, Scala, Python, Kafka, Spark, Hadoop, Hive, Airflow, PostgreSQL, ElasticSearch, HBase Domain: Financial / Tax Analytics
January 2017January 2020
3 years 1 month

Saint Petersburg, www.firstlinesoftware.com

IT, System Integration, Internet... Show more

Senior Software Architect
Architected and implemented large-scale financial data processing systems for the Russian Federal Tax Service. - Designed backend services, data lake, and distributed processing architectures on on-premise Hadoop/Spark clusters, ensuring scalability and reliability. - Developed and optimized ETL pipelines and batch/streaming workflows using Spark, Flink, and Java-based backend services. - Worked over technical design and architecture decisions, collaborating with customer's teams to deliver maintainable, high-performance platforms. Tech stack: Java, Scala, Python, Spark, Flink, Kafka, Hive, Linux Domain: Financial
April 2014January 2017
2 years 10 months

Moscow, www.aplana.ru

IT, System Integration, Internet... Show more

Senior Software Architect
- Architected and implemented a distributed analytics platform for mobile and GPS signal processing using Hadoop, Spark, Kafka, Java, and Scala. - Developed backend services and APIs in Java to support data ingestion, processing, and analytics workflows. - Built data pipelines to model traffic patterns and population movement, integrating batch and near-real-time processing. - Deployed and managed analytics clusters on AWS EMR/EC2, optimizing performance. - Designed scalable, fault-tolerant backend systems supporting high-throughput distributed data processing. Tech stack: Java, Scala, Python, Spark, Hadoop, Hive, MongoDB, Redis, AWS Domain: Telecom
November 2012April 2014
1 year 6 months

Moscow, www.microtest.ru

IT, System Integration, Internet... Show more

Lead Software Developer
Design and development System-112 monitoring solution for EMERCOM of Russia. Java EE 6, JBoss AS, PostgreSQL, Spring MVC, Apache Tiles, jax-ws, jax-rs, JSP, javascript, jquery, html, css, Linux Ubuntu Server Design and development System-112 integration solutions for EMERCOM of Russia. Java EE 6, GlassFish, PostgreSQL, Spring MVC, jax-ws, jax-rs, JSP, javascript, jquery, html, css, Linux Ubuntu Server Development System-112 operator work place. Java EE 6, GlassFish, PostgreSQL, Spring MVC, jax-ws, jax-rs, JSP, javascript, jquery, html, css, Linux Ubuntu Server Refactoring EMERCOM of Russia dictinaries system Java EE 6, JBoss AS, JSF, PostgreSQL, javascript. Banking system design and development
May 2012August 2012
4 months
Fujitsu Australia Ltd

Australia, www.fujitsu.com/au/

Software Developer
Designed and developed back office for http://cityofsydney.nsw.gov.au government web site Integrated back office with existing customer’s system Developed front office modules Designed and implemented deployment procedure
October 2011February 2012
5 months
WithinReach Software

Australia, withinreach.com.au

IT, System Integration, Internet... Show more

Software Developer
Short 4 months contract Developed www.gadens.com.au, public portal for Gadens Lawyers company Developed Active Directory integrated document compliance solution for Sims Metals company Worked both at customer's site and on house
October 2006June 2011
4 years 9 months

Moscow, www.aplana.ru

IT, System Integration, Internet... Show more

Senior Software Developer
Worked in outsourcing department in distributed teams for American, English customers. Participated in more then 10 projects. Developed a central part of BOSS-REFERENT document processing system Developed social network website http://tabup.com Performed large amount of R&D in projects Implemented new technologies in projects, improved development process.
May 2005October 2006
1 year 6 months

Moscow, www.sitronics.com

IT, System Integration, Internet... Show more

Software Developer
Designed and developed LB module for FORIS system - enterprise telecommunications solution for mobile and fixed phone operators. Designed, developed and migrated databases (Microsoft SQL Server, Oracle) Designed and developed CRM module for FORIS system.
October 2000April 2005
4 years 7 months
Institute of SNG Countries

Moscow, materik.ru

Government Organizations... Show more

Software Developer
Implement business solutions for the organization Gathered requirements from internal customers Managed scope, time, risk in projects for internal customers Deploy solutions to internal customer Consulted internal customers
March 1996October 2000
4 years 8 months

Moscow, www.esc.ru

IT, System Integration, Internet... Show more

Software Developer
R&D, prototyping and creating module for banking automation system, Participating in development of solution for Russian Railways resource management system Writing application modules. Worked with technologies: C++, MsVisualC, Win32API, Assembler, Delphi.

Skills

Skill proficiency levels
Big Data
Apache Spark
Apache Hadoop
Java
Scala
Python
Apache Kafka
Spring Boot
Yandex Cloud
Kubernetes
Docker
pandas
SQL

About me

Github https://github.com/dmitrypukhov Over the last 10+ years, I’ve specialized in Backend distributed systems, Big Data, and data engineering for industrial, financial, and government clients, leveraging Python, Java, Kafka, Spark, Flink, and Hadoop among others. My work spans architecture, development, and on premises or cloud deployment(Docker, Kubernetes, AWS, Yandex Cloud, Terraform) Core strengths: - Distributed backend / microservices/ data lake / streaming/ warehouses achitectures and implementations in Java/Python - Streaming & batch data pipelines (Kafka, Flink, Spark, Hadoop, Airflow) among others - Scalable, low-latency, high-throughput data systems - Cloud & DevOps: Kubernetes, Terraform, AWS, Yandex Cloud - R&D, and prototyping for Big Data & ML based presales

Higher education

2000
Higher education
Moscow institute of radio, electronics and automatics (MIREA)
Cybernetics, Mathematician-Engineer

Languages

Russian — Native

English — C2 — Proficiency

Citizenship, travel time to work

Citizenship: Russia

Permission to work: Russia

Desired travel time to work: Doesn't matter