Bio

Generated by
Topline AI
Kallibek Kazbekov is a seasoned data engineer with expertise in ETL data pipelines, data warehouses, and machine learning models. He has experience working with various tools and frameworks, including Apache Spark, Apache Airflow, AWS Glue, and Azure Data Factory. With a Master of Science degree in Civil Engineering and a background in hydrology, Kazbekov has a strong foundation in data analysis and modeling. He is proficient in multiple programming languages, including Python, R, and SQL, and has completed various certifications in data engineering and analytics.

Credentials

  • Data Engineering Nanodegree
    Udacity
    Apr 2021 - Apr 2026
  • Data Engineer with Python
    DataCamp
    Mar 2021 - Apr 2026
  • From Data to Insights with Google Cloud Platform
    Coursera
    Aug 2020 - Apr 2026
  • Big Data with PySpark
    Udemy
    Jun 2020 - Apr 2026
  • AWS Cloud Practitioner
    A Cloud Guru
    Apr 2020 - Apr 2026
  • Tableau Training for Data Science
    Udemy
    Feb 2020 - Apr 2026
  • Data Analyst with Python
    DataCamp
    Jun 2019 - Apr 2026
  • R Programming
    Coursera
    Dec 2018 - Apr 2026
  • AWS Certified Data Analytics - Specialty
    Amazon Web Services (AWS)
    Oct 2023 - Apr 2026
  • Databricks Certified Data Engineer Professional
    Databricks
    Nov 2023 - Apr 2026
  • SnowPro Core Certification
    Snowflake
    Nov 2023 - Apr 2026
  • Microsoft Certified: Azure Data Engineer Associate
    Microsoft
    Oct 2023 - Apr 2026

Experience

  • Vention
    • Tashkent, Uzbekistan
    • Senior Data Engineer
      • Oct 2023 - Present
      • Tashkent, Uzbekistan

      - Design, build, and maintain ETL data pipelines in AWS and Azure using tools and frameworks such as Apache Spark, Apache Airflow, AWS Glue, and Azure Data Factory.
      - Develop and manage data warehouses and data lakes in AWS and Azure using services such as Amazon Redshift, Amazon S3, Amazon Athena, Azure Data Lake Storage Gen2, Azure Synapse, and Databricks.
      - Advise clients on optimizing their existing on-premises or cloud data infrastructure, for example by migrating to the cloud, improving performance, reducing costs, or enhancing security.
      - Collaborate with other data engineers, data analysts, data scientists, and business stakeholders to deliver high-quality data solutions that meet the client’s needs and expectations.
      - Troubleshoot and resolve data-related issues that arise during the project lifecycle.

  • Beeline Uzbekistan
    • Tashkent, Uzbekistan
    • Data Engineer
      • Oct 2022 - Oct 2023
      • Tashkent, Uzbekistan

      - Designed and implemented geo-analytics for small businesses to select optimal locations for their shops based on target clients, resulting in a 15% increase in revenue and a 10% decrease in costs.
      - Deployed credit scoring models for banks using machine learning techniques. The models achieved 90% accuracy and reduced the risk of default by 20%.
      - Deployed and maintained various machine learning models for an in-house adtech product, such as age, gender, churn, income, and location prediction.
      - Managed and processed large-scale data from all over the country using Hadoop, Spark, Hive, PostgreSQL, and ClickHouse, ensuring high performance, scalability, and reliability of the data pipeline.
      - Automated and scheduled ETL jobs using Apache Oozie, reducing the processing time by 40% and the error rate by 15%.
      - Collaborated with a data team of 15 people, delivering high-quality products on time and within budget, and receiving positive feedback from clients and stakeholders.

  • GNIVC
    • Tashkent, Uzbekistan
    • Data Engineer
      • Apr 2021 - Oct 2022
      • Tashkent, Uzbekistan

      - Collaborated with a data team of nine members to develop a tax monitoring tool for the Tax Authority of Uzbekistan using Hadoop, Kafka, Spark, Hive, PostgreSQL, ClickHouse, and Airflow.
      - Designed and implemented scalable and reliable ETL pipelines to process real-time and batch data from various sources across the country, such as receipts, invoices, money transfers, and other tax-related entities.
      - Performed unit testing and quality assurance on the ETL jobs using Spark and ensured data accuracy and integrity.
      - Optimized the data lake and serving layer performance by applying best practices and tuning techniques for Hadoop, Kafka, and PostgreSQL.
      - Contributed to the successful launch and deployment of the tax monitoring tool, which resulted in improved tax compliance, increased revenue collection, and enhanced transparency and accountability for the Tax Authority of Uzbekistan.

  • King County Water District 90
    • Renton, Washington, United States
    • GIS Database Engineer
      • May 2019 - Apr 2021
      • Renton, Washington, United States

      - Updated and maintained GIS database with current and relevant spatial information, ensuring data integrity and accessibility.
      - Analyzed geospatial data using ArcGIS, Python, and SQL to support decision-making and planning for various projects and clients.
      - Developed spatial ETL workflows to automate data conversion, transformation, and validation processes, resulting in improved efficiency and accuracy.
      - Integrated GIS in construction projects to provide spatial analysis, visualization, and reporting capabilities, enhancing project management and delivery.
      - Developed data models to represent spatial relationships, attributes, and constraints, facilitating data analysis and manipulation.
      - Produced web maps using ArcGIS Online and Web AppBuilder to disseminate spatial information and insights to internal and external stakeholders.
      - Monitored data quality using various tools and methods, such as topology, metadata, and error reporting, to identify and resolve data issues and inconsistencies.
      - Conducted GIS/GPS trainings for staff and partners to increase their knowledge and skills in using spatial technologies and applications.

  • Washington State University
    • Pullman, Washington, United States
    • Research Assistant
      • Sep 2018 - May 2020
      • Pullman, Washington, United States

      - Gathered hydrological data from various sources, including government databases, remote sensing technology, and historical records;
      - Developed and fine-tuned machine learning models, such as regression analysis, neural networks, and ensemble methods, to predict stream flow patterns;
      - Implemented feature engineering techniques to improve model performance and accuracy;
      - Utilized programming languages such as Python and R for coding and implementing machine learning algorithms;
      - Employed libraries and frameworks like TensorFlow, Scikit-learn, and Pandas for efficient model building and evaluation;
      - Created visualizations and dashboards to present model results and insights using tools like Matplotlib, Seaborn, and Tableau.

    • Data Intern
      • Sep 2009 - Oct 2009
      • Tashkent, Uzbekistan

      - Assisted with the socio-technical survey on Water User Associations in Central Asia (Kyrgyzstan, Tajikistan, and Uzbekistan);
      - Entered survey data into the SPSS statistical software;
      - Assisted with the analysis of the data.

Education

  • 2018 - 2020
    Washington State University
    Master of Science - MS, Civil Engineering
  • 2018 - 2018
    San Diego State University
    English for Graduate Studies, English
  • 2012 - 2016
    Tashkent Institute of Irrigation and Melioration (TIIM)
    Bachelor's degree, Water Resources and Melioration (Irrigation Engineering)

Industry Focus. “IT Services and IT Consulting”
