Kartikey Mullick

Senior Data Engineer at pass_by
  • Claim this Profile
Contact Information
us****@****om
(386) 825-5501
Location
Spain, ES

Topline Score

Topline score feature will be out soon.

Bio

Generated by
Topline AI

You need to have a working account to view this content.
You need to have a working account to view this content.

Credentials

  • Neo4j Certified Professional
    Neo4j
    Aug, 2019
    - Nov, 2024

Experience

    • United States
    • Technology, Information and Internet
    • 1 - 100 Employee
    • Senior Data Engineer
      • Feb 2023 - Present

      • Led data governance initiatives to improve data observability and reliability. • Delivered complex geospatial data solutions, owning the underlying mullticloud data infrastructure.

    • Data Engineer
      • Nov 2021 - Feb 2023

      • Architect and maintain fault-tolerant, resilient, and scalable Airflow pipelines in GCP, processing 1 TB/day of spatiotemporal data. • Manage delivery of data products following industry standard data engineering practices, ensuring efficient data processing and storage. • Seamlessly integrate GCP services like Dataproc and BigLake to process big data while optimizing costs. • Drive transition of data infrastructure to modern data stack. • Produce comprehensive… Show more • Architect and maintain fault-tolerant, resilient, and scalable Airflow pipelines in GCP, processing 1 TB/day of spatiotemporal data. • Manage delivery of data products following industry standard data engineering practices, ensuring efficient data processing and storage. • Seamlessly integrate GCP services like Dataproc and BigLake to process big data while optimizing costs. • Drive transition of data infrastructure to modern data stack. • Produce comprehensive documentation and analytic reports, delivering concise summaries and in- sights to stakeholders. • Conduct thorough evaluations of third-party data handling solutions to ensure alignment with internal needs and stakeholder requirements.

    • Norway
    • Software Development
    • 1 - 100 Employee
    • Data Engineer
      • Oct 2020 - Oct 2021

      • Designed and executed a robust GIS data pipeline in GCP using DataFlow, optimizing data storage in zarr format. • Employed SQL and Python to process and analyze data efficiently with Dask and xarray. • Implemented a separate pipeline for integrating new sources of GIS data into GCS from rasters and shapefiles, ensuring optimal GIS data format. • Contributed to backend development for web applications by writing optimized Cloud Functions to process GIS data. • Designed and executed a robust GIS data pipeline in GCP using DataFlow, optimizing data storage in zarr format. • Employed SQL and Python to process and analyze data efficiently with Dask and xarray. • Implemented a separate pipeline for integrating new sources of GIS data into GCS from rasters and shapefiles, ensuring optimal GIS data format. • Contributed to backend development for web applications by writing optimized Cloud Functions to process GIS data.

    • India
    • Information Technology & Services
    • 1 - 100 Employee
    • Data Scientist
      • May 2019 - Aug 2021

      • Spearheaded the development of data orchestration pipelines using Airflow in AWS to facilitate on- demand data transfers from various sources to AWS Redshift. • Pioneered the creation of deep learning models for executing natural language test cases on web applications. • Leveraged data analysis techniques to develop a sophisticated malware detection model, processing 500 TB of data stored in AWS Redshift. • Spearheaded the development of data orchestration pipelines using Airflow in AWS to facilitate on- demand data transfers from various sources to AWS Redshift. • Pioneered the creation of deep learning models for executing natural language test cases on web applications. • Leveraged data analysis techniques to develop a sophisticated malware detection model, processing 500 TB of data stored in AWS Redshift.

    • India
    • Consumer Services
    • Data Analyst
      • Aug 2018 - Apr 2019

      • Wrote and optimized SQL queries for managing and monitoring electric bike fleet databases in Post- greSQL. • Utilized the Metabase data visualization tool to optimize operating costs and increase revenue. • Analyzed streaming data from GPS sensors mounted on all e-bikes in the fleet. • Wrote and optimized SQL queries for managing and monitoring electric bike fleet databases in Post- greSQL. • Utilized the Metabase data visualization tool to optimize operating costs and increase revenue. • Analyzed streaming data from GPS sensors mounted on all e-bikes in the fleet.

    • United States
    • Software Development
    • 700 & Above Employee
    • Software Engineer
      • Jan 2018 - Jun 2018

      • Developed, tested, and deployed a data backup integrity feature from scratch in Java for an enterprise data management software product. • Worked in an Agile software development environment, utilizing Git for version control. • Delivered clear and concise progress updates in weekly presentations. • Developed, tested, and deployed a data backup integrity feature from scratch in Java for an enterprise data management software product. • Worked in an Agile software development environment, utilizing Git for version control. • Delivered clear and concise progress updates in weekly presentations.

Education

  • Birla Institute of Technology and Science, Pilani
    Postgraduate Diploma, Machine Learning and AI
    2019 - 2020
  • Birla Institute of Technology and Science, Pilani
    Bachelor of Engineering - BE, Electronics and Instrumentation engineering
    2013 - 2017

Community

You need to have a working account to view this content. Click here to join now