Kartikey Mullick
Senior Data Engineer at pass_by
Credentials
-
Neo4j Certified Professional
Neo4j, Aug 2019 - Nov 2024
Experience
-
pass_by
-
United States
-
Technology, Information and Internet
-
1 - 100 Employees
-
Senior Data Engineer
-
Feb 2023 - Present
• Led data governance initiatives to improve data observability and reliability. • Delivered complex geospatial data solutions, owning the underlying multicloud data infrastructure.
-
-
Data Engineer
-
Nov 2021 - Feb 2023
• Architect and maintain fault-tolerant, resilient, and scalable Airflow pipelines in GCP, processing 1 TB/day of spatiotemporal data. • Manage delivery of data products following industry-standard data engineering practices, ensuring efficient data processing and storage. • Integrate GCP services like Dataproc and BigLake to process big data while optimizing costs. • Drive transition of data infrastructure to modern data stack. • Produce comprehensive documentation and analytic reports, delivering concise summaries and insights to stakeholders. • Conduct thorough evaluations of third-party data handling solutions to ensure alignment with internal needs and stakeholder requirements.
-
-
-
Glint Solar
-
Norway
-
Software Development
-
1 - 100 Employees
-
Data Engineer
-
Oct 2020 - Oct 2021
• Designed and executed a robust GIS data pipeline in GCP using Dataflow, optimizing data storage in Zarr format. • Employed SQL and Python to process and analyze data efficiently with Dask and xarray. • Implemented a separate pipeline for integrating new sources of GIS data into GCS from rasters and shapefiles, ensuring optimal GIS data format. • Contributed to backend development for web applications by writing optimized Cloud Functions to process GIS data.
-
-
-
CoreView Systems Private Limited
-
India
-
Information Technology & Services
-
1 - 100 Employees
-
Data Scientist
-
May 2019 - Aug 2021
• Spearheaded the development of data orchestration pipelines using Airflow in AWS to facilitate on-demand data transfers from various sources to AWS Redshift. • Pioneered the creation of deep learning models for executing natural language test cases on web applications. • Leveraged data analysis techniques to develop a sophisticated malware detection model, processing 500 TB of data stored in AWS Redshift.
-
-
-
LEAP
-
India
-
Consumer Services
-
Data Analyst
-
Aug 2018 - Apr 2019
• Wrote and optimized SQL queries for managing and monitoring electric bike fleet databases in PostgreSQL. • Utilized the Metabase data visualization tool to optimize operating costs and increase revenue. • Analyzed streaming data from GPS sensors mounted on all e-bikes in the fleet.
-
-
-
Veritas Technologies LLC
-
United States
-
Software Development
-
700 & Above Employees
-
Software Engineer
-
Jan 2018 - Jun 2018
• Developed, tested, and deployed a data backup integrity feature from scratch in Java for an enterprise data management software product. • Worked in an Agile software development environment, utilizing Git for version control. • Delivered clear and concise progress updates in weekly presentations.
-
-
Education
-
Birla Institute of Technology and Science, Pilani
Postgraduate Diploma, Machine Learning and AI
Birla Institute of Technology and Science, Pilani
Bachelor of Engineering - BE, Electronics and Instrumentation Engineering