Deepyaman Datta
Founding Machine Learning Engineer at Claypot AI- Claim this Profile
Click to upgrade to our gold package
for the full feature experience.
Topline Score
Bio
Credentials
-
Building Transformer-Based Natural Language Processing Applications
NVIDIAJul, 2021- Nov, 2024 -
Fundamentals of Deep Learning
NVIDIAJul, 2021- Nov, 2024 -
Triplebyte Certified Data Scientist
TriplebyteOct, 2020- Nov, 2024 -
Triplebyte Certified Machine Learning Engineer
TriplebyteAug, 2020- Nov, 2024
Experience
-
Claypot AI
-
United States
-
Software Development
-
1 - 100 Employee
-
Founding Machine Learning Engineer
-
Oct 2022 - Present
-
-
-
Kedro
-
United States
-
Data Infrastructure and Analytics
-
Maintainer
-
Jun 2022 - Present
-
-
Contributor
-
Aug 2018 - Present
-
-
-
QuantumBlack, AI by McKinsey
-
United Kingdom
-
IT Services and IT Consulting
-
300 - 400 Employee
-
Staff Machine Learning Engineer
-
Jul 2021 - Oct 2022
-
-
Junior Principal Data Engineer
-
Jul 2020 - Jun 2021
• Played tech lead for McKinsey’s central COVID-19 epidemiological model, providing clients (US Department of Defense, Commonwealth of Virginia, state of New York, etc.) with data, analyses, and scenarios to help them make informed decisions • Lead data engineering teams and mentored junior colleagues and clients on engagements across industries (disaster response and recovery, semiconductor R&D effectiveness, agrochemical fermentation yield optimization, airline crew planning and… Show more • Played tech lead for McKinsey’s central COVID-19 epidemiological model, providing clients (US Department of Defense, Commonwealth of Virginia, state of New York, etc.) with data, analyses, and scenarios to help them make informed decisions • Lead data engineering teams and mentored junior colleagues and clients on engagements across industries (disaster response and recovery, semiconductor R&D effectiveness, agrochemical fermentation yield optimization, airline crew planning and scheduling) • Lead Data Engineering R&D for North America (defined R&D process with other global leads, drafted and reviewed proposals)
-
-
Senior Data Engineering Consultant
-
Aug 2018 - Jun 2020
-
-
-
McKinsey & Company
-
Business Consulting and Services
-
700 & Above Employee
-
Staff Machine Learning Engineer
-
Jul 2021 - Oct 2022
-
-
Junior Principal Data Engineer
-
Jun 2020 - Jun 2021
-
-
Senior Data Engineering Consultant
-
Aug 2018 - Jun 2020
-
-
-
SLB
-
United States
-
Technology, Information and Internet
-
700 & Above Employee
-
Research Scientist
-
Dec 2017 - Jun 2018
• Accelerated unsupervised learning to compute fluid volumes from NMR T1-T2 data using Apache Beam/Spark on Google Cloud • Processed and visualized key information from hundreds of millions of data lake records to help other scientists find relevant data
-
-
Analytics Engineer
-
Aug 2016 - Nov 2017
• Modeled Google Cloud Platform costs as function of data lake user activity to further understanding and inform pricing decisions • Conducted threat modeling assessment of data lake API and Discovery Service; mitigated risk in discovery using Cloud Functions • Replicated CircleCI build environment in VSTS using Docker to support GitHub project in compliance with legal restrictions • Authored Python packages for well and wellbore designation parsing and crawling, parsing, and ingesting… Show more • Modeled Google Cloud Platform costs as function of data lake user activity to further understanding and inform pricing decisions • Conducted threat modeling assessment of data lake API and Discovery Service; mitigated risk in discovery using Cloud Functions • Replicated CircleCI build environment in VSTS using Docker to support GitHub project in compliance with legal restrictions • Authored Python packages for well and wellbore designation parsing and crawling, parsing, and ingesting LAS files into data lake • Placed third (out of 20 teams) in We Hackathon: Google Cloud Platform Coding Bootcamp
-
-
-
Ericsson
-
Sweden
-
Telecommunications
-
700 & Above Employee
-
Data Scientist Intern
-
Jun 2015 - Sep 2015
• Developed Elasticsearch plugin for Grafana dashboard and graph editor that supports metrics (not only annotations) • Built process that feeds network operations center portal data into Graphite numeric time-series data store • Developed Elasticsearch plugin for Grafana dashboard and graph editor that supports metrics (not only annotations) • Built process that feeds network operations center portal data into Graphite numeric time-series data store
-
-
-
Morgan Stanley
-
United States
-
Financial Services
-
700 & Above Employee
-
Technology Associate
-
Dec 2011 - Aug 2014
• Reduced firm’s reputational risk by validating XML files to check data integrity before submission to FINRA • Attained highest level of standards compliance in firm by introducing TDD, Crucible code review, and Jenkins Violations reports • Architected and implemented entitlements framework that supports application- and row-level user entitlements • Designed, developed, and integrated framework that lets users drill from aggregated measures to source system for each… Show more • Reduced firm’s reputational risk by validating XML files to check data integrity before submission to FINRA • Attained highest level of standards compliance in firm by introducing TDD, Crucible code review, and Jenkins Violations reports • Architected and implemented entitlements framework that supports application- and row-level user entitlements • Designed, developed, and integrated framework that lets users drill from aggregated measures to source system for each metric • Decreased run time of ETL process for AutoSys job run history data from 6 hours per day to few seconds every hour by analyzing scripts called by process, providing faster and more complete, up-to-date data • Developed IBM Cognos report that indicates whether streams, boxes, and jobs met SLA and graphs historical trends, facilitating identification of potential problems; also includes drill-through functionality between levels • Redesigned QlikView summary dashboard to add 30-day trend line and detailed RAG (Red Amber Green) status for each metric
-
-
Technology Analyst Program
-
Aug 2011 - Nov 2011
• Built internal website that allows business users to select groups of business entities and their values, thereby removing need for IT to manually analyze subscription requests • Received intensive technical training emphasizing UNIX, C++, relational data modeling, Java, and C#
-
-
-
JPMorgan Chase & Co.
-
United States
-
Financial Services
-
700 & Above Employee
-
CDP Technology Summer Analyst Development Program
-
Jun 2010 - Aug 2010
• Developed website to host feed statistics, service level agreements, and contact information for JPMorgan Collateral Management and Legal Documentation; user interface included datepicker for previous feed statistics and tabs • Wrote program to automatically map JPMorgan GENESIS feeds to JPMorgan GLOBALNET feeds and extract timestamps for these feeds • Created spreadsheets to automatically calculate time difference statistics for feeds, highlight outliers, and calculate new service… Show more • Developed website to host feed statistics, service level agreements, and contact information for JPMorgan Collateral Management and Legal Documentation; user interface included datepicker for previous feed statistics and tabs • Wrote program to automatically map JPMorgan GENESIS feeds to JPMorgan GLOBALNET feeds and extract timestamps for these feeds • Created spreadsheets to automatically calculate time difference statistics for feeds, highlight outliers, and calculate new service level agreement times for JPMorgan GENESIS and GLOBALNET Show less • Developed website to host feed statistics, service level agreements, and contact information for JPMorgan Collateral Management and Legal Documentation; user interface included datepicker for previous feed statistics and tabs • Wrote program to automatically map JPMorgan GENESIS feeds to JPMorgan GLOBALNET feeds and extract timestamps for these feeds • Created spreadsheets to automatically calculate time difference statistics for feeds, highlight outliers, and calculate new service… Show more • Developed website to host feed statistics, service level agreements, and contact information for JPMorgan Collateral Management and Legal Documentation; user interface included datepicker for previous feed statistics and tabs • Wrote program to automatically map JPMorgan GENESIS feeds to JPMorgan GLOBALNET feeds and extract timestamps for these feeds • Created spreadsheets to automatically calculate time difference statistics for feeds, highlight outliers, and calculate new service level agreement times for JPMorgan GENESIS and GLOBALNET Show less
-
-
-
Applied Research Labs
-
Software Development
-
1 - 100 Employee
-
Undergraduate Student Scholars Program
-
Jun 2008 - May 2010
• Wrote program to detect TCP servers on network to identify rogue servers • Demonstrated vulnerabilities via response time covert channels in Xen virtualization platform • Created internal browser-based application for employees to look up network address translation of internal or external address • Used Google Web Toolkit to design linked data explorer that searches online JSON data • Implemented tools to quickly identify suspicious network activity and extract key information… Show more • Wrote program to detect TCP servers on network to identify rogue servers • Demonstrated vulnerabilities via response time covert channels in Xen virtualization platform • Created internal browser-based application for employees to look up network address translation of internal or external address • Used Google Web Toolkit to design linked data explorer that searches online JSON data • Implemented tools to quickly identify suspicious network activity and extract key information from network address translation records • Developed utility to convert data from MATLAB-based format to ARFF for Weka machine learning software Show less • Wrote program to detect TCP servers on network to identify rogue servers • Demonstrated vulnerabilities via response time covert channels in Xen virtualization platform • Created internal browser-based application for employees to look up network address translation of internal or external address • Used Google Web Toolkit to design linked data explorer that searches online JSON data • Implemented tools to quickly identify suspicious network activity and extract key information… Show more • Wrote program to detect TCP servers on network to identify rogue servers • Demonstrated vulnerabilities via response time covert channels in Xen virtualization platform • Created internal browser-based application for employees to look up network address translation of internal or external address • Used Google Web Toolkit to design linked data explorer that searches online JSON data • Implemented tools to quickly identify suspicious network activity and extract key information from network address translation records • Developed utility to convert data from MATLAB-based format to ARFF for Weka machine learning software Show less
-
-
-
Infosys
-
India
-
IT Services and IT Consulting
-
700 & Above Employee
-
InStep: Infosys’ Global Internship Program
-
Jun 2009 - Aug 2009
• Identified OCL extended with regular expressions as appropriate language to represent constraints on IDEF1X-based models • Created Infosys InFlux-to-Eclipse Modeling Framework conversion software to support language • Developed part of automatic test data generation suite that interfaces between business rule representation and data generation API • Identified OCL extended with regular expressions as appropriate language to represent constraints on IDEF1X-based models • Created Infosys InFlux-to-Eclipse Modeling Framework conversion software to support language • Developed part of automatic test data generation suite that interfaces between business rule representation and data generation API
-
-
Education
-
Stanford University
Master's Degree, Computer Science -
The University of Texas at Austin
Bachelor of Science with Honors, Electrical and Computer Engineering -
The University of Texas at Austin
Bachelor of Arts with High Honors, Economics