Deepyaman Datta

Founding Machine Learning Engineer at Claypot AI
Contact Information
us****@****om
(386) 825-5501
Location
Salt Lake City, Utah, United States

Credentials

  • Building Transformer-Based Natural Language Processing Applications
    NVIDIA
    Jul 2021 - Nov 2024
  • Fundamentals of Deep Learning
    NVIDIA
    Jul 2021 - Nov 2024
  • Triplebyte Certified Data Scientist
    Triplebyte
    Oct 2020 - Nov 2024
  • Triplebyte Certified Machine Learning Engineer
    Triplebyte
    Aug 2020 - Nov 2024

Experience

    • United States
    • Software Development
    • 1 - 100 Employee
    • Founding Machine Learning Engineer
      • Oct 2022 - Present

    • United States
    • Data Infrastructure and Analytics
    • Maintainer
      • Jun 2022 - Present

    • Contributor
      • Aug 2018 - Present

    • United Kingdom
    • IT Services and IT Consulting
    • 300 - 400 Employee
    • Staff Machine Learning Engineer
      • Jul 2021 - Oct 2022

    • Junior Principal Data Engineer
      • Jul 2020 - Jun 2021

      • Served as tech lead for McKinsey’s central COVID-19 epidemiological model, providing clients (US Department of Defense, Commonwealth of Virginia, state of New York, etc.) with data, analyses, and scenarios to help them make informed decisions
      • Led data engineering teams and mentored junior colleagues and clients on engagements across industries (disaster response and recovery, semiconductor R&D effectiveness, agrochemical fermentation yield optimization, airline crew planning and scheduling)
      • Led Data Engineering R&D for North America (defined R&D process with other global leads, drafted and reviewed proposals)

    • Senior Data Engineering Consultant
      • Aug 2018 - Jun 2020

    • Business Consulting and Services
    • 700 & Above Employee
    • Staff Machine Learning Engineer
      • Jul 2021 - Oct 2022

    • Junior Principal Data Engineer
      • Jun 2020 - Jun 2021

    • Senior Data Engineering Consultant
      • Aug 2018 - Jun 2020

    • United States
    • Technology, Information and Internet
    • 700 & Above Employee
    • Research Scientist
      • Dec 2017 - Jun 2018

      • Accelerated unsupervised learning to compute fluid volumes from NMR T1-T2 data using Apache Beam/Spark on Google Cloud
      • Processed and visualized key information from hundreds of millions of data lake records to help other scientists find relevant data

    • Analytics Engineer
      • Aug 2016 - Nov 2017

      • Modeled Google Cloud Platform costs as function of data lake user activity to further understanding and inform pricing decisions
      • Conducted threat modeling assessment of data lake API and Discovery Service; mitigated risk in discovery using Cloud Functions
      • Replicated CircleCI build environment in VSTS using Docker to support GitHub project in compliance with legal restrictions
      • Authored Python packages for parsing well and wellbore designations and for crawling, parsing, and ingesting LAS files into data lake
      • Placed third (out of 20 teams) in We Hackathon: Google Cloud Platform Coding Bootcamp

    • Sweden
    • Telecommunications
    • 700 & Above Employee
    • Data Scientist Intern
      • Jun 2015 - Sep 2015

      • Developed Elasticsearch plugin for Grafana dashboard and graph editor that supports metrics (not only annotations)
      • Built process that feeds network operations center portal data into Graphite numeric time-series data store

    • United States
    • Financial Services
    • 700 & Above Employee
    • Technology Associate
      • Dec 2011 - Aug 2014

      • Reduced firm’s reputational risk by validating XML files to check data integrity before submission to FINRA
      • Attained highest level of standards compliance in firm by introducing TDD, Crucible code review, and Jenkins Violations reports
      • Architected and implemented entitlements framework that supports application- and row-level user entitlements
      • Designed, developed, and integrated framework that lets users drill from aggregated measures to source system for each metric
      • Decreased run time of ETL process for AutoSys job run history data from 6 hours per day to a few seconds every hour by analyzing scripts called by process, providing faster and more complete, up-to-date data
      • Developed IBM Cognos report that indicates whether streams, boxes, and jobs met SLA and graphs historical trends, facilitating identification of potential problems; also includes drill-through functionality between levels
      • Redesigned QlikView summary dashboard to add 30-day trend line and detailed RAG (Red Amber Green) status for each metric

    • Technology Analyst Program
      • Aug 2011 - Nov 2011

      • Built internal website that allows business users to select groups of business entities and their values, thereby removing need for IT to manually analyze subscription requests
      • Received intensive technical training emphasizing UNIX, C++, relational data modeling, Java, and C#

    • United States
    • Financial Services
    • 700 & Above Employee
    • CDP Technology Summer Analyst Development Program
      • Jun 2010 - Aug 2010

      • Developed website to host feed statistics, service level agreements, and contact information for JPMorgan Collateral Management and Legal Documentation; user interface included datepicker for previous feed statistics and tabs
      • Wrote program to automatically map JPMorgan GENESIS feeds to JPMorgan GLOBALNET feeds and extract timestamps for these feeds
      • Created spreadsheets to automatically calculate time difference statistics for feeds, highlight outliers, and calculate new service level agreement times for JPMorgan GENESIS and GLOBALNET

    • Software Development
    • 1 - 100 Employee
    • Undergraduate Student Scholars Program
      • Jun 2008 - May 2010

      • Wrote program to detect TCP servers on network to identify rogue servers
      • Demonstrated vulnerabilities via response time covert channels in Xen virtualization platform
      • Created internal browser-based application for employees to look up network address translation of internal or external address
      • Used Google Web Toolkit to design linked data explorer that searches online JSON data
      • Implemented tools to quickly identify suspicious network activity and extract key information from network address translation records
      • Developed utility to convert data from MATLAB-based format to ARFF for Weka machine learning software

    • India
    • IT Services and IT Consulting
    • 700 & Above Employee
    • InStep: Infosys’ Global Internship Program
      • Jun 2009 - Aug 2009

      • Identified OCL extended with regular expressions as appropriate language to represent constraints on IDEF1X-based models
      • Created Infosys InFlux-to-Eclipse Modeling Framework conversion software to support language
      • Developed part of automatic test data generation suite that interfaces between business rule representation and data generation API

Education

  • Stanford University
    Master's Degree, Computer Science
    2014 - 2016
  • The University of Texas at Austin
    Bachelor of Science with Honors, Electrical and Computer Engineering
    2007 - 2011
  • The University of Texas at Austin
    Bachelor of Arts with High Honors, Economics
    2007 - 2011
