Ashish Yadav

Lead Machine Learning Engineer at Spectrum Labs
  • Claim this Profile
Contact Information
us****@****om
(386) 825-5501
Location
Seattle, US

Topline Score

Topline score feature will be out soon.

Bio

Generated by
Topline AI

5.0

/5.0
/ Based on 2 ratings
  • (2)
  • (0)
  • (0)
  • (0)
  • (0)

Filter reviews by:

Eric Finkel

It was an absolute pleasure working alongside Ashish while at Spectrum. Ending up in similar roles but having come from different backgrounds, I learned a ton about high quality software and data engineering from him. You can tell that Ashish cares immensely about the quality of his work. He is very thoughtful around how he designs the tools he builds and is meticulous around producing high quality code. He is not only technically very adept but also a very supportive teammate that significantly contributes to company culture. I wish him all the best in his career and hope we get to work together again in the future!

LinkedIn User

I am delighted to write a recommendation for my talented colleague and friend, Ashish Yadav, who has proven to be an exceptional professional in the fields of Machine Learning and Artificial Intelligence. I have had the pleasure of working closely with Ashish for one year and I still like to have technical discussions with him after all these years. During the time I have known him, I have been consistently impressed by his technical expertise, dedication, and ability to deliver outstanding results. Ashish possesses a deep understanding of machine learning concepts and algorithms, which he applies with precision and creativity to solve complex problems. His proficiency in Python, C, C++ and Java allows him to develop robust and efficient solutions, while his knowledge of Hadoop, Spark, and Scala enables him to tackle big data and data engineering challenges with ease. Ashish has an impressive track record of successfully implementing machine learning models, optimizing algorithms, and building scalable data processing pipelines. What truly sets him apart is his exceptional problem-solving skills and analytical mindset. He has an innate ability to identify key insights from complex datasets and translate them into actionable recommendations. He is not only proficient in the technical aspects of his work but also possesses strong communication skills, making him an excellent team player and collaborator. He is always eager to share his knowledge, provide guidance, and assist others in achieving goals. During our time working together, I have witnessed his commitment to excellence and his relentless pursuit of learning and growth. He stays up-to-date with the latest advancements in technology and continuously seek out new challenges to expand his skill set. His passion for work is infectious and inspires those around him. He would definitely prove to be an invaluable asset to any organization he joins.

You need to have a working account to view this content.
You need to have a working account to view this content.

Experience

    • United States
    • Technology, Information and Internet
    • 1 - 100 Employee
    • Lead Machine Learning Engineer
      • Jul 2021 - Present

      • Spearheaded the development of cutting-edge AI and ML solutions to combat online toxicity on gaming platforms, resulting in an impressive 85% average reduction in online abuse complaints by the clients. Collaborated closely with stakeholders to understand unique gaming industry challenges, enabling the design of targeted AI models and algorithms for precise detection and effective mitigation of toxic behavior. (Python, Scala, PyTorch, Weights & Biases, Argo, Docker, Kubernetes) •… Show more • Spearheaded the development of cutting-edge AI and ML solutions to combat online toxicity on gaming platforms, resulting in an impressive 85% average reduction in online abuse complaints by the clients. Collaborated closely with stakeholders to understand unique gaming industry challenges, enabling the design of targeted AI models and algorithms for precise detection and effective mitigation of toxic behavior. (Python, Scala, PyTorch, Weights & Biases, Argo, Docker, Kubernetes) • Enhanced data extraction and metrics generation by architecting and implementing a cloud-based scalable backend service. Successfully automated the pipeline, resulting in a remarkable reduction in data processing and analysis time by 80%, while eliminating the manual overhead of data extraction, scoring, and metrics generation. (Microservices, gRPC-based APIs, Snowflake) • Engineered and implemented feature extraction pipelines to unlock the hidden potential of raw data, improving the accuracy and efficiency of online toxicity detection models and achieving a precision enhancement from 70% to 90%. (Python, Pandas, NumPy) • Led the establishment of an ML tools ecosystem by transforming diverse ML tools into importable Python packages, integrating multiple tools, and automating workflows, developing Python libraries to remove code duplication across the organization and enhancing team productivity by minimizing manual effort. • Crafted a cutting-edge web scraping tool, enabling efficient data extraction from diverse audio and video platforms and generating audio files. This groundbreaking solution paved the way for the establishment of a dedicated unit within the company, specializing in Audio Abuse Detection using AI-based learning solutions. (Python, PyTorch) • Exemplified strong management and mentoring skills by guiding and mentoring junior engineers & interns, fostering the adoption of best practices in SDLC & design patterns. Show less • Spearheaded the development of cutting-edge AI and ML solutions to combat online toxicity on gaming platforms, resulting in an impressive 85% average reduction in online abuse complaints by the clients. Collaborated closely with stakeholders to understand unique gaming industry challenges, enabling the design of targeted AI models and algorithms for precise detection and effective mitigation of toxic behavior. (Python, Scala, PyTorch, Weights & Biases, Argo, Docker, Kubernetes) •… Show more • Spearheaded the development of cutting-edge AI and ML solutions to combat online toxicity on gaming platforms, resulting in an impressive 85% average reduction in online abuse complaints by the clients. Collaborated closely with stakeholders to understand unique gaming industry challenges, enabling the design of targeted AI models and algorithms for precise detection and effective mitigation of toxic behavior. (Python, Scala, PyTorch, Weights & Biases, Argo, Docker, Kubernetes) • Enhanced data extraction and metrics generation by architecting and implementing a cloud-based scalable backend service. Successfully automated the pipeline, resulting in a remarkable reduction in data processing and analysis time by 80%, while eliminating the manual overhead of data extraction, scoring, and metrics generation. (Microservices, gRPC-based APIs, Snowflake) • Engineered and implemented feature extraction pipelines to unlock the hidden potential of raw data, improving the accuracy and efficiency of online toxicity detection models and achieving a precision enhancement from 70% to 90%. (Python, Pandas, NumPy) • Led the establishment of an ML tools ecosystem by transforming diverse ML tools into importable Python packages, integrating multiple tools, and automating workflows, developing Python libraries to remove code duplication across the organization and enhancing team productivity by minimizing manual effort. • Crafted a cutting-edge web scraping tool, enabling efficient data extraction from diverse audio and video platforms and generating audio files. This groundbreaking solution paved the way for the establishment of a dedicated unit within the company, specializing in Audio Abuse Detection using AI-based learning solutions. (Python, PyTorch) • Exemplified strong management and mentoring skills by guiding and mentoring junior engineers & interns, fostering the adoption of best practices in SDLC & design patterns. Show less

    • United States
    • Biotechnology
    • 700 & Above Employee
    • Engineering Manager, Data Engineering
      • Jan 2021 - Jul 2021

      • Architected and developed robust database architecture on AWS Redshift and successfully led the migration of large-scale healthcare data assets of about 400 GB from multiple hospitals data streams to AWS Redshift, optimizing backend support for data queries, improving data access capabilities, and reducing operation cost by 30%. • Demonstrated exceptional managerial acumen, leveraging strong communication skills to foster productive collaboration with stakeholders across all… Show more • Architected and developed robust database architecture on AWS Redshift and successfully led the migration of large-scale healthcare data assets of about 400 GB from multiple hospitals data streams to AWS Redshift, optimizing backend support for data queries, improving data access capabilities, and reducing operation cost by 30%. • Demonstrated exceptional managerial acumen, leveraging strong communication skills to foster productive collaboration with stakeholders across all organizational levels. Mentored interns and junior engineers and creating a nurturing environment for their professional growth and development.

    • Lead Software Developer, Data Engineering
      • Jan 2018 - Dec 2020

      • Designed and developed time sensitive COVID data pipelines, providing critical information during the pandemic and contributing to Regeneron's COVID therapeutic medicine development. (Spark, Scala, AWS, Databricks) • Implemented ETL pipelines for UK Biobank healthcare data (100 GB from 500K patients’ data), enabling comprehensive disease research. Transformed raw clinical data into regression-friendly matrix formats, enhancing drug and target discovery confidence. (Spark, Scala, AWS… Show more • Designed and developed time sensitive COVID data pipelines, providing critical information during the pandemic and contributing to Regeneron's COVID therapeutic medicine development. (Spark, Scala, AWS, Databricks) • Implemented ETL pipelines for UK Biobank healthcare data (100 GB from 500K patients’ data), enabling comprehensive disease research. Transformed raw clinical data into regression-friendly matrix formats, enhancing drug and target discovery confidence. (Spark, Scala, AWS, Databricks) • Created interactive web applications with rich data visualizations, analyzing clinical data (2 million patients), exploring comorbidities, and deriving insights for drug development. (Spark, Scala, Play) • Spearheaded Keras based image analysis pipeline for diagnosing fundus images from UK Biobank, leading to the establishment of a new clinical research unit at Regeneron Genetics Center. (Python, PyTorch) • Developed Spark streaming workflows to capture real-time application logs, enabling further analysis, anomaly detection, and alerting. (Spark, Scala) • Established a robust CI/CD pipeline using Atlassian Bamboo and AWS, automating internal application build and deployment, significantly improving efficiency and reliability.

    • Software Developer
      • Jul 2016 - Jan 2018

      • Developed the Gene Variants web application, enabling interactive exploration of gene mutations across approximately 1 million patients’ genetic records. (Spark, Scala, Play Framework) • Implemented a comprehensive mapping system to link diseases with unique concept codes, establishing a dictionary/ontology for bi-directional searching of related diseases. (Spark, Scala) • Leveraged Spark and Scala to analyze association results between diseases and gene mutations in a dataset… Show more • Developed the Gene Variants web application, enabling interactive exploration of gene mutations across approximately 1 million patients’ genetic records. (Spark, Scala, Play Framework) • Implemented a comprehensive mapping system to link diseases with unique concept codes, establishing a dictionary/ontology for bi-directional searching of related diseases. (Spark, Scala) • Leveraged Spark and Scala to analyze association results between diseases and gene mutations in a dataset of over 1.5 million patients, facilitating the prediction of potential drug targets and gain valuable insights. • Established continuous data integration processes to incorporate new patient records from diverse collaborators across the globe, ensuring up-to-date and comprehensive data analysis capabilities. Implemented automated testing environments and pipelines to generate dynamic test datasets, enabling thorough functionality testing of data integration. (Spark, Scala)

    • United States
    • Biotechnology
    • 700 & Above Employee
    • Software Developer(Big Data) Intern
      • Feb 2016 - May 2016

      • Created an optimized version of the AWS web console for internal use, employing the Django Framework and various AWS services, elevating user experience and productivity. • Performed data analysis on large-scale genomic datasets using Spark on Amazon EMR, harnessing distributed computing capabilities to derive meaningful insights from genomic data. • Created an optimized version of the AWS web console for internal use, employing the Django Framework and various AWS services, elevating user experience and productivity. • Performed data analysis on large-scale genomic datasets using Spark on Amazon EMR, harnessing distributed computing capabilities to derive meaningful insights from genomic data.

    • Higher Education
    • 700 & Above Employee
    • Software Developer (Part-time)
      • Oct 2015 - Dec 2015

      Worked as a software developer on the project for finding out best locations for creating solar farms all over U.S.A., with ClearGrid Energy, under the SPIKE fellowship offered by New York University. Worked as a software developer on the project for finding out best locations for creating solar farms all over U.S.A., with ClearGrid Energy, under the SPIKE fellowship offered by New York University.

    • United States
    • Biotechnology
    • 700 & Above Employee
    • Software Developer Intern
      • Jun 2015 - Aug 2015

      • Enhanced search functionality in MacOS by developing Mac widgets, integrating JavaScript, jQuery, and JSON for seamless authentication with valid user credentials. • Implemented sharding and clustering techniques on MongoDB collections, boosting insertion in the database performance by 36% for 25 million rows. • Enhanced search functionality in MacOS by developing Mac widgets, integrating JavaScript, jQuery, and JSON for seamless authentication with valid user credentials. • Implemented sharding and clustering techniques on MongoDB collections, boosting insertion in the database performance by 36% for 25 million rows.

  • ClearGrid Innovations
    • Greater New York City Area
    • Software Developer
      • Mar 2015 - May 2015

      Developed the Python equivalent for a multivariate regression model which aims to increase the awareness for use of solar energy in United States households by predicting the solar energy requirement and its availability in each and every place all over the Unites States of America. Developed the Python equivalent for a multivariate regression model which aims to increase the awareness for use of solar energy in United States households by predicting the solar energy requirement and its availability in each and every place all over the Unites States of America.

    • Telecommunications
    • 200 - 300 Employee
    • Software Developer
      • Aug 2013 - Aug 2014

      • Automated Access-Lists command testing modules using Spirent's iTest automation tool, eliminating manual annotation and significantly improving efficiency and accuracy. • Devised a comprehensive high-level design for Access-Lists implementation on the Winpath architecture, utilizing Parser and Classifier Engine to develop scripts that seamlessly executed the design. Developed the High Level Design for implementation of Access-Lists on Winpath architecture using Parser and… Show more • Automated Access-Lists command testing modules using Spirent's iTest automation tool, eliminating manual annotation and significantly improving efficiency and accuracy. • Devised a comprehensive high-level design for Access-Lists implementation on the Winpath architecture, utilizing Parser and Classifier Engine to develop scripts that seamlessly executed the design. Developed the High Level Design for implementation of Access-Lists on Winpath architecture using Parser and Classifier Engine and developed the scripts to implement it. Show less • Automated Access-Lists command testing modules using Spirent's iTest automation tool, eliminating manual annotation and significantly improving efficiency and accuracy. • Devised a comprehensive high-level design for Access-Lists implementation on the Winpath architecture, utilizing Parser and Classifier Engine to develop scripts that seamlessly executed the design. Developed the High Level Design for implementation of Access-Lists on Winpath architecture using Parser and… Show more • Automated Access-Lists command testing modules using Spirent's iTest automation tool, eliminating manual annotation and significantly improving efficiency and accuracy. • Devised a comprehensive high-level design for Access-Lists implementation on the Winpath architecture, utilizing Parser and Classifier Engine to develop scripts that seamlessly executed the design. Developed the High Level Design for implementation of Access-Lists on Winpath architecture using Parser and Classifier Engine and developed the scripts to implement it. Show less

    • India
    • IT Services and IT Consulting
    • 1 - 100 Employee
    • Software Developer Internship
      • Dec 2012 - Jan 2013

      Developed and implemented intranet which was aimed to increase the interaction among employees and management. Implemented different modules including document sharing, issue tracking, employee management, version control to manage various tasks. Developed and implemented intranet which was aimed to increase the interaction among employees and management. Implemented different modules including document sharing, issue tracking, employee management, version control to manage various tasks.

  • Air India
    • New Delhi Area, India
    • Software Developer Internship
      • Jun 2012 - Jul 2012

      Developed IT Department portal which contained modules for Administrator and Department Staff providing different authorizations. Implemented forum for discussion, chat functionality, login requirement and database was designed and integrated with the application code. Developed IT Department portal which contained modules for Administrator and Department Staff providing different authorizations. Implemented forum for discussion, chat functionality, login requirement and database was designed and integrated with the application code.

Education

  • Massachusetts Institute of Technology
    MicroMasters degree, Statistics and Data Science
    2022 - 2024
  • New York University, Courant Institute of Mathematical Sciences
    Master of Science - MS, Computer Science
    2014 - 2016
  • Delhi College of Engineering (now Delhi Technological University)
    Bachelor of Engineering (B.E.), Computer Engineering
    2009 - 2013

Community

You need to have a working account to view this content. Click here to join now