Yinghai Yu

Data Scientist at GrubMarket Inc.
  • Claim this Profile
Contact Information
us****@****om
(386) 825-5501
Location
San Francisco Bay Area
Languages
  • Chinese Native or bilingual proficiency
  • English Full professional proficiency
  • Japanese Elementary proficiency

Topline Score

Topline score feature will be out soon.

Bio

Generated by
Topline AI

You need to have a working account to view this content.
You need to have a working account to view this content.

Credentials

  • SQL (Advanced)
    HackerRank
    Sep, 2020
    - Nov, 2024
  • SQL (Intermediate)
    HackerRank
    Sep, 2020
    - Nov, 2024
  • SQL Basic
    HackerRank
    Sep, 2020
    - Nov, 2024
  • Apache Spark
    Udemy
    Jul, 2020
    - Nov, 2024
  • AWS Cloud Practitioner Essentials
    Amazon Web Services (AWS)
    Dec, 2019
    - Nov, 2024
  • AWS Concepts
    Linux Academy
    Dec, 2019
    - Nov, 2024
  • Exam Readiness AWS Certified Solutions Architect - Associate
    Amazon Web Services (AWS)
    Dec, 2019
    - Nov, 2024
  • Convolutional Neural Networks in TensorFlow
    Coursera
    May, 2019
    - Nov, 2024
  • Neural Networks and Deep Learning
    Coursera
    May, 2019
    - Nov, 2024
  • Practical Time Series Analysis
    Coursera
    May, 2019
    - Nov, 2024
  • Best Practices for Data Warehousing with Amazon Redshift
    Amazon Web Services (AWS)
    Apr, 2019
    - Nov, 2024
  • Machine Learning and Data Science
    来Offer (LaiOffer)
    Aug, 2018
    - Nov, 2024
  • AWS Certified Solutions Architect – Associate
    Amazon Web Services (AWS)
    Mar, 2020
    - Nov, 2024
  • Amazon Web Services Solutions Architect Associate
    Amazon Web Services (AWS)
    Mar, 2020
    - Nov, 2024

Experience

    • Technology, Information and Internet
    • 1 - 100 Employee
    • Data Scientist
      • Jun 2022 - Present

      ● Designed reliable data pipeline to make sure the accuracy and correctness of transactions happened at subsidiaries. ● Set up connections between AWS S3 and Snowflakes to perform data process procedures using SQL queries. ● Designed reliable data pipeline to make sure the accuracy and correctness of transactions happened at subsidiaries. ● Set up connections between AWS S3 and Snowflakes to perform data process procedures using SQL queries.

    • United States
    • Information Technology & Services
    • 1 - 100 Employee
    • Data Scientist
      • Nov 2020 - Jun 2022

      ● Initiated VDOT sponsored AI-DSS project and established AWS architecture for ML models ● Managed Traffic Network Optimization project and coached data science team members● Lead Cloud-based Covid-19 Website Development and ML Ops for Virginia Department of Health

    • Software Engineer
      • Nov 2019 - Oct 2020

      ● Mentored interns and defined algorithm for reconciling ACH files between user inputs and bank records using Java.● Collaborated closely with cross-functional teams to design and implement reconciliation algorithms on the App● Coordinated with team members and managers on unit testing for the App development life cycle and reviewed code.● Benchmarked the algorithm’s performance and improved algorithm efficiency by 17%.

    • China
    • Software Development
    • 700 & Above Employee
    • Data Science Intern
      • Jun 2019 - Aug 2019

      ● Conducted a market survey on the market share and DAU of mobile APP stores including Tencent APP store. ● Wrote SQL scripts to query data from the database, and sorted app stores according to downloads and DAU, MAU. ● Visualized the queried data in dashboard using Power BI, generated survey reports and submitted them to product managers. ● Conducted a market survey on the market share and DAU of mobile APP stores including Tencent APP store. ● Wrote SQL scripts to query data from the database, and sorted app stores according to downloads and DAU, MAU. ● Visualized the queried data in dashboard using Power BI, generated survey reports and submitted them to product managers.

    • United States
    • IT Services and IT Consulting
    • 1 - 100 Employee
    • Quantitative Strategist
      • Aug 2018 - Dec 2018

      Conducted statistical analysis and contributed web crawler code for company’s upcoming Sustainable Development Goals. ● Established a web crawler framework to scrape greenhouse gas emission and large time series data from websites in PyCharm. ● Built up data pipeline in web crawler framework to receive and store the scraped data in local database. ● Implemented exploratory data analysis with R ggplot2 to obtain business insights after querying data from database. ● Performed markov regime switching model for GDP nowcasting and reduced short-term prediction error by ~3%. ● Delivered report and key findings to team leader and gained approval for participating in further visualization using plotly. Show less

    • China
    • Higher Education
    • 1 - 100 Employee
    • Research Assistant
      • Jan 2017 - Jul 2017

      ● Wrote SQL scripts to query China highway mileage data from ArcGIS geodatabase and handled missing values. ● Defined weighted average travel time and asserted its significant spatial autocorrelation by calculating Moran’s Index. ● Conducted Spatial Autoregression Model to evaluate economic impacts of highway accessibility gain using ArcGIS. ● Calculated highway accessibility of Jiangsu Province and assessed model’s robustness and found the reduction of passengers weighted average travel time can lead to economic growth. ● Discovered that the improvement of economic growth due to accessibility gain is between 0.15B USD to 0.22B USD. Show less

  • Weihong Airport
    • Shanghai City, China
    • Data Analyst Intern
      • Aug 2016 - Dec 2016

      • Participated in the establishment of Ground Services Network 2.0 Project for the Ground Services of China Eastern Airlines. • Extracted, transformed, and loaded (ETL) data given by the company using MySQL and Python.implemented data cleaning and visualize traffic congestion by generating heat map using Python. • Reduced airport traffic delay by 3% by applying Dijkstra algorithm. • Participated in the establishment of Ground Services Network 2.0 Project for the Ground Services of China Eastern Airlines. • Extracted, transformed, and loaded (ETL) data given by the company using MySQL and Python.implemented data cleaning and visualize traffic congestion by generating heat map using Python. • Reduced airport traffic delay by 3% by applying Dijkstra algorithm.

Education

  • Georgia Institute of Technology
    Master of Science - MS, Computer Science
    2021 - 2023
  • Penn State University
    Master's degree, Transportation and Highway Engineering
    2017 - 2019
  • Shandong University of Technology
    Bachelor's degree, Transportation and Highway Engineering
    2013 - 2017

Community

You need to have a working account to view this content. Click here to join now