Tianyi Li

Data Scientist at Calculated Systems
Contact Information
us****@****om
(386) 825-5501
Location
Los Angeles, California, United States

Experience

    • United States
    • IT Services and IT Consulting
    • 1 - 100 Employees
    • Data Scientist
      • Jun 2022 - Mar 2023

      • Led the design and implementation of a real-time social media analytics toolkit. Built a robust natural language processing pipeline with CoreNLP, NLTK, and HanLP on a large volume of streaming data, approx. 150 million feeds per day. Deployed on GCP using Docker and Cloud Run.
      • Developed a real-time hot topic extraction, detection, and summarization pipeline using Python, Jupyter Lab, and Google Pub/Sub, capable of customized tuning and filtering. Set up a Postgres database for ETL.
      • Trained and tuned Neural Machine Translation models based on a bi-directional GRU in TensorFlow via Jupyter Lab and Vertex AI on Google Cloud Platform. Used creative tokenization methods to translate Chinese-language text. Achieved an accuracy of 0.9 on validation data built from social media texts.

    • Actuarial Intern
      • Jan 2021 - Mar 2021

      • Created and adjusted statistical models for price calculations and predictions. Worked on large-scale, complex Bayesian probability models in R.
      • Supported finalizing pricing and strategy for multiple new products before their release in February 2021.

    • China
    • Pharmaceutical Manufacturing
    • 1 - 100 Employees
    • Data Analyst Intern
      • Jul 2020 - Oct 2020

      Handled data cleaning, processing, and visualization for raw medical research data through Tableau and Microsoft Power BI.

Education

  • University of Southern California
    Master's degree, Applied Data Science
    2021 - 2023
  • University of Rochester
    Bachelor of Science - BS, Computer Science
    2017 - 2021
  • University of Rochester
    Bachelor of Arts - BA, Statistics
    2017 - 2021
  • No. 2 High School Affiliated to East China Normal University
    High School Diploma
    2014 - 2017
