Christine Straub

Machine Learning Engineer at unstructured.io
  • Claim this Profile
Contact Information
Location
Irvine, California, United States, US
Languages
  • English Professional working proficiency
  • Spanish Limited working proficiency
  • Tagalog Native or bilingual proficiency

Topline Score

Bio

Generated by
Topline AI

0

/5.0
/ Based on 0 ratings
  • (0)
  • (0)
  • (0)
  • (0)
  • (0)

Filter reviews by:

No reviews to display There are currently no reviews available.

0

/5.0
/ Based on 0 ratings
  • (0)
  • (0)
  • (0)
  • (0)
  • (0)

Filter reviews by:

No reviews to display There are currently no reviews available.
You need to have a working account to view this content. Click here to join now

Credentials

  • Decisions, Decisions: Dashboards and Reports
    Coursera
    Jul, 2023
    - Sep, 2024
  • Google Business Intelligence Certificate
    Coursera
    Jul, 2023
    - Sep, 2024
  • Improving Deep Neural Networks: Hyperparameter Tuning, Regularization and Optimization
    Coursera
    Jul, 2023
    - Sep, 2024
  • Neural Networks and Deep Learning
    Coursera
    Jul, 2023
    - Sep, 2024
  • Structuring Machine Learning Projects
    Coursera
    Jul, 2023
    - Sep, 2024
  • Supervised Machine Learning: Regression and Classification
    Coursera
    Jun, 2023
    - Sep, 2024
  • Advanced Learning Algorithms
    Coursera
    Jun, 2023
    - Sep, 2024
  • Foundations of Business Intelligence
    Coursera
    Jun, 2023
    - Sep, 2024
  • Google Business Intelligence Specialization
    Coursera
    Jun, 2023
    - Sep, 2024
  • Machine Learning Specialization
    Coursera
    Jun, 2023
    - Sep, 2024
  • Unsupervised Learning, Recommenders, Reinforcement Learning
    Coursera
    Jun, 2023
    - Sep, 2024
  • Analyze Data to Answer Questions
    Coursera
    May, 2023
    - Sep, 2024
  • ChatGPT for Beginners: Using AI For Market Research
    Coursera
    May, 2023
    - Sep, 2024
  • Data Analysis with R Programming
    Coursera
    May, 2023
    - Sep, 2024
  • Digital Transformation with Google Cloud
    Google Cloud Skills Boost
    May, 2023
    - Sep, 2024
  • Google Data Analytics Capstone: Complete a Case Study
    Coursera
    May, 2023
    - Sep, 2024
  • Google Data Analytics Certificate
    Coursera
    May, 2023
    - Sep, 2024
  • Google Data Analytics Specialization
    Coursera
    May, 2023
    - Sep, 2024
  • Innovating with Data and Google Cloud
    Google Cloud Skills Boost
    May, 2023
    - Sep, 2024
  • Introduction to Generative AI
    Google Cloud Skills Boost
    May, 2023
    - Sep, 2024
  • Share Data Through the Art of Visualization
    Coursera
    May, 2023
    - Sep, 2024
  • AI For Everyone
    Coursera
    Apr, 2023
    - Sep, 2024
  • Ask Questions to Make Data-Driven Decisions
    Coursera
    Apr, 2023
    - Sep, 2024
  • Prepare Data for Exploration
    Coursera
    Apr, 2023
    - Sep, 2024
  • Process Data from Dirty to Clean
    Coursera
    Apr, 2023
    - Sep, 2024
  • Machine Learning
    Stanford University
    Jan, 2019
    - Sep, 2024
  • Certified Scrum Product Owner (CSPO)
    Scrum Alliance
    Oct, 2018
    - Sep, 2024
  • Supervised Machine Learning: Regression and Classification
    Coursera
  • Foundations: Data, Data, Everywhere
    Coursera
    Apr, 2023
    - Sep, 2024
  • Convolutional Neural Networks
    Coursera

Experience

    • United States
    • Software Development
    • 1 - 100 Employee
    • Machine Learning Engineer
      • May 2023 - Present

      Unstructured Technologies transforms natural language data from raw to machine learning-ready. Its open-source libraries and APIs build custom preprocessing pipelines for labeling, training, or production machine learning pipelines. • Implemented Full-Page OCR (vs Individual-Block OCR). • Implemented supplementing document layout detected by layout detection model with elements from the Full-Page OCR. • Improved elements ordering when extracting elements from PDF document. Unstructured Technologies transforms natural language data from raw to machine learning-ready. Its open-source libraries and APIs build custom preprocessing pipelines for labeling, training, or production machine learning pipelines. • Implemented Full-Page OCR (vs Individual-Block OCR). • Implemented supplementing document layout detected by layout detection model with elements from the Full-Page OCR. • Improved elements ordering when extracting elements from PDF document.

    • United States
    • IT Services and IT Consulting
    • 1 - 100 Employee
    • Senior AI/ML Engineer (Tech Lead)
      • Jul 2023 - Present

      • Design and Develop Sensitive Site Exploitation and Language Application System for Marine Corps. • Developed a system to provide an Optical Character Recognition (OCR) capability to ingest locally taken pictures or documents for use in Common Operational Picture (COP) updates, or detailed reporting. • Developed a system to utilize the results of the OCR capability or other ingested foreign language material if translate from French, Arabic, and Chinese into English. • Design and Develop Sensitive Site Exploitation and Language Application System for Marine Corps. • Developed a system to provide an Optical Character Recognition (OCR) capability to ingest locally taken pictures or documents for use in Common Operational Picture (COP) updates, or detailed reporting. • Developed a system to utilize the results of the OCR capability or other ingested foreign language material if translate from French, Arabic, and Chinese into English.

    • United States
    • Industrial Automation
    • 1 - 100 Employee
    • Machine Learning Ops Engineer | Machine Learning Engineer
      • May 2022 - Jul 2023

      𝑲𝒆𝒚 𝑺𝒌𝒊𝒍𝒍𝒔: 𝑴𝑳𝑶𝒑𝒔, 𝑴𝒂𝒄𝒉𝒊𝒏𝒆 𝑳𝒆𝒂𝒓𝒏𝒊𝒏𝒈 , 𝑪𝒐𝒎𝒑𝒖𝒕𝒆𝒓 𝑽𝒊𝒔𝒊𝒐𝒏, 𝑬𝒅𝒈𝒆 𝑪𝒐𝒎𝒑𝒖𝒕𝒊𝒏𝒈, 𝑨𝒖𝒕𝒐 𝑴𝑳 𝑷𝒊𝒑𝒆𝒍𝒊𝒏𝒆, 𝑴𝒐𝒅𝒆𝒍 𝒅𝒆𝒑𝒍𝒐𝒚𝒎𝒆𝒏𝒕 𝒂𝒖𝒕𝒐𝒎𝒂𝒕𝒊𝒐𝒏, 𝑫𝒂𝒕𝒂 𝑬𝒏𝒈𝒊𝒏𝒆𝒆𝒓𝒊𝒏𝒈. ★ Built and deployed AI-powered end-to-end robotic workcells that integrate within existing workflows to help enterprises automate their entire factories, warehouses, or supply chain operations. ★ Designed, prototyped, trained and delivered… Show more 𝑲𝒆𝒚 𝑺𝒌𝒊𝒍𝒍𝒔: 𝑴𝑳𝑶𝒑𝒔, 𝑴𝒂𝒄𝒉𝒊𝒏𝒆 𝑳𝒆𝒂𝒓𝒏𝒊𝒏𝒈 , 𝑪𝒐𝒎𝒑𝒖𝒕𝒆𝒓 𝑽𝒊𝒔𝒊𝒐𝒏, 𝑬𝒅𝒈𝒆 𝑪𝒐𝒎𝒑𝒖𝒕𝒊𝒏𝒈, 𝑨𝒖𝒕𝒐 𝑴𝑳 𝑷𝒊𝒑𝒆𝒍𝒊𝒏𝒆, 𝑴𝒐𝒅𝒆𝒍 𝒅𝒆𝒑𝒍𝒐𝒚𝒎𝒆𝒏𝒕 𝒂𝒖𝒕𝒐𝒎𝒂𝒕𝒊𝒐𝒏, 𝑫𝒂𝒕𝒂 𝑬𝒏𝒈𝒊𝒏𝒆𝒆𝒓𝒊𝒏𝒈. ★ Built and deployed AI-powered end-to-end robotic workcells that integrate within existing workflows to help enterprises automate their entire factories, warehouses, or supply chain operations. ★ Designed, prototyped, trained and delivered machine learning models to revolutionize manufacturing, supply chains, e-commerce, and other industrial applications. ★ Worked on Data ETL (extract, transform and load) tasks which includes extracting image data from compressed formats and transforming data for ingestion into training pipelines. ★ Performed Feature extraction, transformation and standardization. ★ Performed Metadata generation. ★ Performed Data registration. ★ Collected appropriate data for model training and development, worked with Data acquisition engineers. ★ Architected, prototype, test and help deploy modern ML models and algorithms. ★ Solved real world robotics problems using ML and CV(computer vision) solutions. ★ Improved the performance of prototype with tuning model performance in the field through continual data acquisition. ★ Designed Data pipeline and data management with the custom/in-house datasets with State-of-the-Art models. ★ Utilized Auto ML pipeline for training customized object detection and recognition. ★ Developed APIs for interacting with backend storage infrastructure. ★ Architected and developed the data infrastructure for machine learning applications. ★ Worked on data ingestion pipelines (Multi-cloud / multiple physical locations) for databases such as SQL, NoSQL, and cloud solutions (data typing and learning frameworks). Show less 𝑲𝒆𝒚 𝑺𝒌𝒊𝒍𝒍𝒔: 𝑴𝑳𝑶𝒑𝒔, 𝑴𝒂𝒄𝒉𝒊𝒏𝒆 𝑳𝒆𝒂𝒓𝒏𝒊𝒏𝒈 , 𝑪𝒐𝒎𝒑𝒖𝒕𝒆𝒓 𝑽𝒊𝒔𝒊𝒐𝒏, 𝑬𝒅𝒈𝒆 𝑪𝒐𝒎𝒑𝒖𝒕𝒊𝒏𝒈, 𝑨𝒖𝒕𝒐 𝑴𝑳 𝑷𝒊𝒑𝒆𝒍𝒊𝒏𝒆, 𝑴𝒐𝒅𝒆𝒍 𝒅𝒆𝒑𝒍𝒐𝒚𝒎𝒆𝒏𝒕 𝒂𝒖𝒕𝒐𝒎𝒂𝒕𝒊𝒐𝒏, 𝑫𝒂𝒕𝒂 𝑬𝒏𝒈𝒊𝒏𝒆𝒆𝒓𝒊𝒏𝒈. ★ Built and deployed AI-powered end-to-end robotic workcells that integrate within existing workflows to help enterprises automate their entire factories, warehouses, or supply chain operations. ★ Designed, prototyped, trained and delivered… Show more 𝑲𝒆𝒚 𝑺𝒌𝒊𝒍𝒍𝒔: 𝑴𝑳𝑶𝒑𝒔, 𝑴𝒂𝒄𝒉𝒊𝒏𝒆 𝑳𝒆𝒂𝒓𝒏𝒊𝒏𝒈 , 𝑪𝒐𝒎𝒑𝒖𝒕𝒆𝒓 𝑽𝒊𝒔𝒊𝒐𝒏, 𝑬𝒅𝒈𝒆 𝑪𝒐𝒎𝒑𝒖𝒕𝒊𝒏𝒈, 𝑨𝒖𝒕𝒐 𝑴𝑳 𝑷𝒊𝒑𝒆𝒍𝒊𝒏𝒆, 𝑴𝒐𝒅𝒆𝒍 𝒅𝒆𝒑𝒍𝒐𝒚𝒎𝒆𝒏𝒕 𝒂𝒖𝒕𝒐𝒎𝒂𝒕𝒊𝒐𝒏, 𝑫𝒂𝒕𝒂 𝑬𝒏𝒈𝒊𝒏𝒆𝒆𝒓𝒊𝒏𝒈. ★ Built and deployed AI-powered end-to-end robotic workcells that integrate within existing workflows to help enterprises automate their entire factories, warehouses, or supply chain operations. ★ Designed, prototyped, trained and delivered machine learning models to revolutionize manufacturing, supply chains, e-commerce, and other industrial applications. ★ Worked on Data ETL (extract, transform and load) tasks which includes extracting image data from compressed formats and transforming data for ingestion into training pipelines. ★ Performed Feature extraction, transformation and standardization. ★ Performed Metadata generation. ★ Performed Data registration. ★ Collected appropriate data for model training and development, worked with Data acquisition engineers. ★ Architected, prototype, test and help deploy modern ML models and algorithms. ★ Solved real world robotics problems using ML and CV(computer vision) solutions. ★ Improved the performance of prototype with tuning model performance in the field through continual data acquisition. ★ Designed Data pipeline and data management with the custom/in-house datasets with State-of-the-Art models. ★ Utilized Auto ML pipeline for training customized object detection and recognition. ★ Developed APIs for interacting with backend storage infrastructure. ★ Architected and developed the data infrastructure for machine learning applications. ★ Worked on data ingestion pipelines (Multi-cloud / multiple physical locations) for databases such as SQL, NoSQL, and cloud solutions (data typing and learning frameworks). Show less

    • United States
    • Technology, Information and Internet
    • 1 - 100 Employee
    • Senior Software Architect
      • Jun 2022 - Jun 2023

      𝑺𝒑𝒆𝒆𝒄𝒉𝒍𝒂𝒃 𝒖𝒔𝒆𝒔 𝑨𝑰 𝒕𝒐 𝒂𝒖𝒕𝒐𝒎𝒂𝒕𝒆 𝒔𝒑𝒆𝒆𝒄𝒉-𝒕𝒐-𝒔𝒑𝒆𝒆𝒄𝒉 𝒕𝒓𝒂𝒏𝒔𝒍𝒂𝒕𝒊𝒐𝒏 𝒇𝒐𝒓 𝒅𝒖𝒃𝒃𝒊𝒏𝒈 𝒐𝒇 𝒑𝒓𝒐𝒇𝒆𝒔𝒔𝒊𝒐𝒏𝒂𝒍𝒍𝒚 𝒑𝒓𝒐𝒅𝒖𝒄𝒆𝒅 𝒂𝒖𝒅𝒊𝒐 𝒂𝒏𝒅 𝒗𝒊𝒅𝒆𝒐. 𝑺𝒑𝒆𝒆𝒄𝒉𝒍𝒂𝒃 𝒊𝒔 𝒃𝒂𝒄𝒌𝒆𝒅 𝒃𝒚 𝑨𝒏𝒅𝒓𝒆𝒘 𝑵𝒈'𝒔 𝑨𝑰 𝒇𝒖𝒏𝒅. 𝑲𝒆𝒚 𝑺𝒌𝒊𝒍𝒍𝒔: 𝑨𝑾𝑺 𝑪𝒍𝒐𝒖𝒅 𝑨𝒓𝒄𝒉𝒊𝒕𝒆𝒄𝒕𝒖𝒓𝒆, 𝑺𝒑𝒆𝒆𝒄𝒉 𝑹𝒆𝒄𝒐𝒈𝒏𝒊𝒕𝒊𝒐𝒏, 𝑴𝒂𝒄𝒉𝒊𝒏𝒆 𝑻𝒓𝒂𝒏𝒔𝒍𝒂𝒕𝒊𝒐𝒏, 𝑻𝒆𝒙𝒕-𝒕𝒐-𝑺𝒑𝒆𝒆𝒄𝒉, 𝑴𝑬𝑹𝑵 𝒔𝒕𝒂𝒄𝒌. ★… Show more 𝑺𝒑𝒆𝒆𝒄𝒉𝒍𝒂𝒃 𝒖𝒔𝒆𝒔 𝑨𝑰 𝒕𝒐 𝒂𝒖𝒕𝒐𝒎𝒂𝒕𝒆 𝒔𝒑𝒆𝒆𝒄𝒉-𝒕𝒐-𝒔𝒑𝒆𝒆𝒄𝒉 𝒕𝒓𝒂𝒏𝒔𝒍𝒂𝒕𝒊𝒐𝒏 𝒇𝒐𝒓 𝒅𝒖𝒃𝒃𝒊𝒏𝒈 𝒐𝒇 𝒑𝒓𝒐𝒇𝒆𝒔𝒔𝒊𝒐𝒏𝒂𝒍𝒍𝒚 𝒑𝒓𝒐𝒅𝒖𝒄𝒆𝒅 𝒂𝒖𝒅𝒊𝒐 𝒂𝒏𝒅 𝒗𝒊𝒅𝒆𝒐. 𝑺𝒑𝒆𝒆𝒄𝒉𝒍𝒂𝒃 𝒊𝒔 𝒃𝒂𝒄𝒌𝒆𝒅 𝒃𝒚 𝑨𝒏𝒅𝒓𝒆𝒘 𝑵𝒈'𝒔 𝑨𝑰 𝒇𝒖𝒏𝒅. 𝑲𝒆𝒚 𝑺𝒌𝒊𝒍𝒍𝒔: 𝑨𝑾𝑺 𝑪𝒍𝒐𝒖𝒅 𝑨𝒓𝒄𝒉𝒊𝒕𝒆𝒄𝒕𝒖𝒓𝒆, 𝑺𝒑𝒆𝒆𝒄𝒉 𝑹𝒆𝒄𝒐𝒈𝒏𝒊𝒕𝒊𝒐𝒏, 𝑴𝒂𝒄𝒉𝒊𝒏𝒆 𝑻𝒓𝒂𝒏𝒔𝒍𝒂𝒕𝒊𝒐𝒏, 𝑻𝒆𝒙𝒕-𝒕𝒐-𝑺𝒑𝒆𝒆𝒄𝒉, 𝑴𝑬𝑹𝑵 𝒔𝒕𝒂𝒄𝒌. ★ Key member in designing system architecture based on Amazon Web Services. ★ Build core components of the API to access large language models. ★ Optimize the API and interfaces for performance, robustness, and ease of use. ★ Implement tools and processes to monitor model behavior and performance using Sentry and Loggly. ★ Implemented thumbnail generation for uploaded video files ★ Built utility backend services with AWS Lambda ☆ Built a service that processes Machine Learning Output and calls the backend API ☆ Built a service that is called by the backend API and then generates SRT formatted text file on the AWS S3 bucket ☆ Built a service that notifies new user registration ★ Wrote APIs for content sharing ☆ Direct content sharing by sending an invitation email that contains an invitation link using AWS SES ☆ General content sharing using a public invitation link ☆ Permission management for shared users ★ Wrote APIs and backend services to support multi-language translation and multi-accent dubbing. ★ Designed data models, implemented backend logic, and wrote APIs to implement the Paywall system by integrating the Paddle platform. ★ Implemented backend logic and APIs to keep transcribed language pronunciations for highlighted words/phrases in the translated languages and dubs. ★ Implemented backend logic and APIs to synchronize audio and text. Show less 𝑺𝒑𝒆𝒆𝒄𝒉𝒍𝒂𝒃 𝒖𝒔𝒆𝒔 𝑨𝑰 𝒕𝒐 𝒂𝒖𝒕𝒐𝒎𝒂𝒕𝒆 𝒔𝒑𝒆𝒆𝒄𝒉-𝒕𝒐-𝒔𝒑𝒆𝒆𝒄𝒉 𝒕𝒓𝒂𝒏𝒔𝒍𝒂𝒕𝒊𝒐𝒏 𝒇𝒐𝒓 𝒅𝒖𝒃𝒃𝒊𝒏𝒈 𝒐𝒇 𝒑𝒓𝒐𝒇𝒆𝒔𝒔𝒊𝒐𝒏𝒂𝒍𝒍𝒚 𝒑𝒓𝒐𝒅𝒖𝒄𝒆𝒅 𝒂𝒖𝒅𝒊𝒐 𝒂𝒏𝒅 𝒗𝒊𝒅𝒆𝒐. 𝑺𝒑𝒆𝒆𝒄𝒉𝒍𝒂𝒃 𝒊𝒔 𝒃𝒂𝒄𝒌𝒆𝒅 𝒃𝒚 𝑨𝒏𝒅𝒓𝒆𝒘 𝑵𝒈'𝒔 𝑨𝑰 𝒇𝒖𝒏𝒅. 𝑲𝒆𝒚 𝑺𝒌𝒊𝒍𝒍𝒔: 𝑨𝑾𝑺 𝑪𝒍𝒐𝒖𝒅 𝑨𝒓𝒄𝒉𝒊𝒕𝒆𝒄𝒕𝒖𝒓𝒆, 𝑺𝒑𝒆𝒆𝒄𝒉 𝑹𝒆𝒄𝒐𝒈𝒏𝒊𝒕𝒊𝒐𝒏, 𝑴𝒂𝒄𝒉𝒊𝒏𝒆 𝑻𝒓𝒂𝒏𝒔𝒍𝒂𝒕𝒊𝒐𝒏, 𝑻𝒆𝒙𝒕-𝒕𝒐-𝑺𝒑𝒆𝒆𝒄𝒉, 𝑴𝑬𝑹𝑵 𝒔𝒕𝒂𝒄𝒌. ★… Show more 𝑺𝒑𝒆𝒆𝒄𝒉𝒍𝒂𝒃 𝒖𝒔𝒆𝒔 𝑨𝑰 𝒕𝒐 𝒂𝒖𝒕𝒐𝒎𝒂𝒕𝒆 𝒔𝒑𝒆𝒆𝒄𝒉-𝒕𝒐-𝒔𝒑𝒆𝒆𝒄𝒉 𝒕𝒓𝒂𝒏𝒔𝒍𝒂𝒕𝒊𝒐𝒏 𝒇𝒐𝒓 𝒅𝒖𝒃𝒃𝒊𝒏𝒈 𝒐𝒇 𝒑𝒓𝒐𝒇𝒆𝒔𝒔𝒊𝒐𝒏𝒂𝒍𝒍𝒚 𝒑𝒓𝒐𝒅𝒖𝒄𝒆𝒅 𝒂𝒖𝒅𝒊𝒐 𝒂𝒏𝒅 𝒗𝒊𝒅𝒆𝒐. 𝑺𝒑𝒆𝒆𝒄𝒉𝒍𝒂𝒃 𝒊𝒔 𝒃𝒂𝒄𝒌𝒆𝒅 𝒃𝒚 𝑨𝒏𝒅𝒓𝒆𝒘 𝑵𝒈'𝒔 𝑨𝑰 𝒇𝒖𝒏𝒅. 𝑲𝒆𝒚 𝑺𝒌𝒊𝒍𝒍𝒔: 𝑨𝑾𝑺 𝑪𝒍𝒐𝒖𝒅 𝑨𝒓𝒄𝒉𝒊𝒕𝒆𝒄𝒕𝒖𝒓𝒆, 𝑺𝒑𝒆𝒆𝒄𝒉 𝑹𝒆𝒄𝒐𝒈𝒏𝒊𝒕𝒊𝒐𝒏, 𝑴𝒂𝒄𝒉𝒊𝒏𝒆 𝑻𝒓𝒂𝒏𝒔𝒍𝒂𝒕𝒊𝒐𝒏, 𝑻𝒆𝒙𝒕-𝒕𝒐-𝑺𝒑𝒆𝒆𝒄𝒉, 𝑴𝑬𝑹𝑵 𝒔𝒕𝒂𝒄𝒌. ★ Key member in designing system architecture based on Amazon Web Services. ★ Build core components of the API to access large language models. ★ Optimize the API and interfaces for performance, robustness, and ease of use. ★ Implement tools and processes to monitor model behavior and performance using Sentry and Loggly. ★ Implemented thumbnail generation for uploaded video files ★ Built utility backend services with AWS Lambda ☆ Built a service that processes Machine Learning Output and calls the backend API ☆ Built a service that is called by the backend API and then generates SRT formatted text file on the AWS S3 bucket ☆ Built a service that notifies new user registration ★ Wrote APIs for content sharing ☆ Direct content sharing by sending an invitation email that contains an invitation link using AWS SES ☆ General content sharing using a public invitation link ☆ Permission management for shared users ★ Wrote APIs and backend services to support multi-language translation and multi-accent dubbing. ★ Designed data models, implemented backend logic, and wrote APIs to implement the Paywall system by integrating the Paddle platform. ★ Implemented backend logic and APIs to keep transcribed language pronunciations for highlighted words/phrases in the translated languages and dubs. ★ Implemented backend logic and APIs to synchronize audio and text. Show less

    • Senior Software Engineer Technical Lead
      • May 2021 - Feb 2023

      𝑨𝒓𝒂𝒚𝒂 𝑨𝒑𝒑 𝒊𝒔 𝒂𝒏 𝑬𝒍𝒆𝒄𝒕𝒓𝒐𝒏𝒊𝒄 𝑯𝒆𝒂𝒍𝒕𝒉 𝑹𝒆𝒄𝒐𝒓𝒅 (𝑬𝑯𝑹) 𝒂𝒑𝒑𝒍𝒊𝒄𝒂𝒕𝒊𝒐𝒏 𝒇𝒐𝒓 𝒑𝒂𝒕𝒊𝒆𝒏𝒕 𝒓𝒆𝒈𝒊𝒔𝒕𝒓𝒂𝒕𝒊𝒐𝒏 𝒂𝒏𝒅 𝒑𝒂𝒕𝒊𝒆𝒏𝒕 𝒎𝒂𝒏𝒂𝒈𝒆𝒎𝒆𝒏𝒕 [𝑯𝑰𝑷𝑷𝑨 𝑪𝒐𝒎𝒑𝒍𝒊𝒂𝒏𝒕]. ★ Architected cloud-based EHR software system (AWS EC2 Infrastructure ). ★ Architected EHR’s Communication System, Network, Patient Registration System, Hospital Registration & other External Services. ★ Architected, designed & developed the application… Show more 𝑨𝒓𝒂𝒚𝒂 𝑨𝒑𝒑 𝒊𝒔 𝒂𝒏 𝑬𝒍𝒆𝒄𝒕𝒓𝒐𝒏𝒊𝒄 𝑯𝒆𝒂𝒍𝒕𝒉 𝑹𝒆𝒄𝒐𝒓𝒅 (𝑬𝑯𝑹) 𝒂𝒑𝒑𝒍𝒊𝒄𝒂𝒕𝒊𝒐𝒏 𝒇𝒐𝒓 𝒑𝒂𝒕𝒊𝒆𝒏𝒕 𝒓𝒆𝒈𝒊𝒔𝒕𝒓𝒂𝒕𝒊𝒐𝒏 𝒂𝒏𝒅 𝒑𝒂𝒕𝒊𝒆𝒏𝒕 𝒎𝒂𝒏𝒂𝒈𝒆𝒎𝒆𝒏𝒕 [𝑯𝑰𝑷𝑷𝑨 𝑪𝒐𝒎𝒑𝒍𝒊𝒂𝒏𝒕]. ★ Architected cloud-based EHR software system (AWS EC2 Infrastructure ). ★ Architected EHR’s Communication System, Network, Patient Registration System, Hospital Registration & other External Services. ★ Architected, designed & developed the application in accordance with HIPAA compliance requirements. ★ Supervised implementation and deployments of HIPAA regulatory compliance requirements such as: Authentication, Auto Log-off, Adit & Alerts, Encryption, Hosting & Infrastructure & Authorization. ★ Supervised Production Environment Setup which involves various components of the application: AWS Amazon Virtual Private Cloud, ECR, ECS, EC2, and AWS RDS. ★ Managed a team of engineers to make necessary system improvements to satisfy physician and staff needs for improved services. ★ Hired / Interviewed Full-Stack engineers, Dev-Ops Engineer, Quality Engineer to support the development of the application. ★ Work with Dev-Ops Engineer on implementing CICD Pipeline in production using GitHub Actions. ★ Implemented Role Based Access Control for Doctors, nurses, and other staff to secure the application. ★ Implemented a mechanism to authenticate ePHI: Phone Verification (Vonage) & Email Verification ( SendGrid Mail Server) for Multi Factor Authentication. ★ Implemented Encryption and decryption: AWS RDS encryption feature using KMS & Completed SSL Certificate installation on the server for data transfers. ★ Implemented logs and audit controls: AWS RDS Alarm,, AWS EC2 Monitor Alarm, Web Server(NGINX) Error Alarm, EPM backend(Django) Error Alarm, & AWS CloudWatch. ★ Integrated CollaborateMD, eClaimStatus. epowerdoc to the application. ★ Integrated Sonicwall Firewall ★ Good knowledge of HL7 and FHIR protocols.

    • Machine Learning Engineer
      • Jan 2021 - May 2021

      𝑨𝒑𝒑𝒍𝒚 𝑴𝒂𝒄𝒉𝒊𝒏𝒆 𝑳𝒆𝒂𝒓𝒏𝒊𝒏𝒈 𝒕𝒐 𝒅𝒆𝒕𝒆𝒓𝒎𝒊𝒏𝒆 𝒕𝒉𝒆 𝒄𝒐𝒏𝒕𝒆𝒏𝒕 𝒐𝒇 𝒕𝒉𝒆 𝑨𝑻𝑻&𝑪𝑲 𝒇𝒓𝒂𝒎𝒆𝒘𝒐𝒓𝒌 𝒂𝒏𝒅 𝒕𝒉𝒆 𝒅𝒆𝒔𝒄𝒓𝒊𝒑𝒕𝒊𝒐𝒏 𝒐𝒇 𝒆𝒂𝒄𝒉 𝒅𝒊𝒈𝒊𝒕𝒂𝒍 𝒗𝒂𝒄𝒄𝒊𝒏𝒆 𝒊𝒏 𝒕𝒉𝒆 𝒅𝒂𝒕𝒂𝒔𝒆𝒕. ★ Built UIs (React.js) to visualize aggregation results of the machine learning model that classifies cyberattack techniques based on the MITRE ATT&CK framework. ★ Utilized Python and Natural Language Processing libraries to developed model that… Show more 𝑨𝒑𝒑𝒍𝒚 𝑴𝒂𝒄𝒉𝒊𝒏𝒆 𝑳𝒆𝒂𝒓𝒏𝒊𝒏𝒈 𝒕𝒐 𝒅𝒆𝒕𝒆𝒓𝒎𝒊𝒏𝒆 𝒕𝒉𝒆 𝒄𝒐𝒏𝒕𝒆𝒏𝒕 𝒐𝒇 𝒕𝒉𝒆 𝑨𝑻𝑻&𝑪𝑲 𝒇𝒓𝒂𝒎𝒆𝒘𝒐𝒓𝒌 𝒂𝒏𝒅 𝒕𝒉𝒆 𝒅𝒆𝒔𝒄𝒓𝒊𝒑𝒕𝒊𝒐𝒏 𝒐𝒇 𝒆𝒂𝒄𝒉 𝒅𝒊𝒈𝒊𝒕𝒂𝒍 𝒗𝒂𝒄𝒄𝒊𝒏𝒆 𝒊𝒏 𝒕𝒉𝒆 𝒅𝒂𝒕𝒂𝒔𝒆𝒕. ★ Built UIs (React.js) to visualize aggregation results of the machine learning model that classifies cyberattack techniques based on the MITRE ATT&CK framework. ★ Utilized Python and Natural Language Processing libraries to developed model that predicted the context of the ATT&CK framework data with 94% accuracy; resulting in reduction of false-positives in vaccine mis-identification by 50%. ★Developed advanced machine learning models to detect threats, with 90% accuracy in predicting malicious activities, and automated the deployment of ML models on production servers.

    • United States
    • IT Services and IT Consulting
    • 300 - 400 Employee
    • Senior Machine Learning Consultant
      • Jan 2018 - Feb 2023

      - Developed product and managed over 70+ projects through Upwork. - 100% Top Rated Plus Status - Earned at least 700k - Worked on several NLP and AI Chatbot development projects which analyze and processes the text with NLP, NLU, and Deep Learning techniques. - Worked on Data Engineering, Data Science, Big Query, Data Warehousing and ML projects. - Developed a strategy engine made with LSTM and other machine learning and statistical algorithms used as trading bots. -… Show more - Developed product and managed over 70+ projects through Upwork. - 100% Top Rated Plus Status - Earned at least 700k - Worked on several NLP and AI Chatbot development projects which analyze and processes the text with NLP, NLU, and Deep Learning techniques. - Worked on Data Engineering, Data Science, Big Query, Data Warehousing and ML projects. - Developed a strategy engine made with LSTM and other machine learning and statistical algorithms used as trading bots. - Worked on FullStack Development projects Show less - Developed product and managed over 70+ projects through Upwork. - 100% Top Rated Plus Status - Earned at least 700k - Worked on several NLP and AI Chatbot development projects which analyze and processes the text with NLP, NLU, and Deep Learning techniques. - Worked on Data Engineering, Data Science, Big Query, Data Warehousing and ML projects. - Developed a strategy engine made with LSTM and other machine learning and statistical algorithms used as trading bots. -… Show more - Developed product and managed over 70+ projects through Upwork. - 100% Top Rated Plus Status - Earned at least 700k - Worked on several NLP and AI Chatbot development projects which analyze and processes the text with NLP, NLU, and Deep Learning techniques. - Worked on Data Engineering, Data Science, Big Query, Data Warehousing and ML projects. - Developed a strategy engine made with LSTM and other machine learning and statistical algorithms used as trading bots. - Worked on FullStack Development projects Show less

    • United States
    • Software Development
    • 1 - 100 Employee
    • Natural Language Processing Engineer
      • Nov 2021 - Jun 2022

      Objective: Build a Conversational bot using Rasa Chatbot Engine. - A new learning experience that is both informative and interactive. - Ability for learners to follow the program at their own pace and access interactive content 24/7. - A high sign-up of learners and good feedback on the educational program, so more Digital Colleagues can be created. Accomplishments: - Designed and Built requisite services for Dominique, a digital teaching assistant built by Soul… Show more Objective: Build a Conversational bot using Rasa Chatbot Engine. - A new learning experience that is both informative and interactive. - Ability for learners to follow the program at their own pace and access interactive content 24/7. - A high sign-up of learners and good feedback on the educational program, so more Digital Colleagues can be created. Accomplishments: - Designed and Built requisite services for Dominique, a digital teaching assistant built by Soul Machines for Bill O'Connor's Innovation class at Maryville University. - Participated in designing the entire conversation flow in Whimsical. - Designed a chatbot system architecture Based on Google Cloud Platform and its services. - Built a fully customized chatbot interface to be embedded in the Canvas learning management system with React.js. - Built orchestration services to integrate chatbot models with Node.js and Express.js. - Built chatbot models that can understand the intent of the user, learn from it, respond intelligently and perform actions if required with an efficient learning mechanism with RASA and Google Dialogflow CX. RASA - Improved intent matching and entity extraction accuracy up to 98%. - Built a dialogue management model that predicts what action or response this chatbot should make based on the state of the conversation. Improved prediction accuracy up to 95%. - Built a custom action server that consumes a customized NLP model built by Spacy. Dialogflow CX: - Created a multi-flow agent. - Built a webhook that fetches data from third-party APIs and database with Google Cloud Function. Show less Objective: Build a Conversational bot using Rasa Chatbot Engine. - A new learning experience that is both informative and interactive. - Ability for learners to follow the program at their own pace and access interactive content 24/7. - A high sign-up of learners and good feedback on the educational program, so more Digital Colleagues can be created. Accomplishments: - Designed and Built requisite services for Dominique, a digital teaching assistant built by Soul… Show more Objective: Build a Conversational bot using Rasa Chatbot Engine. - A new learning experience that is both informative and interactive. - Ability for learners to follow the program at their own pace and access interactive content 24/7. - A high sign-up of learners and good feedback on the educational program, so more Digital Colleagues can be created. Accomplishments: - Designed and Built requisite services for Dominique, a digital teaching assistant built by Soul Machines for Bill O'Connor's Innovation class at Maryville University. - Participated in designing the entire conversation flow in Whimsical. - Designed a chatbot system architecture Based on Google Cloud Platform and its services. - Built a fully customized chatbot interface to be embedded in the Canvas learning management system with React.js. - Built orchestration services to integrate chatbot models with Node.js and Express.js. - Built chatbot models that can understand the intent of the user, learn from it, respond intelligently and perform actions if required with an efficient learning mechanism with RASA and Google Dialogflow CX. RASA - Improved intent matching and entity extraction accuracy up to 98%. - Built a dialogue management model that predicts what action or response this chatbot should make based on the state of the conversation. Improved prediction accuracy up to 95%. - Built a custom action server that consumes a customized NLP model built by Spacy. Dialogflow CX: - Created a multi-flow agent. - Built a webhook that fetches data from third-party APIs and database with Google Cloud Function. Show less

    • United States
    • Education Administration Programs
    • 200 - 300 Employee
    • Google Data Engineer | Natural Language Processing Engineer
      • May 2021 - Apr 2022

      - Created near-real time event based ETL pipelines using Google Cloud Functions to fetch data from external API’s for example Phoneburner and LMS Canvas. - Enabled CICD using Github and Github Actions. - Created ETL pipelines in fivetran to integrate data from multiple sources e.g. SQL Server, Google Sheets and Salesforce. - Enabled automatic scaling and handle schema evolutions. - Used dbt models, seeds, exposure to transform and document data in BigQuery. - Created Data Lineage… Show more - Created near-real time event based ETL pipelines using Google Cloud Functions to fetch data from external API’s for example Phoneburner and LMS Canvas. - Enabled CICD using Github and Github Actions. - Created ETL pipelines in fivetran to integrate data from multiple sources e.g. SQL Server, Google Sheets and Salesforce. - Enabled automatic scaling and handle schema evolutions. - Used dbt models, seeds, exposure to transform and document data in BigQuery. - Created Data Lineage using DBT. - Built dashboards using thoughtspot to provide reports of data marts. - Extracted audio calls from phoneburner, converted to text using Cloud Speech to Text and performed sentiment analysis using Cloud Natural Language API. - Ingested assessment form data from Nice for analysis in BigQuery in order to enable stakeholders to track rep call performance improvement. - Prepared data representation for phoneburner in details and major entities involved. - Performed Performance Testing, including data sizes and time taken to fetch complete data for required routes. - Developed data fetch strategy based on performance test results. - Tested Cloud Run service for phoneburner and performed data integrity checks. - Provided assessment of potential roadmaps for cloud technologies with a focus on Synapse on-prem infrastructure and Measurement Enablement GCP infrastructure. - Tuned existing ETL and reporting models. - Reported to executive level to provide decision making insights. - Built prototype on various enhancements. - Created event-driven workflows using Google Cloud Functions and Google Pub/Sub. - Designed and built data marts and cubes for financial reporting and analytics. - Interacted with business analysts and marketing teams for requirement gathering and estimations. Show less - Created near-real time event based ETL pipelines using Google Cloud Functions to fetch data from external API’s for example Phoneburner and LMS Canvas. - Enabled CICD using Github and Github Actions. - Created ETL pipelines in fivetran to integrate data from multiple sources e.g. SQL Server, Google Sheets and Salesforce. - Enabled automatic scaling and handle schema evolutions. - Used dbt models, seeds, exposure to transform and document data in BigQuery. - Created Data Lineage… Show more - Created near-real time event based ETL pipelines using Google Cloud Functions to fetch data from external API’s for example Phoneburner and LMS Canvas. - Enabled CICD using Github and Github Actions. - Created ETL pipelines in fivetran to integrate data from multiple sources e.g. SQL Server, Google Sheets and Salesforce. - Enabled automatic scaling and handle schema evolutions. - Used dbt models, seeds, exposure to transform and document data in BigQuery. - Created Data Lineage using DBT. - Built dashboards using thoughtspot to provide reports of data marts. - Extracted audio calls from phoneburner, converted to text using Cloud Speech to Text and performed sentiment analysis using Cloud Natural Language API. - Ingested assessment form data from Nice for analysis in BigQuery in order to enable stakeholders to track rep call performance improvement. - Prepared data representation for phoneburner in details and major entities involved. - Performed Performance Testing, including data sizes and time taken to fetch complete data for required routes. - Developed data fetch strategy based on performance test results. - Tested Cloud Run service for phoneburner and performed data integrity checks. - Provided assessment of potential roadmaps for cloud technologies with a focus on Synapse on-prem infrastructure and Measurement Enablement GCP infrastructure. - Tuned existing ETL and reporting models. - Reported to executive level to provide decision making insights. - Built prototype on various enhancements. - Created event-driven workflows using Google Cloud Functions and Google Pub/Sub. - Designed and built data marts and cubes for financial reporting and analytics. - Interacted with business analysts and marketing teams for requirement gathering and estimations. Show less

    • United States
    • Software Development
    • 100 - 200 Employee
    • Senior Data Engineer
      • Jun 2021 - Dec 2021

      • Built data pipeline to ingest data to feed utility and internal websites using Python scripts, AWS Kinesis, and Apache Spark, which improved the metrics of the data used by the websites. • Designed and deployed a data pipeline for data analysis using Python scripts, AWS Elasticsearch/Kibana, and S3, to extract, transform, and load data from application logs and present the data to a dashboard. • Created visualizations using React.js, D3.JS library, and Python to collect metrics of ML… Show more • Built data pipeline to ingest data to feed utility and internal websites using Python scripts, AWS Kinesis, and Apache Spark, which improved the metrics of the data used by the websites. • Designed and deployed a data pipeline for data analysis using Python scripts, AWS Elasticsearch/Kibana, and S3, to extract, transform, and load data from application logs and present the data to a dashboard. • Created visualizations using React.js, D3.JS library, and Python to collect metrics of ML models, and descriptive data (e.g. being able to show orders on map, visitors on map, make the map interactive, change the map and rest of the data changes, etc.) to increase user engagement by 10% • Developed a set of utility and internal websites for the company, enabling the end users (both internal and external) to get instant price estimates for shipping services, such as FTL, LTL and Domestic shipments. • Developed unit and component tests using Jest and Enzyme, resulting in 95% test coverage. • Reduced average data transformation time for a data set of over 100,000 records from more than one second to less than 100 milliseconds by implementing client-side data transformations in the browser using JavaScript. • Supported dynamic forms by developing algorithms for providing market data analysis for food and beverage companies making projections about product/factory capacity, cost, and several other factors. • Created dynamically populated data filters as dropdowns, checkbox groups, input fields, and sliders, resulting in a 60% increase in page views, in order to evaluate and pivot market data analysis charts. • Used Chrome DevTools to investigate and fix front-end rendering performance issues and computationally intensive bottlenecks. • Deployed and managed containerized web applications using Docker and Amazon ECS service which reduced deployment time from 1 week to 2 days, and reduced development and maintenance costs by 50%

    • Senior Software Engineer
      • Apr 2021 - Jun 2021

      - Developed utility and internal websites for the company. The tools are used to estimate the shipping price, the goal is to estimate the price for FTL, LTL and Domestic shipment to be used for both internal (marketing and sales team) and external use (providing service to users). - Bug fixing to ensure smooth delivery and functioning of the applications. - Conducted testing & maintained the code quality. - Collaborated with a team of 15 engineers to define, design and ship new… Show more - Developed utility and internal websites for the company. The tools are used to estimate the shipping price, the goal is to estimate the price for FTL, LTL and Domestic shipment to be used for both internal (marketing and sales team) and external use (providing service to users). - Bug fixing to ensure smooth delivery and functioning of the applications. - Conducted testing & maintained the code quality. - Collaborated with a team of 15 engineers to define, design and ship new features & correct multiple bugs

    • United States
    • Telecommunications
    • 100 - 200 Employee
    • Natural Language Processing Engineer
      • Mar 2020 - May 2021

      - Built an NLP worker engine that extracts key phrases from thousands of voice calls daily using unsupervised machine learning techniques. - Converted Voice calls to text using Google Speech to Text. - Implemented unsupervised keyphrase extraction algorithms like Topicrank, Textrank, Yake, Autophrase. - Extracted keyphrases from textual calls which are then passed via ensembles, and various pre and post-processing techniques to aggregate top keyphrases. - Deployed NLP worker on… Show more - Built an NLP worker engine that extracts key phrases from thousands of voice calls daily using unsupervised machine learning techniques. - Converted Voice calls to text using Google Speech to Text. - Implemented unsupervised keyphrase extraction algorithms like Topicrank, Textrank, Yake, Autophrase. - Extracted keyphrases from textual calls which are then passed via ensembles, and various pre and post-processing techniques to aggregate top keyphrases. - Deployed NLP worker on Google App Engine. - Improved cloud memory store used for performance improvement of NLP worker. - Utilized Google Cloud AutoML and AutoML tables for building keyphrase and call segmentation models on human-annotated data. - Worked on model training, evaluation, serving, and re-training automated. - Automated sentiment analysis performed using Google Natural Language API and Vader. - Sentiments generated via these algorithms were fed to BERT to train a custom sentiment analysis model. - Researched, prototyped (from research papers), built features and optimized(hyper-parameter tuning) the state-of-the-art machine learning and deep learning techniques like SVM, Logistic Regression, Random Forest regression, LSTM, CNN etc., using scikit-Learn, Keras, TensorFlow on CPU/GPU environments for student text-classification. - Utilized ELI5 and LIME for model interpretation. - Final scores/sentiments benchmarked Logistic Regression based text model. - Built dashboards for analyzing keyphrase data on Google Data Studio by connecting via BigQuery - Worked on pre-processing and data preparation for ML models done via BigQuery using complex SQL transformations and joins. - Built Rest APIs for serving ML models and integrated with MongoDB for better performance and scalability Key achievement: Deployed Call Segmentation and Sentiment Analysis models using BERT.

    • Data Engineer | Data Scientist
      • Apr 2019 - Mar 2020

      - Built ETL pipeline to ingest data from CloudSQL to BigQuery, ETL pipeline is capable of scaling to terabytes of data and runs throughout the day. - Built dashboards for analyzing keyphrase data on Google Data Studio by connecting via BigQuery

    • Morocco
    • Business Consulting and Services
    • 1 - 100 Employee
    • Software Engineer
      • Sep 2017 - Apr 2021

      - Worked productively with Product Team to understand requirements and business specifications around Portfolio Management, Analytic and Risk. - Effectively coded software changes and alterations based on specific design specifications. - Designed and developed automation framework for functional and regressing testing using Javascript, Coffeescript, Java, Selenium, Rest-Assured, Maven, Test NG, Junit, Postman. - Develop and load test data into test environments. - Designed Location… Show more - Worked productively with Product Team to understand requirements and business specifications around Portfolio Management, Analytic and Risk. - Effectively coded software changes and alterations based on specific design specifications. - Designed and developed automation framework for functional and regressing testing using Javascript, Coffeescript, Java, Selenium, Rest-Assured, Maven, Test NG, Junit, Postman. - Develop and load test data into test environments. - Designed Location Intelligence Products and other Insurance products - Extensive experience in preparing Test Strategy, Test plan, Test scenarios, Test cases, and Test scripts based on User requirements and System Requirements - Extensively Worked on Creation of Data-Driven, Modular driven and Page Object Module Frameworks Show less - Worked productively with Product Team to understand requirements and business specifications around Portfolio Management, Analytic and Risk. - Effectively coded software changes and alterations based on specific design specifications. - Designed and developed automation framework for functional and regressing testing using Javascript, Coffeescript, Java, Selenium, Rest-Assured, Maven, Test NG, Junit, Postman. - Develop and load test data into test environments. - Designed Location… Show more - Worked productively with Product Team to understand requirements and business specifications around Portfolio Management, Analytic and Risk. - Effectively coded software changes and alterations based on specific design specifications. - Designed and developed automation framework for functional and regressing testing using Javascript, Coffeescript, Java, Selenium, Rest-Assured, Maven, Test NG, Junit, Postman. - Develop and load test data into test environments. - Designed Location Intelligence Products and other Insurance products - Extensive experience in preparing Test Strategy, Test plan, Test scenarios, Test cases, and Test scripts based on User requirements and System Requirements - Extensively Worked on Creation of Data-Driven, Modular driven and Page Object Module Frameworks Show less

    • Mexico
    • Automotive
    • Software Architect | CTO
      • Apr 2019 - Aug 2020

      RKAnywhere connects trainers and athletes to train harder and achieve via iOS app, Web, and Alexa. Download “RKAnywhere” on the App Store: https://apps.apple.com/us/app/rkanywhere/id1471521980 - Provide design specifications and architecture for new feature development - Collaborate with team to help define product development teams on product and technical roadmaps - Influence different aspects of the development process like API creation, design, and product. RKAnywhere connects trainers and athletes to train harder and achieve via iOS app, Web, and Alexa. Download “RKAnywhere” on the App Store: https://apps.apple.com/us/app/rkanywhere/id1471521980 - Provide design specifications and architecture for new feature development - Collaborate with team to help define product development teams on product and technical roadmaps - Influence different aspects of the development process like API creation, design, and product.

    • Software Engineer - Developer for Data Science in Elasticsearch
      • Feb 2020 - Jul 2020

      Logserver is a solution 100% based on ELK open source projects. The system is aimed to collect logs and metrics from IT infrastructure. The exact use case may differ and fully depend on customer expectations. The goal is to forecast the values depending on the time series selected, to build a mechanism for predictive calculations. - Developed a python mechanism based on Sklearn that will calculate predictions for numeric data stored in the Elasticsearch index, and also process historical… Show more Logserver is a solution 100% based on ELK open source projects. The system is aimed to collect logs and metrics from IT infrastructure. The exact use case may differ and fully depend on customer expectations. The goal is to forecast the values depending on the time series selected, to build a mechanism for predictive calculations. - Developed a python mechanism based on Sklearn that will calculate predictions for numeric data stored in the Elasticsearch index, and also process historical value and return future estimation. - There are multiple data sources connected to the system. Those sources transmit a massive amount of values per metric per host. - Implemented find saved search exported from Kibana. Manage the data with the Elasticsearch DSL query. - Processed Time Series Data: resampling, filtering - Implemented Prediction models as input parameter read number of prediction to be returned (one sample per timeframe): RNN(LSTM), ANN(MultiLayerPerceptrons), ARIMA(AutoRegressiveIntegratedMovingAverage). Time series prediction: - Big data and ElasticSearch - Data analysis and Prediction - The time series prediction algorithms based on deep learning AI approaches CNN: - Approximate nonLinear functions - Robust to noise - Handle multi variable inputs - Handle multi-step forecast - Feature learning (Limitation: Cannot learn temporal dependence) RNN - benefits of CNN - Temporal dependencies Libraries: Pandas, Sklearn, Keras, Tensorflow, Matplotlib, Numpy Show less Logserver is a solution 100% based on ELK open source projects. The system is aimed to collect logs and metrics from IT infrastructure. The exact use case may differ and fully depend on customer expectations. The goal is to forecast the values depending on the time series selected, to build a mechanism for predictive calculations. - Developed a python mechanism based on Sklearn that will calculate predictions for numeric data stored in the Elasticsearch index, and also process historical… Show more Logserver is a solution 100% based on ELK open source projects. The system is aimed to collect logs and metrics from IT infrastructure. The exact use case may differ and fully depend on customer expectations. The goal is to forecast the values depending on the time series selected, to build a mechanism for predictive calculations. - Developed a python mechanism based on Sklearn that will calculate predictions for numeric data stored in the Elasticsearch index, and also process historical value and return future estimation. - There are multiple data sources connected to the system. Those sources transmit a massive amount of values per metric per host. - Implemented find saved search exported from Kibana. Manage the data with the Elasticsearch DSL query. - Processed Time Series Data: resampling, filtering - Implemented Prediction models as input parameter read number of prediction to be returned (one sample per timeframe): RNN(LSTM), ANN(MultiLayerPerceptrons), ARIMA(AutoRegressiveIntegratedMovingAverage). Time series prediction: - Big data and ElasticSearch - Data analysis and Prediction - The time series prediction algorithms based on deep learning AI approaches CNN: - Approximate nonLinear functions - Robust to noise - Handle multi variable inputs - Handle multi-step forecast - Feature learning (Limitation: Cannot learn temporal dependence) RNN - benefits of CNN - Temporal dependencies Libraries: Pandas, Sklearn, Keras, Tensorflow, Matplotlib, Numpy Show less

    • Consumer Goods Rental
    • Software Product Management Intern
      • Jun 2017 - Sep 2017

      - Building machine learning-based models for data analysis and visualization on spark and zeppelin. - Resourcefully performed multiple statistical analyses on the company’s datasets and based on their results proposed important changes to the data-processing pipeline. - Executed analysis using R, Python, and PySpark in zeppelin. - Building machine learning-based models for data analysis and visualization on spark and zeppelin. - Resourcefully performed multiple statistical analyses on the company’s datasets and based on their results proposed important changes to the data-processing pipeline. - Executed analysis using R, Python, and PySpark in zeppelin.

    • United Kingdom
    • Insurance
    • Software Design Engineer
      • Jun 2017 - Aug 2017

      • Reduced the number of fraudulent activities by 80% by analyzing and reporting customer behavior patterns. • Designed and implemented programs to detect and prevent fraudulent activities, resulting in a 2% reduction in number of incidents of fraud. • Analyzed how to identify data by breaking it into separate parts, resulting in the ability to analyze data by identifying the underlying principles, reasons, or facts of information. • Implemented a scalable low-latency architecture for a… Show more • Reduced the number of fraudulent activities by 80% by analyzing and reporting customer behavior patterns. • Designed and implemented programs to detect and prevent fraudulent activities, resulting in a 2% reduction in number of incidents of fraud. • Analyzed how to identify data by breaking it into separate parts, resulting in the ability to analyze data by identifying the underlying principles, reasons, or facts of information. • Implemented a scalable low-latency architecture for a data processing pipeline, including input data validation, data cleaning and transformation, and user interface for visualization. The architecture was developed using frameworks such as Kafka, HBase, HDFS, Flume, Spark, SparkStreaming, and Impala among many others. Show less • Reduced the number of fraudulent activities by 80% by analyzing and reporting customer behavior patterns. • Designed and implemented programs to detect and prevent fraudulent activities, resulting in a 2% reduction in number of incidents of fraud. • Analyzed how to identify data by breaking it into separate parts, resulting in the ability to analyze data by identifying the underlying principles, reasons, or facts of information. • Implemented a scalable low-latency architecture for a… Show more • Reduced the number of fraudulent activities by 80% by analyzing and reporting customer behavior patterns. • Designed and implemented programs to detect and prevent fraudulent activities, resulting in a 2% reduction in number of incidents of fraud. • Analyzed how to identify data by breaking it into separate parts, resulting in the ability to analyze data by identifying the underlying principles, reasons, or facts of information. • Implemented a scalable low-latency architecture for a data processing pipeline, including input data validation, data cleaning and transformation, and user interface for visualization. The architecture was developed using frameworks such as Kafka, HBase, HDFS, Flume, Spark, SparkStreaming, and Impala among many others. Show less

    • United States
    • Non-profit Organizations
    • 700 & Above Employee
    • Student Instructor @UC Berkeley
      • Feb 2016 - May 2017

      Taught students in building a Micromouse autonomous vehicle [Micromouse is a robotics project that involves building a small robotic car to autonomously solve a maze as quickly as possible]. Taught students in building a Micromouse autonomous vehicle [Micromouse is a robotics project that involves building a small robotic car to autonomously solve a maze as quickly as possible].

    • United States
    • Education Administration Programs
    • 1 - 100 Employee
    • CS61A Lab Assistant
      • Jan 2016 - Dec 2016

      - Assisted and tutored students with computer programming ideas in the course taught using Python, Scheme, Spark, and SQL. - Assisted a class with about 40 students in utilizing UNIX-based computers for CS61A assignments, projects, and labs.

    • Undergraduate Student Researcher
      • Aug 2015 - Aug 2016

      -Worked on Algorithms for Automation of Surgical Subtasks in Robot-Assisted Minimally Invasive Surgery -Worked on Automation of Radiation Treatment Planning and Delivery. -Created ROS objects for visualizing trajectories before executing on robot -Wrote utility for recording demonstrated videos and trajectories

    • United States
    • Higher Education
    • 1 - 100 Employee
    • Math, Physics and Chemistry Tutor
      • Aug 2011 - Aug 2014

      -Created worksheets and practice quizzes to help high school students prepare for their Math, Chemistry and Physics exams. -Help student’s improved their grade by 10-20 percent. - Assisted students in acquiring better understanding of targeted weak areas within a subject or a subject as a whole. - Successfully made three groups of grade D students acquire grade B levels in mock examinations.

    • Teacher's Assistant for General Chemistry Courses
      • Feb 2013 - Jun 2014

      - Prepared the lab for introductory chemistry courses. - Assisted with the chemistry supply inventory. - Assisted in the set up of major experiments. - Set up and conducted chemical and other experiments. - Learn how to read and utilize chromatography, spectroscopy and microscopy to test and analyze lab results. - Worked with other students and laboratory technicians in the microbiological and chemical testing to accredit standard and methods. - Graded exams, quizzes and… Show more - Prepared the lab for introductory chemistry courses. - Assisted with the chemistry supply inventory. - Assisted in the set up of major experiments. - Set up and conducted chemical and other experiments. - Learn how to read and utilize chromatography, spectroscopy and microscopy to test and analyze lab results. - Worked with other students and laboratory technicians in the microbiological and chemical testing to accredit standard and methods. - Graded exams, quizzes and test.

Education

  • University of California, Berkeley
    Bachelor of Arts - BA, Computer Science
    2012 - 2017
  • University of California, Berkeley
    Bachelor of Arts (B.A.), Cognitive Science
    2012 - 2017

Community

You need to have a working account to view this content. Click here to join now