Maria Obedkova

Senior NLP Engineer at TrustYou
Contact Information
Location
Berlin, Germany
Languages
  • Russian: Native or bilingual proficiency
  • English: Full professional proficiency
  • German: Limited working proficiency
  • French: Elementary proficiency

Rating: 5.0 / 5.0, based on 2 ratings

Ivan Bilan

Maria is an outstanding NLP Engineer and Data Scientist. She has an in-depth understanding of the latest NLP approaches and can apply them to actual industry problems. Specifically, Maria has an expert-level understanding of Transformer-based neural networks. In addition, Maria is proficient with Python and PyTorch. She has also worked extensively with PySpark and other big data frameworks. As a colleague, Maria was always a team player and put great emphasis on knowledge sharing and collaboration. Moreover, Maria always strove to become a better engineer and team member by consistently seeking feedback from her peers. Furthermore, Maria showed great skill in managing various team projects and being accountable for multiple team initiatives. For instance, she supervised thesis students as well as interns in our NLP team. She will be an invaluable asset to any software engineering team.

Wilhelm Hagg

Maria did her master's thesis, "Data-Driven Pronunciation Generation for ASR", at our company, Sony Europe B.V., from 01.02.2019 until 31.07.2019. During this time I acted as her supervisor within the company. In ASR systems, dictionaries are used to describe the pronunciations of all words of a language. The dictionary translates each word into a sequence of phonemes. Such dictionaries are typically hand-crafted by an expert linguist. In this thesis we aimed at a data-driven approach for generating pronunciations for a dictionary. Pronunciation generation from data is considered quite a difficult problem, and hand-crafted dictionaries are still state of the art. During her thesis Maria implemented different methods for generating pronunciations for words from existing audio samples. In particular, different pronunciation clustering methods were researched and compared with existing baselines. In this context she used neural networks to learn phoneme characteristics. Maria has a very good linguistic and statistical background, which was required for the project. She also showed the good programming skills required to solve the given tasks. She did a Python implementation of all the methods researched, basically from scratch. She also dealt with the complexity of the related Kaldi ASR toolkit in order to access the data and models required for her work. She always had a very structured and disciplined work style. Due to the difficulty of the task we had many discussions, and she always had very creative ideas for solving the problems. She is a highly motivated researcher and always moves directly towards her goals. She has very good communication skills and it was a pleasure to work with her.


Credentials

  • Introduction to Machine Learning (Введение в машинное обучение)
    Coursera Course Certificates
    Mar 2016 - Sep 2024

Experience

    • Germany
    • Hospitality
    • 100 - 200 Employee
    • Senior NLP Engineer
      • Sep 2022 - Present

      - Improving a Transformer-based solution for ABSA and scaling it up
      - Researching approaches for multilingual ABSA
      - Defining system designs for ML and DL applications

    • NLP Engineer
      • Apr 2020 - Sep 2022

      - Developed a Transformer-based solution for ABSA and put it in production
      - Researched different approaches to ABSA and performed various Data Analysis tasks
      - Supported and maintained the legacy system that performs Sentiment Analysis with the help of CFG

    • Entertainment Providers
    • 700 & Above Employee
    • ASR Research Intern
      • Feb 2019 - Jul 2019

      - Researched different Deep Learning approaches to pronunciation generation for Speech Recognition
      - Investigated Acoustic Word Embeddings and improved their quality for a pronunciation discrimination task
      - Developed a completely new data-driven method of pronunciation generation for ASR purposes

    • India
    • Education Administration Programs
    • NLP Research Engineer
      • Oct 2016 - Dec 2018

      - Implemented the morphological-syntactical pipeline and improved its quality
      - Developed the anaphora resolution module for news texts using a Machine Learning approach
      - Supervised linguists and coordinated the interaction of linguists and programmers in a team

    • Computational linguist
      • Feb 2017 - Sep 2017

      - Developed various solutions for Fact Extraction using ABBYY tools
      - Implemented unit testing for the Fact Extraction system

    • Russian Federation
    • Higher Education
    • 700 & Above Employee
    • Developer
      • Sep 2016 - Jun 2017

      Developed and maintained the corpus of 19th-century Russian-language texts (http://www.web-corpora.net/19thcentury).

    • United States
    • IT Services and IT Consulting
    • 700 & Above Employee
    • Intern
      • Jan 2016 - Jun 2016

      Developed an advanced tokenizer for Russian corpora in the Geekrya project.

Education

  • Univerzita Karlova v Praze
    Master's degree, Language and Communication Technology
    2017 - 2019
  • UPV/EHU
    Master's degree, Language and Communication Technology
    2017 - 2019
  • Higher School of Economics
    Bachelor's degree, Computational linguistics
    2013 - 2017
