Ekaterina Zerniukova
Senior Data Analyst at Centro (Ortnec)
Bio
Credentials
-
Practical Big Data Course
BigData Team
Apr 2022 - Nov 2024
Experience
-
Centro (Ortnec)
-
Cyprus
-
IT Services and IT Consulting
-
1 - 100 Employees
-
Senior Data Analyst
-
Aug 2023 - Present
Languages: Python, SQL
Environment: Tableau, Jupyter Notebook
Storages: ClickHouse, MariaDB/MySQL
Libraries: ML (sklearn/scikit-learn), exploratory data analysis (Pandas, NumPy, Plotly, Seaborn, SciPy, Statsmodels)
Tools: Jira, Confluence
Skills: Big Data · Python · Tableau · Machine Learning
-
-
-
CodeXteam
-
United Arab Emirates
-
IT Services and IT Consulting
-
1 - 100 Employees
-
Business Intelligence Developer / Data Engineer
-
Jun 2022 - Present
Developed and maintained BI reports to monitor operations and support decision-making for a large US medical company with more than 300 facilities. Built a data warehouse in AWS for reliable data storage and fast manipulation of big data. Developed and extended ETL code and data pipelines for regular data collection, analysis, and interpretation from various sources.
Languages: Python, PySpark, SQL
Environment: AWS (Glue, S3, Redshift, CodeCommit, SNS), Tableau
RDBMS: PostgreSQL, AWS Redshift
Storages: Data Lake (S3), Data Warehouse (Redshift), HubSpot, SharePoint, partners' REST API, databases
Libraries: ML classification (sklearn), exploratory data analysis (Pandas, NumPy, Plotly, Seaborn, SciPy, Statsmodels), ETL (awsglue), aggregation/transformation (pyspark), multiprocessing (concurrent), REST API (requests)
Tools: Jira, Confluence
-
-
-
X5 Group
-
Russian Federation
-
Retail
-
700 & Above Employees
-
Data Analyst / Data Engineer
-
Jul 2021 - Jun 2022
Set up an ETL process with data quality analysis to collect data from different sources. Built data pipelines and designed data marts and BI reports to monitor business indicators (NPS, CSI) across a chain of 1000+ online and offline stores. Used Natural Language Processing (NLP) to analyze customer feedback, identify problems in the stores, and improve their performance.
Languages: Python, PySpark, Spark DataFrame API, SQL (HiveQL, PL/pgSQL)
NLP/ML libraries: normalization (nltk), classification (sklearn), Pandas, NumPy, SciPy
Environment: JupyterHub, PyCharm, DBeaver, DataGrip, Ambari
RDBMS/NoSQL: Greenplum, PostgreSQL, MongoDB, Hive
Storages: Data Lake (Hadoop), Enterprise Data Warehouse (EDW), xcloud, databases
Dataflow: REST API, Kafka, SAP ERP
Structured formats: Parquet, ORC, Hive tables, CSV, JSON
Data Pipeline: Airflow, Ataccama
Monitoring: Tez, Grafana, Spark History Server, Hadoop YARN
Tools: GitLab, Jira, YouTrack, Confluence
-
-
-
VTB
-
Russian Federation
-
Banking
-
700 & Above Employees
-
Data Engineer
-
Jan 2021 - Jul 2021
Developed aggregated data marts and set up data quality checks for analysis and monitoring. Refactored legacy SQL code and optimized it with the Spark DataFrame API to increase data processing speed.
Languages: Python, Spark DataFrame API, SQL
Modules: PySpark, Pandas, NumPy, SciPy
Environment: JupyterHub, PyCharm, DBeaver, Hue
DBMS: Oracle, Impala/Hive
Storages: Data Lake (Hadoop), Data Warehouse (DWH)
Tools: Git (BitBucket), Jira, Confluence
-
-
-
Airbnb
-
United States
-
Software Development
-
700 & Above Employees
-
Independent Investigation
-
Oct 2020 - Dec 2020
Anomaly and correlation detection in Airbnb public data.
Languages: Python, SQL
Modules: Pandas, NumPy, Matplotlib, Seaborn, SciPy, Statsmodels
Environment: Jupyter Notebook, PyCharm, DBeaver, Linux
RDBMS: PostgreSQL, MySQL, SQLite
VCS: Git (GitHub)
-
-
-
Rosneft
-
Oil and Gas
-
700 & Above Employees
-
Financial Analyst
-
Jan 2015 - Oct 2020
Analyzed financial indicators to support financial planning, pricing, and contracting. Progressed from specialist to team leader.
Environment: SAP ERP, Power Pivot
-
-
-
bp
-
United Kingdom
-
Oil and Gas
-
700 & Above Employees
-
Contracting Group Analyst
-
Apr 2013 - Jan 2015
Supported reorganization procedures (analysis of the company's property on the balance sheet, assessment of assets, preparation of documents for the deal).
Environment: SAP ERP, Power Pivot
-
-
-
Nokia Siemens Networks
-
Telecommunications
-
1 - 100 Employees
-
Junior Financial Controller
-
Jul 2011 - Jan 2013
Prepared contracts and checked accounting documents. Calculated accruals and monitored payments.
Environment: SAP ERP, Power Pivot
-
-
Education
-
Moscow State Technological University "Stankin"
Bachelor's degree, Strategic financial management