Maziyar P.
Big Data Project Manager at Institut des Systèmes Complexes - Paris Île-de-France (UPS 3611 / CNRS)- Claim this Profile
Click to upgrade to our gold package
for the full feature experience.
-
English Native or bilingual proficiency
-
Persian Native or bilingual proficiency
-
French Limited working proficiency
Topline Score
Bio
Credentials
-
Apache Spark 3 - Databricks Certified Associate Developer
UdemyDec, 2021- Nov, 2024 -
Big Data Analysis with Scala and Spark
CourseraOct, 2021- Nov, 2024 -
Functional Programming Principles in Scala
CourseraOct, 2021- Nov, 2024 -
Analyzing Big Data with Hive
LinkedInMar, 2019- Nov, 2024 -
Hadoop for Data Science Tips, Tricks, & Techniques
LinkedInDec, 2018- Nov, 2024 -
Advanced NoSQL for Data Science
LinkedInFeb, 2018- Nov, 2024 -
Big Data Foundations: Techniques and Concepts
LinkedInFeb, 2018- Nov, 2024 -
Learning Redis
LinkedInFeb, 2018- Nov, 2024 -
NoSQL for SQL Professionals
LinkedInFeb, 2018- Nov, 2024 -
AWS Essentials Training
Amazon Web ServicesAug, 2014- Nov, 2024 -
Architecting on AWS Training
Amazon Web ServicesAug, 2014- Nov, 2024 -
M102: MongoDB for DBAs
MongoDBJul, 2014- Nov, 2024 -
AWS Essentials Training
Amazon Web ServicesJun, 2014- Nov, 2024 -
AWSome Day
Amazon Web ServicesJun, 2014- Nov, 2024 -
AWS Cloud Training
Amazon Web ServicesOct, 2013- Nov, 2024
Experience
-
Institut des Systèmes Complexes - Paris Île-de-France (UPS 3611 / CNRS)
-
France
-
Research Services
-
1 - 100 Employee
-
Big Data Project Manager
-
Dec 2015 - Present
Summary: over 300 billion documents, project manager and lead engineer of a Big Data platform (Multivac) with over 140 servers (+2000 core and 320TB storage) and a Cloud-based Hadoop cluster with 280TB of HDFS: - Principal AI/ML/NLP Engineer (Distributed NLP, TensorFlow, PyTorch, ONNX, Spark, etc.) - Lead Big data Engineer (Hadoop/Spark cluster with more than 260 billion data (Cloudera), search engine clusters (Elastic Stack) with more than 7 billion documents, DBA (MongoDB) with more… Show more Summary: over 300 billion documents, project manager and lead engineer of a Big Data platform (Multivac) with over 140 servers (+2000 core and 320TB storage) and a Cloud-based Hadoop cluster with 280TB of HDFS: - Principal AI/ML/NLP Engineer (Distributed NLP, TensorFlow, PyTorch, ONNX, Spark, etc.) - Lead Big data Engineer (Hadoop/Spark cluster with more than 260 billion data (Cloudera), search engine clusters (Elastic Stack) with more than 7 billion documents, DBA (MongoDB) with more than 16 billion documents, and Redis with over 500 million data, and full-stack developer) - Senior Cloud architect/engineer (AWS, Azure, OpenStack, and OpenNebula), - Project Manager of Multivac platform: https://multivacplatform.org/ - Network engineer, Linux/Windows system administrator, and information security officer (CSSI) Show less Summary: over 300 billion documents, project manager and lead engineer of a Big Data platform (Multivac) with over 140 servers (+2000 core and 320TB storage) and a Cloud-based Hadoop cluster with 280TB of HDFS: - Principal AI/ML/NLP Engineer (Distributed NLP, TensorFlow, PyTorch, ONNX, Spark, etc.) - Lead Big data Engineer (Hadoop/Spark cluster with more than 260 billion data (Cloudera), search engine clusters (Elastic Stack) with more than 7 billion documents, DBA (MongoDB) with more… Show more Summary: over 300 billion documents, project manager and lead engineer of a Big Data platform (Multivac) with over 140 servers (+2000 core and 320TB storage) and a Cloud-based Hadoop cluster with 280TB of HDFS: - Principal AI/ML/NLP Engineer (Distributed NLP, TensorFlow, PyTorch, ONNX, Spark, etc.) - Lead Big data Engineer (Hadoop/Spark cluster with more than 260 billion data (Cloudera), search engine clusters (Elastic Stack) with more than 7 billion documents, DBA (MongoDB) with more than 16 billion documents, and Redis with over 500 million data, and full-stack developer) - Senior Cloud architect/engineer (AWS, Azure, OpenStack, and OpenNebula), - Project Manager of Multivac platform: https://multivacplatform.org/ - Network engineer, Linux/Windows system administrator, and information security officer (CSSI) Show less
-
-
-
John Snow Labs
-
United States
-
IT Services and IT Consulting
-
1 - 100 Employee
-
Principal AI/ML Engineer - Senior Team Lead
-
Jan 2019 - Present
Principal AI / ML engineer and a senior Team Lead with over a decade-long experience in public research. I lead a team behind Spark NLP at John Snow Labs, one of the most widely used NLP libraries in the enterprise. I develop scalable NLP components using the latest techniques in deep learning and machine learning that includes classic ML, Language Models, Speech Recognition, and Computer Vision. I am an expert in designing, deploying, and maintaining ML and DL models in the JVM… Show more Principal AI / ML engineer and a senior Team Lead with over a decade-long experience in public research. I lead a team behind Spark NLP at John Snow Labs, one of the most widely used NLP libraries in the enterprise. I develop scalable NLP components using the latest techniques in deep learning and machine learning that includes classic ML, Language Models, Speech Recognition, and Computer Vision. I am an expert in designing, deploying, and maintaining ML and DL models in the JVM ecosystem and distributed computing engine (Apache Spark) at the production level. Spark NLP is the only open-source NLP library in production that offers state-of-the-art transformers such as BERT, CamemBERT, ALBERT, ELECTRA, XLNet, DistilBERT, RoBERTa, DeBERTa, XLM-RoBERTa, Longformer, ELMO, Universal Sentence Encoder, Google T5, MarianMT, GPT2, and Vision Transformers (ViT) not only to Python and R, but also to JVM ecosystem (Java, Scala, and Kotlin) at scale by extending Apache Spark natively. Show less Principal AI / ML engineer and a senior Team Lead with over a decade-long experience in public research. I lead a team behind Spark NLP at John Snow Labs, one of the most widely used NLP libraries in the enterprise. I develop scalable NLP components using the latest techniques in deep learning and machine learning that includes classic ML, Language Models, Speech Recognition, and Computer Vision. I am an expert in designing, deploying, and maintaining ML and DL models in the JVM… Show more Principal AI / ML engineer and a senior Team Lead with over a decade-long experience in public research. I lead a team behind Spark NLP at John Snow Labs, one of the most widely used NLP libraries in the enterprise. I develop scalable NLP components using the latest techniques in deep learning and machine learning that includes classic ML, Language Models, Speech Recognition, and Computer Vision. I am an expert in designing, deploying, and maintaining ML and DL models in the JVM ecosystem and distributed computing engine (Apache Spark) at the production level. Spark NLP is the only open-source NLP library in production that offers state-of-the-art transformers such as BERT, CamemBERT, ALBERT, ELECTRA, XLNet, DistilBERT, RoBERTa, DeBERTa, XLM-RoBERTa, Longformer, ELMO, Universal Sentence Encoder, Google T5, MarianMT, GPT2, and Vision Transformers (ViT) not only to Python and R, but also to JVM ecosystem (Java, Scala, and Kotlin) at scale by extending Apache Spark natively. Show less
-
-
-
CNRS
-
France
-
Research Services
-
700 & Above Employee
-
Chef de projet Infrastructure
-
Jun 2021 - Present
À ce jour, mes activités relèvent de la BAP E et correspondent à plusieurs emplois types : Chef de projet en Infrastructure (E1B42) et Chef de projet en Ingénierie logicielle (E1C43).
-
-
Administrateur des systèmes
-
Dec 2015 - May 2022
-
-
Big Data Engineer and Full-stack Developer
-
Sep 2014 - Dec 2015
-
-
-
Ecole nationale des Sciences géographiques
-
France
-
Higher Education
-
1 - 100 Employee
-
Big Data Teacher
-
Feb 2018 - Present
Big Data Platform & Data Analytics / NoSQL courses for Master students Syllabus highlights: NoSQL Course: Introduction to SQL databases, Introduction to NoSQL databases, Introduction to Key-Value Databases Introduction to Column-based Databases Introduction to Document-based Databases Introduction to Graph-based Databases NoSQL Workshop (Redis, MongoDB, Neo4j, and Elasticsearch) Big Data Course (Distributed Systems, Data Analytics, and Data… Show more Big Data Platform & Data Analytics / NoSQL courses for Master students Syllabus highlights: NoSQL Course: Introduction to SQL databases, Introduction to NoSQL databases, Introduction to Key-Value Databases Introduction to Column-based Databases Introduction to Document-based Databases Introduction to Graph-based Databases NoSQL Workshop (Redis, MongoDB, Neo4j, and Elasticsearch) Big Data Course (Distributed Systems, Data Analytics, and Data Visualizations) : Introduction to Big Data, Big Data Ethics and Security MapReduce, HDFS, and YARN Apache Hadoop Workshop (Cloudera) Apache Spark 2.2 (DataFrame, SQL, and ML) Apache Spark Workshop (Shell/Submit, IntelliJ, and Apache Zeppelin) Introduction to Machine Learning, Deep Learning, and Text Mining (Spark-NLP or Stanford CoreNLP) Big Data Analytics Workshop Show less Big Data Platform & Data Analytics / NoSQL courses for Master students Syllabus highlights: NoSQL Course: Introduction to SQL databases, Introduction to NoSQL databases, Introduction to Key-Value Databases Introduction to Column-based Databases Introduction to Document-based Databases Introduction to Graph-based Databases NoSQL Workshop (Redis, MongoDB, Neo4j, and Elasticsearch) Big Data Course (Distributed Systems, Data Analytics, and Data… Show more Big Data Platform & Data Analytics / NoSQL courses for Master students Syllabus highlights: NoSQL Course: Introduction to SQL databases, Introduction to NoSQL databases, Introduction to Key-Value Databases Introduction to Column-based Databases Introduction to Document-based Databases Introduction to Graph-based Databases NoSQL Workshop (Redis, MongoDB, Neo4j, and Elasticsearch) Big Data Course (Distributed Systems, Data Analytics, and Data Visualizations) : Introduction to Big Data, Big Data Ethics and Security MapReduce, HDFS, and YARN Apache Hadoop Workshop (Cloudera) Apache Spark 2.2 (DataFrame, SQL, and ML) Apache Spark Workshop (Shell/Submit, IntelliJ, and Apache Zeppelin) Introduction to Machine Learning, Deep Learning, and Text Mining (Spark-NLP or Stanford CoreNLP) Big Data Analytics Workshop Show less
-
-
-
University of Malaya
-
Education Management
-
700 & Above Employee
-
Associate Member
-
Sep 2014 - Sep 2015
-
-
Full stack Developer / Cloud Architect
-
Jun 2013 - Sep 2014
-
-
-
Multimedia University
-
Malaysia
-
Higher Education
-
700 & Above Employee
-
Research Officer
-
May 2011 - May 2013
-
-
-
-
IT Engineer
-
May 2010 - Jul 2010
-
-
-
IEIC Group
-
Iran
-
Web Developer
-
Feb 2007 - Jul 2010
-
-
-
Dadeh Pardazi Pouya
-
Iran
-
Network Enginner
-
Jun 2005 - Jan 2007
-
-
-
-
Freelance Programmer
-
2003 - 2005
-
-
Education
-
Multimedia University
Master's degree -
Islamic Azad University
Bachelor -
Islamic Azad University
Associate's degree -
School of engineering
High School Diploma