Niccolo Pabustan
Cloud Architect at Multiple Myeloma Research Foundation - MMRF- Claim this Profile
Click to upgrade to our gold package
for the full feature experience.
Topline Score
Bio
Credentials
-
AWS Certified Solutions Architect – Associate
Amazon Web Services (AWS)Jun, 2023- Nov, 2024
Experience
-
Multiple Myeloma Research Foundation - MMRF
-
United States
-
Non-profit Organizations
-
1 - 100 Employee
-
Cloud Architect
-
Mar 2023 - Present
With cloud architecture and Bioinformatics expertise, I drive transformative solutions in precision medicine. Leveraging cutting-edge technologies, I design secure, scalable cloud infrastructures for cancer genomics data analysis and collaboration.Key Responsibilities: -Designing and implementing cutting-edge cloud computing solutions, robust database architectures, and overseeing the adaption process.-Successfully launched AWS platform at the MMRF and implemented the AWS Well-Architected framework for secure and compliant data governance.-Leading large-scale data migrations, automated processing pipelines, and secure delivery frameworks in multi-cloud environments (AWS, GCP, VLAB).-Utilizing AWS-native solutions like Control Tower, CloudFormation, and CodePipeline for end-to-end Cloud DevOps frameworks, automating data ingestion, processing, and delivery. -Built end-to-end framework for MMRF CoMMpass Study [largest genomic dataset and most widely published studies in Multiple Myeloma] and integration into MMRF's VLAB platform for data sharing and research collaboration.-Developing networking and security architectures for hybrid and multi-cloud.-Leading stakeholder engagements, managing SOW, contracting, discovery, execution, UAT, & product delivery to external stakeholders & researchers.-Monitoring cloud costs, developing billing systems, and presenting financial reviews and reports to executive leadership and committees.-Collaborating with DevOps, IT, and software engineering teams, providing technical leadership and guidance as the lead tech stakeholder for MMRF in engagements.-Strategically planning and managing cloud solutions to align with organizational objectives.-Optimizing bioinformatics workflows and data analysis methodologies using Linux bash, Python, and R.-Serving as a Data Engineer and Cloud super admin in day-to-day tasks, implementing QA/processing frameworks, and coordinating scheduled data releases. Show less
-
-
Bioinformatics Programmer
-
Jul 2022 - May 2023
Large-scale multi-institutional myeloma multi-omic data harmonization and analysis pipeline development on cloud environments. Infrastructure role to achieve MMRF's strategic pillars of data access and analyses.
-
-
-
CAMP4 Therapeutics
-
United States
-
Biotechnology Research
-
1 - 100 Employee
-
Bioinformatics Co-op | Data Sciences
-
Jan 2022 - Jul 2022
Building and optimizing predictive models of gene expression using machine learning methods to identify de-risked druggable targets and improve therapeutic predictability. Cloud computing and storage using AWS services. Building and optimizing predictive models of gene expression using machine learning methods to identify de-risked druggable targets and improve therapeutic predictability. Cloud computing and storage using AWS services.
-
-
-
Keck Medicine of USC
-
United States
-
Hospitals and Health Care
-
700 & Above Employee
-
Bioinformatician/Biostatistican
-
Sep 2019 - Jan 2022
• Evaluated project needs of research group and performed bioinformatics tasks including software development/scripting, pipeline development and automation, genomic data warehousing, Anaconda package management, and high-performance research computing in Linux shell command line, R, and python programming environments. • Implemented and automated modular software pipelines and scripts in Linux shell, R and python to provide support for NGS (RNA-seq & WGBS) data analysis in human blood and immune cells, including sequence alignment from fastq to BAM, samtools data-processing, QC, quantification, differential expression analysis, empirical Bayes smoothing of counts, cluster analysis and DGE-classification of disease network pathways. • Utilized SLURM job schedule scripts and facilitated multi-job HPC automation of the RNA-seq and WGBS analysis workflow in the high-performance research computing environment while maintaining data stewardship of server-based assets (greater than 20 terabytes). Show less
-
-
-
UCLA
-
United States
-
Higher Education
-
700 & Above Employee
-
Undergraduate Bioinformatics Research Assistant
-
Sep 2018 - Jan 2019
• Evaluated poly-A tail polymorphism and the gene expression signatures on pancreatic adenocarcinoma patients Unix-based software and file-handling computing environment and sequence alignment/assembly using various data file formats (FASTQ, BAM/SAM conversions). • Extracted data from National Cancer Institute GDC (Genomic Data Commons) for pancreatic cancer patients for RNA-seq data and alignments and stored/pre-processed large files for transcriptomic data analysis on the UC cloud-cluster HPC “Hoffman2.” Show less
-
-
Education
-
University of California, Los Angeles
Bachelor of Science (B.S.), Molecular, Cell, and Developmental Biology, Minor in Bioinformatics -
Lewis University
Master of Science - MS, Data Science concentration in Computational Biology and Bioinformatics