Joshua Smith
Data Engineer at Third Sector Intelligence
Bio
Christopher Grulke
Joshua was brought in to perform straightforward ETL operations to manage and curate content in the databases supporting EPA's CompTox Chemicals Dashboard. He arrived with excellent scripting skills but limited software development experience. In the past year, he has not only become an expert in developing solutions for ETL operations using Python, but has done so while learning DRY development techniques and ensuring the reusability of his code. His expertise and efficiency grew so quickly that I soon ran out of ETL tasks that properly utilized his skill set, and I moved him into data analysis tasks (specifically QSAR) to slake his thirst for knowledge. His tenacity and perseverance in this new field, with only limited guidance from me, have been impressive. He has become well-versed in learning methods and dataset balancing techniques. He has become an avid reader of machine learning literature, bringing cutting-edge methods to bear on the common problems experienced in QSAR modeling (lack of quality training data and target variable distribution skew). I look forward to his continued improvement, as he is well on his way to becoming an effective data scientist with a specialization in life sciences. With his strong work ethic and willingness to learn, I expect that he will be very successful in any situation he finds himself in.
Experience
-
Third Sector Intelligence
-
United States
-
Software Development
-
1 - 100 Employees
-
Data Engineer
-
Mar 2022 - Present
-
-
-
Public Company Accounting Oversight Board (PCAOB)
-
United States
-
Financial Services
-
700 & Above Employees
-
Data Analyst
-
Apr 2019 - Mar 2022
• Python, SQL Server, Jupyter notebooks
• Angular, JavaScript, TypeScript
• Managing multiple interns
• Parsing and building models on unstructured data
• Automation and web scraping
• Creation of entire ETL pipelines
• Natural Language Processing (classification, sentiment, and intent analysis)
-
-
-
US Environmental Protection Agency (EPA)
-
United States
-
Government Administration
-
700 & Above Employees
-
Data Analyst and Machine Learning
-
Nov 2017 - Mar 2019
• Working professionally with SQL and Python
• All steps of data analysis, including gathering, formatting, and transforming data, and developing machine learning models using TensorFlow and scikit-learn in Python
• Developed QSAR (machine learning) models from scratch, including improvements on physical property prediction methods
• MongoDB: complex queries, comparisons, data structures
• Writing and maintaining Python scripts, using an ORM to create and manage various SQL databases
• Efficient querying, transformation, and visualization of large data sets
• Web scraping and API interaction, including complex queries and correctly parsing the requested information for use in QSAR models
-
-
-
Washington State University
-
United States
-
Higher Education
-
700 & Above Employees
-
Research Assistant
-
Jan 2016 - May 2017
Worked on the development of an amine-functionalized, silica-shell gold nanoparticle that catalyzed the addition of amine groups selectively at the second carbon in alkyne chains. The position consisted mostly of wet chemistry but also exposed me to (and reinforced) various analytical techniques such as TGA, NMR, TEM, and chemical analysis, as well as basic MATLAB programming. Many hours were also spent combing through relevant research papers.
-
-
-
JJ's Fish House
-
Poulsbo, WA
-
Cook
-
2009 - 2012
During my time at JJ's I was officially a cook, but the restaurant had high turnover, a lot of no-call no-shows, and general understaffing. Because of this, I worked every aspect of the kitchen, including managing the line, cooking, dishwashing, prep cooking, serving, bussing, cleaning, fixing broken equipment, and more.
-
-
Education
-
Washington State University
Bachelor's degree, Chemical Engineering