Shuyan Zhang

Senior Statistical Programmer at SynteractHCR
Contact Information
us****@****om
(386) 825-5501
Location
San Diego, California, United States
Languages
  • English
  • Chinese

Credentials

  • SAS 9 Base Statistical Programming Certification
    SAS
    Apr 2011 - Nov 2024

Experience

    • United States
    • Biotechnology Research
    • 100 - 200 Employee
    • Senior Statistical Programmer
      • Nov 2013 - Present

      (1) Project lead of Phase III clinical trial studies. (2) Create CDISC SDTM and ADaM datasets and specifications. (3) Create tables/listings/figures for clinical trial studies. (4) Perform programming validation, tables/listings/figures QC, and/or edit checks. (5) Work with the Sponsor/statistician to write/review the SAP, including mock tables and listings.

    • United States
    • Business Consulting and Services
    • 700 & Above Employee
    • Senior Associate
      • Nov 2011 - Nov 2013

      (1) Awarded for excellence in statistical consulting and technical performance. (2) Managed the national HIV patient medical and survey data for the Centers for Disease Control and Prevention using SAS and Excel. (3) Analyzed categorical data using logistic regression and other generalized linear model techniques. (4) Analyzed complex sample survey data with design factors such as weights, clustering, and stratification. (5) Used multiple imputation to handle missing entries in the medical monitoring datasets.

    • Data Analyst
      • Jun 2009 - Jun 2010

      (1) Processed and managed large-scale environmental datasets from various sources using SAS, Excel, and R. (2) Developed multiple linear regression models to predict Enterococci densities at South Shore Beach (Wisconsin). The results are integral to a cost-benefit analysis in which the gains in model performance are weighed against the additional costs of gathering on-site data versus using only publicly available near-site meteorological data. (3) Used diagnostic statistics (PRESS, AIC, and other criterion-based procedures) to compare on-site and near-site models with regard to model fit and predictive ability. The results were integrated into the EPA VirtualBeach software (Version 2.0) and are being used by beach managers now. (4) Analyzed the specificity and sensitivity of the models using threshold analysis techniques and found the best algorithm to optimize the threshold selection.

    • Higher Education
    • 700 & Above Employee
    • Statistical Consultant
      • Jan 2009 - May 2009

      (1) Used SAS and R to analyze listeners' reaction time data and develop a multivariate predictive model. The resulting model accounted for over 93% of the variability in the data set. (2) Analyzed listeners' reaction data in various listening contexts with ANOVA, and found significant differences in the listeners' adaptability behaviors across circumstances. (3) Performed hypothesis tests and multiple comparisons on the listeners' adaptability performance, providing the clients with statistics-based conclusions.

    • Research Assistant
      • Jan 2008 - May 2009

      (1) Applied support vector machine techniques in a case-based system for spam email filtering with concept drift. (2) Generated machine learning models to pick out the significant features that can effectively recognize spam emails. The results are well supported by cross-validation. (3) Applied statistical classification techniques for spam email filtering, yielding a correct identification rate of more than 85%. (4) Developed both R and MATLAB packages for spam email filtering analysis using data mining algorithms.

Education

  • The University of Georgia
    M.S., Statistics
    2008 - 2010
  • the College of William and
    Ph.D., Physics
    2000 - 2007
  • Fudan University
    B.S., Physics
    1992 - 1997
