2025 Summer Intern – Data Analytics and Visualization (Early Clinical Development)
Job location
South San Francisco, California, United States of America
Type
Full Time
Responsibilities:
Analyze clinical trial and real-world data to generate actionable insights and improve decision-making.
Design, develop, and implement analytical workflows and visualizations using tools such as R, Python, SAS, and Spotfire.
Contribute to data quality improvement initiatives and data science efforts, including statistical analysis and advanced analytics.
Develop and implement advanced text preprocessing techniques, and apply string similarity algorithms and fuzzy matching to standardize and reconcile inconsistent text data.
Manage and query large datasets in relational databases using SQL.
Participate in team brainstorming sessions to identify opportunities for improving data analytics and visualization capabilities.
Requirements & Skills:
Must be pursuing or have attained a Master’s degree or PhD.
Proficiency in programming languages such as R, Python, SAS, and SQL.
Familiarity with data visualization tools (e.g., Spotfire, ggplot2, Matplotlib, Plotly).
Knowledge of natural language processing techniques and libraries.
Database management skills for handling and querying large datasets.
Strong analytical and problem-solving capabilities.
Ability to work both independently and collaboratively in a team environment.
Excellent communication, collaboration, and interpersonal skills.
Complements our culture and the standards that guide our daily behavior & decisions: Integrity, Courage, and Passion.
Understanding of early-phase clinical trial data and operational workflows.
Experience with advanced statistical methods and calculations.
Familiarity with address parsing and geospatial tools (e.g., geopy, usaddress) for location standardization.
Knowledge of Git or other version control systems.