Diego Osborn headshot

Hey, I'm Diego Osborn

I am a fourth year undergraduate student studying Data Science at UC San Diego with minors in Economics and Mathematics. I am currently a Data Science Intern for UC San Diego's baseball team (Winter 2023 - Present). Previously, I have spent internships with UC San Diego's Career Center as a Data Engineering Intern (Data Analyst) (Fall 2024 - Spring 2025), and as a Summer Data Analytics Intern for Palm Springs Power Baseball (Summer 2023). My strengths lie in machine learning, statistical modeling, and probabilistic reasoning. I'm interested in any opportunities in data science/data analytics, so please feel free to reach out to me via my contact page!

Education

University of California, San Diego

B.S. Data Science; Minors in Economics and Mathematics

Organizations: Triton Ball Sports Analytics Club, Data Science Student Society

Honors: Eleanor Roosevelt College Honors Program, Provost Honors (4x)

Graduate Coursework (cross enrollment): Bayesian Inference, Hierarchical & Probabilistic Modeling

Relevant Coursework: Principles & Techniques of Data Science, Monte Carlo Methods, Relational Databases, Cloud Computing, Scalable ML

Experience

UC San Diego Baseball - Data Science Intern

Automated the end-to-end generation of opponent scouting reports using Python, reducing weekly preparation time by 90% to streamline pre-game strategy.

Designed a semi-supervised pipeline using Expectation-Maximization and Gaussian Mixture models to resolve pitch label noise, training gradient-boosted trees for robust pitch classification..

Developed a Monte Carlo simulation running 10,000+ iterations on 3 years of historical data to forecast expected win ranges across varying schedule scenarios to support strategic planning.

Built a Python data ingestion pipeline to migrate 1.1M+ rows of raw Trackman CSVs into PostgreSQL, creating a centralized repository to eliminate manual file merging.

Designed automated post-game visualization reports to track velocity, movement profiles, and location heatmaps, streamlining post-game analysis for coaches and players.

Contributed to the 2023 Big West Conference Championship title through support for analytics operations and ad hoc scouting analysis.

UC San Diego Career Center - Data Analytics Intern

Developed a reproducible Python ETL workflow integrating 2,000+ Qualtrics responses with LinkedIn data for 12,000+ graduates, establishing a single source of truth for outcome reporting.

Authored technical SOPs for version control and environment management, defining Git protocols to reduce code conflicts and standardize development for a 4-person analytics team.

Migrated 100,000+ student career Handshake profiles into a local PostgreSQL database, using SQL to ensure compliance with university data retention policies and to deliver actionable datasets.

Palm Springs Power Baseball - Data Analytics Intern

Oversaw data integrity and validation across 100+ league games, maintaining the central repository to ensure accurate reporting for the official league website.

Deployed an interactive Looker Studio dashboard to visualize 10+ KPIs from 7,400+ tracked pitch events, transforming raw data into actionable insights for 11 coaches and 260+ players.

Projects

Awards

2025 SMT Data Challenge Honorable Mention

Issued by SportsMEDIA Technology (SMT)

Recognized among top participants (out of 50 teams and 114 students) for my innovative, baseball, player-tracking, quantitative analysis project, Quantifying Defensive Aggression With a Bayesian Hierarchical Model, in an international sports analytics competition.

Nanar and Anthony Yoseloff Foundation Scholarship

Issued by Society for American Baseball Research (SABR)

I am grateful to say that I was selected to receive one of four nationwide Nanar and Anthony Yoseloff Foundation Scholarships to attend the 2025 SABR Analytics Conference.

Download Resume Here