Data science & ML
Production machine learning, statistical modeling, NLP, computer vision, and forecasting. Built and shipped end-to-end at enterprise scale.
- Python
- R
- PyTorch
- TensorFlow
- Polars
- Snowpark
A Portrait of —
The practitioner at work,
in study and in service.
Currently
Development Data & Analytics Manager,
Amgen Inc.
Practice
Founder, Epsilon DS
epsilon-ds.com ↗
Located
Los Angeles, CA & New York, NY
A data-science practitioner and software engineer building production systems for clinical trials, financial analytics, and considered consumer products.
Discipline
DS / ML / SWE
Experience
10+ years
Engagements
24+ shipped
Training
UC Berkeley MIDS

Photographed in studio, Los Angeles. Subject prefers a single-origin pour-over before reviewing pull-requests.
An informal letter on the work, in plain words rather than bullet-points and metrics.
My work sits at the seam between rigor and pragmatism. For a decade I have built systems that have to survive contact with the real world — predictive models for global financial institutions, clinical-trial analytics for one of the world's largest biotech companies, and bespoke software for small businesses that quietly run on it every day.
I am, by training, a mathematician. By practice I am an engineer who prefers the unglamorous parts of the trade — the lineage of a dataset, the way an interface concedes to its user, the version-controlled history of a decision. I trust craftsmen over frameworks, and I think data-science work is better when it is conducted like a craft.
That conviction is exercised in two places, kept deliberately separate. By day, I am on staff at Amgen — embedded in the Operational Design Analytics team, where I turn historical and operational data into the foundation for planning the next generation of clinical trials. The work is consequential, slow, and properly regulated — the opposite of headline AI work, and the more meaningful for it.
Alongside, and intentionally separate, I keep a small consulting studio under the name Epsilon DS. It exists to give ambitious teams the kind of senior, hands-on data and software work that is otherwise difficult to find — bespoke machine-learning pipelines, point-of-sale platforms, internal tools, dashboards — each engagement conducted personally, not through layers of account managers.
Whether I am tuning a neural network, drafting a migration plan, or pulling an espresso shot, I bring the same posture: a quiet preference for precision over performance, and a tolerance for the long route when the long route is the honest one.
I am part of the team that turns Amgen's historical and operational data into the foundation for planning its next generation of clinical trials. Quieter than headline AI work, but consequential in its own way.
Surface feasibility parameters and historical trial performance metrics — enrollment, screen-failure, and drop-out rates — to inform study design.
Operate predictive analytics tooling (Trial Trove, Site Trove, DQS) to advise placement and planning decisions across therapeutic areas.
Collate global site performance data and recommend geographic footprints calibrated to each study's specific requirements.
Curate historical reference data into high-confidence modeling sets; serve as a primary steward of dataset lineage and quality.
An incomplete index. Items under standing NDAs are described only by shape; full case-studies are available on request.
Production machine learning, statistical modeling, NLP, computer vision, and forecasting. Built and shipped end-to-end at enterprise scale.
Pipelines, warehouses, observability, and the unfashionable plumbing that turns models into systems people actually depend on.
Next.js applications, dashboards, internal tools, point-of-sale systems, and cross-platform mobile work.
Engagements as a fractional lead, technical reviewer, or hands-on contributor where the project deserves senior judgment.
When a team needs the work done by someone senior, Epsilon DS is what they hire. A small practice on purpose; quiet on purpose; uncompromising on purpose.
The consulting studio I operate alongside the day work. Engagements span machine learning, data engineering, and bespoke software — from clinical pipelines to point-of-sale platforms.
The studio is small on purpose. It exists to give ambitious teams the kind of senior, hands-on data and software work that is otherwise difficult to find. Every engagement is conducted by the principal, not by a layer of account managers and offshore subcontractors. When the work calls for extra hands, those hands are vetted personally.
Established
2015
Principal
Sam S. Won
Disciplines
Data Science · Software · Advisory
Engagement size
Discovery → Delivery
Studio
Los Angeles & New York

2021 — 2023
Master of Information & Data Science — Berkeley, CA
Rigorous, interdisciplinary program spanning machine learning, statistical inference, and information systems — taught largely through real datasets and industry partnerships.
Capstone Project
A World for Every Child ↗
Machine learning to support early childhood-cancer diagnosis in under-resourced regions.

2011 — 2015
Bachelor of Science, Mathematics — Montclair, NJ
Foundations in analysis, algebra, statistics, and computational methods — the mathematical undercarriage on which the rest of the practice rests.