I'm Leo — a Data Science graduate with a background in Industrial Engineering from Peru 🇵🇪. I build data pipelines, dashboards, and predictive models to turn messy data into clear insights. 💻 Open to data analyst / BI opportunities in the Salt Lake City / South Jordan, UT area.
- Languages: Python, SQL, R
- BI & Visualization: Power BI, Tableau, Excel
- Cloud & Data: AWS (S3, Lambda, Glue), ETL pipelines, Snowflake, dbt (learning)
- ML: Scikit-Learn, XGBoost, SHAP
- 🏥 Sepsis Early Warning System — ETL pipeline on 790K+ records, XGBoost model predicting sepsis onset 23 hours early (0.885 AUC, 78% early detection rate)
- 🧪 Healthcare Adverse Events Pipeline — End-to-end ETL pipeline extracting 51K FDA drug adverse event reports from the OpenFDA API, loading into Snowflake via idempotent MERGE statements; SQL analysis surfaced fentanyl and alcohol as the highest death-rate drugs
- 🤖 FDA Drug Safety Chatbot — Production text-to-SQL chatbot using LLaMA 3.3-70B via Groq API; translates natural language questions into SQL, queries Snowflake live, and returns plain-English answers deployed via FastAPI on Render
- 📈 S&P 500 Price Direction Predictor — Reproducible analytics pipeline integrating 2 APIs, feature engineering across 5 time horizons, Random Forest classifier achieving 58% precision and 59% recall on next-day direction
- 🚗 BMW Value Retention Analysis — Comparative regression analysis on used car depreciation across 8 model lines and 3 engine sizes, quantifying price decline per 1,000 miles driven
check them out! Portfolio
- 🌐 Bilingual: English & Spanish (native)
- 🎓 4.0 GPA | BYU-Idaho Academic Scholarship (Full Tuition)
- 🚗 Car enthusiast who also happens to analyze car data
- 📬 Reach me: [email protected] | LinkedIn | Portfolio