Skip to content
View leonardoalva98's full-sized avatar
🏋️
probably lifting
🏋️
probably lifting

Block or report leonardoalva98

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
leonardoalva98/README.md

👋

I'm Leo — a Data Science graduate with a background in Industrial Engineering from Peru 🇵🇪. I build data pipelines, dashboards, and predictive models to turn messy data into clear insights. 💻 Open to data analyst / BI opportunities in the Salt Lake City / South Jordan, UT area.

What I work with

  • Languages: Python, SQL, R
  • BI & Visualization: Power BI, Tableau, Excel
  • Cloud & Data: AWS (S3, Lambda, Glue), ETL pipelines, Snowflake, dbt (learning)
  • ML: Scikit-Learn, XGBoost, SHAP

Projects

  • 🏥 Sepsis Early Warning System — ETL pipeline on 790K+ records, XGBoost model predicting sepsis onset 23 hours early (0.885 AUC, 78% early detection rate)
  • 🧪 Healthcare Adverse Events Pipeline — End-to-end ETL pipeline extracting 51K FDA drug adverse event reports from the OpenFDA API, loading into Snowflake via idempotent MERGE statements; SQL analysis surfaced fentanyl and alcohol as the highest death-rate drugs
  • 🤖 FDA Drug Safety Chatbot — Production text-to-SQL chatbot using LLaMA 3.3-70B via Groq API; translates natural language questions into SQL, queries Snowflake live, and returns plain-English answers deployed via FastAPI on Render
  • 📈 S&P 500 Price Direction Predictor — Reproducible analytics pipeline integrating 2 APIs, feature engineering across 5 time horizons, Random Forest classifier achieving 58% precision and 59% recall on next-day direction
  • 🚗 BMW Value Retention Analysis — Comparative regression analysis on used car depreciation across 8 model lines and 3 engine sizes, quantifying price decline per 1,000 miles driven

check them out! Portfolio

About me

  • 🌐 Bilingual: English & Spanish (native)
  • 🎓 4.0 GPA | BYU-Idaho Academic Scholarship (Full Tuition)
  • 🚗 Car enthusiast who also happens to analyze car data
  • 📬 Reach me: [email protected] | LinkedIn | Portfolio

Pinned Loading

  1. Personal-Portfolio Personal-Portfolio Public archive

    Forked from codewithsadee/vcard-personal-portfolio

    CSS