Sofía Moretti

Actuary - Data Science

About Me

I’m an Actuary and a Data Science Master’s student with a real passion for turning data into insights that actually make a difference. I love building smart, data-driven solutions that simplify processes and help people make better decisions.

Curious by nature and driven by problem-solving, I enjoy diving into complex datasets, creating predictive models, and being part of teams that value innovation, collaboration, and never stop learning.

📄 See Resume

Projects

🤖 FCE - Chatbot

A smart and friendly chatbot to check subject prerequisites at "Facultad de Ciencias Económicas - Universidad de Buenos Aires" 🎓🇦🇷
Select your degree and type any subject name — the bot instantly shows what you need to take it
No more manual checking or missed requirements ✅🧠
This project was developed for a local university in Argentina and is currently available in Spanish.

Logic & data handling: Python (pandas)
Frontend: Streamlit + custom CSS

📊 EMPLOYEE CHURN - Predictive Modeling

A predictive modeling project focused on understanding and anticipating employee attrition based on internal HR data.

🧠 Built and compared multiple classification models (stepwise, regularized, decision tree)
📈 Evaluated performance using metrics such as Accuracy, ROC, and F1-score
🧩 Developed a dashboard to explore churn patterns and model outcomes

Modeling & logic: R (glmnet, rpart, caret)
Visualization & UI: ggplot2, Shiny
Deployment: shinyapps.io

📊 Insurance anomaly detection

This project detects atypical insurance claims using Isolation Forest, an unsupervised algorithm that scores anomalies based on behavioral patterns.

It features interactive filters, applies UMAP for dimensionality reduction and HDBSCAN for clustering. SHAP values explain the key drivers behind each flagged claim.

Logic & modeling: Python (Pandas, Scikit-learn, Seaborn), Isolation Forest, HDBSCAN, UMAP
Visualization & explainability: Matplotlib, Streamlit, SHAP
Deployment: Streamlit cloud

Languages

SQL
Python
R

Skills

Tools
  • Git & GitHub
  • VSCode
  • Power BI
AI/ML Libraries
  • (Py) Scikit-learn
  • (Py) Pandas
  • (Py) NumPy
  • (Py) Matplotlib
  • (Py) SHAP
  • (R) caret
  • (R) randomForest
  • (R) rpart
  • (R) glmnet
Core Concepts
  • Data Analysis
  • Machine Learning
  • Supervised & Unsupervised Learning
  • Web Scraping
  • Data Visualization

Contact Me