| Home | Projects | Links |
|---|
Welcome to my personal webpage
Used: C++, Git, R: RStudio, Rcpp, ggplot2, dplyr, tidyr
Tleco stands for both in the fire and rise in the nahuatl language.Paramo.Used: Rust, HDF5, Git, Python: Numpy, Matplotlib, Scipy
TlecoOpen Source Code for Radiative Transfer Simulations in Relativistic Astrophysics
Key Achievement: Achieved 24× speedup (2 min → 5 sec) through algorithmic optimization and OpenMP parallelization
Used: Fortran, HDF5, OpenMP, MPI, Git, Python: NumPy, Pandas, Matplotlib, SciPy
ParamoFundamental ML algorithms implemented from scratch to demonstrate deep understanding of underlying mathematics:
| Algorithm | Description |
|---|---|
| Neural Network | Feedforward NN solving XOR problem with backpropagation |
| Decision Tree | Classification tree with Gini impurity |
| Random Forest | Ensemble with bootstrap aggregating |
| Gradient Boosting | Sequential ensemble with residual fitting |
| Movie Recommender | Collaborative filtering (user-based, item-based, matrix factorization) |
Used: Python: NumPy, Matplotlib, Scikit-learn
Applied data science projects completed through DataCamp courses:
| Project | Topic | Description |
|---|---|---|
| Avocado Toast Analysis | Exploratory Data Analysis | Price and trend analysis of avocado-related datasets with visualization |
| Nobel Prizes | Historical Data Analysis | Examining Nobel Prize award data, winners, and trends over time |
| Oldest Businesses | Business & History | World’s oldest businesses: longevity, characteristics, and geographic distribution |
| Sleep Data Analysis | Health & Predictive Modeling | Sleep disorder prediction and health metrics analysis |
Used: Python: Pandas, NumPy, Matplotlib, Seaborn
| Project | Competition / Dataset | Description |
|---|---|---|
| Titanic — A Study of a Shipwreck | Titanic: Machine Learning from Disaster | Binary classification to predict passenger survival; female survival rate 74.2% vs. male 18.9% |
| Telco Customer Churn | Telco Customer Churn | Predict which telecom customers are likely to churn; EDA + model comparison across 7 classifiers on 7K records |
Used: Python: Pandas, Scikit-learn (Logistic Regression, KNN, Decision Tree, Random Forest, Gradient Boosting, SVM), XGBoost
Reusable Jupyter notebook templates for standard data science workflows, available in the Templates directory:
| Template | Purpose |
|---|---|
| Data Cleaning | Data quality checks and cleaning workflows |
| Data Querying | Data retrieval and querying operations |
| EDA | Exploratory data analysis and visualization |
| ML Preprocessing | Feature engineering and preprocessing for ML models |
I have a GitHub repository for my tonalpowalli (the nawah count of the days) project (in progress).
I have a collection of GitHubGists that you can explore.