Home Projects Links

Jesus M Rueda-Becerril, PhD

Logo

Welcome to my personal webpage

Projects

Wind and Spores | Aug 2024 – May 2025

Used: C++, Git, R: RStudio, Rcpp, ggplot2, dplyr, tidyr

Tleco | Jan 2024 – Sep 2024

Used: Rust, HDF5, Git, Python: Numpy, Matplotlib, Scipy

Publications using Tleco

Paramo | Oct 2018 – Apr 2022

Open Source Code for Radiative Transfer Simulations in Relativistic Astrophysics

Key Achievement: Achieved 24× speedup (2 min → 5 sec) through algorithmic optimization and OpenMP parallelization

Used: Fortran, HDF5, OpenMP, MPI, Git, Python: NumPy, Pandas, Matplotlib, SciPy

Publications using Paramo


Portfolio

Machine Learning From Scratch

Fundamental ML algorithms implemented from scratch to demonstrate deep understanding of underlying mathematics:

Algorithm Description
Neural Network Feedforward NN solving XOR problem with backpropagation
Decision Tree Classification tree with Gini impurity
Random Forest Ensemble with bootstrap aggregating
Gradient Boosting Sequential ensemble with residual fitting
Movie Recommender Collaborative filtering (user-based, item-based, matrix factorization)

Used: Python: NumPy, Matplotlib, Scikit-learn

DataCamp

Applied data science projects completed through DataCamp courses:

Project Topic Description
Avocado Toast Analysis Exploratory Data Analysis Price and trend analysis of avocado-related datasets with visualization
Nobel Prizes Historical Data Analysis Examining Nobel Prize award data, winners, and trends over time
Oldest Businesses Business & History World’s oldest businesses: longevity, characteristics, and geographic distribution
Sleep Data Analysis Health & Predictive Modeling Sleep disorder prediction and health metrics analysis

Used: Python: Pandas, NumPy, Matplotlib, Seaborn

Kaggle

Project Competition / Dataset Description
Titanic — A Study of a Shipwreck Titanic: Machine Learning from Disaster Binary classification to predict passenger survival; female survival rate 74.2% vs. male 18.9%
Telco Customer Churn Telco Customer Churn Predict which telecom customers are likely to churn; EDA + model comparison across 7 classifiers on 7K records

Used: Python: Pandas, Scikit-learn (Logistic Regression, KNN, Decision Tree, Random Forest, Gradient Boosting, SVM), XGBoost

Templates

Reusable Jupyter notebook templates for standard data science workflows, available in the Templates directory:

Template Purpose
Data Cleaning Data quality checks and cleaning workflows
Data Querying Data retrieval and querying operations
EDA Exploratory data analysis and visualization
ML Preprocessing Feature engineering and preprocessing for ML models

Tonalpowalli

I have a GitHub repository for my tonalpowalli (the nawah count of the days) project (in progress).

Gists

I have a collection of GitHubGists that you can explore.