Ziraddin Gulumjanli

Welcome!

This website brings together projects in data science, machine learning, and modern AI systems. It is built around one main idea: analytical thinking becomes powerful when it turns into something useful. The site is still new and growing, but the direction is clear. I was not pulled into machine learning because it sounded trendy. I was pulled into it because it matches the way I already think. Here, you will find work that starts with messy data and moves toward clear insight, reliable models, and practical solutions. Some projects focus on exploration and statistical reasoning. Some focus on prediction, classification, and model performance. Others move closer to real-world systems where the goal is not just to make something work once, but to make it understandable, testable, and useful. I also plan to grow beyond written projects, with future YouTube content where I can explain concepts in a more visual and practical way.

Disclaimer: website and me is still growing. We do not pretending everything is perfect; it is about building, learning, improving, and showing the process honestly.

Thanks!

Tech Stack

Core tools and technologies I use regularly

Python

NumPy

Pandas

Scikit-learn

PyTorch

TensorFlow

Keras

FastAPI

Docker

Kubernetes

Airflow

GitHub Actions

Prometheus

Grafana

Apache Spark

AWS

GCP

Azure

Linux

Git

Latest Projects

See all →

The Ocean’s Memory: Regression prediction on Ocean Temperature Using over 60 Years of CalCOFI Environmental Data

Built a leakage-safe regression pipeline on nearly one million CalCOFI oceanographic records, using selected environmental and spatial features to predict Pacific Ocean temperature across over 60 years of observations.

predictive regression analysis mode selection

Open Code Demo

Animal Classification with Regularized Models and Compact Ensembles

Created a mini-classifier to separate zoo animals into biological groups using features such as hair, feathers, milk, eggs, fins, legs, and aquatic behavior. Regularized models and compact ensemble methods were compared to build a simple, controlled classification pipeline.

Animal Classification Logistic Regression

Open Code

Forest Cover Type Classification with XGBoost

End-to-end multiclass (seven) classification pipeline using the UCI Forest CoverType dataset with 580K+ instances and 54 features to classify forest cover classes from cartographic and environmental features.

xgboost pipeline preprocessing

Open Code Demo

Parametric and Non-Parametric statistical tests

Built a comprehensive statistical testing workflow covering both parametric and non-parametric methods. The project demonstrates how to choose and apply appropriate tests such as t-test, ANOVA, Mann-Whitney U, Kruskal-Wallis, Wilcoxon, Chi-square, Pearson, and Spearman using Python and SciPy.

parametric non-parametric

Open Code

Ensemble Learning for Penguin Species Multiclass Classification Using Morphological and Ecological Features

Built a multiclass ML pipeline to classify penguin species using body measurements and ecological features. Compared baseline and ensemble models with cross-validation and tuning to select the best model.

model selection ensemble learning algorithms

Open Code Demo

End-to-End Census Income Classification with EDA and Hyperparameter Tuning

Built an end-to-end machine learning pipeline on the Adult Census Income dataset to predict whether an individual earns above or below $50K per year. The project covers data cleaning, EDA, preprocessing, model comparison, hyperparameter tuning, and final evaluation with Gradient Boosting as the best

pandas NumPy scikit-learn

Open Code