Sebastian Quezada Rojas'
Data Science Portfolio

A Showcase of my Data Science and Bioinformatics Projects

Featured

An Explainable XGBoost Approach to Predicting Car Prices

This is an XGBoost project I undertook with the express intent of diving into some of the most important and useful tools in Explainable AI. I used a basic XGBoost regressor to predict used car prices on a publicly available dataset, and then I used SKLearn and SHAP to explore some of the most important tools we can leverage to gain explainable insights from the model.

Project

A Decision Tool for Reducing Blood Tests in Hospitalised Patients

This is a Random Forest prototype model developed to predict and classify a hospitalised patient’s next day’s blood test results from on its most up-to-date blood results history. This model is a multidisciplinary collaborative approach and was created as part of the 2022 IntelliHQ x ANZICS Healthcare Datathon in Australia where it got the 1st place at the regional level (Victoria) and the 2nd place at the national level.

Project

Differential Genetic Expression in Fetal Sheep Brain

This is a replication and an extension of a bioinformatics project that I undertook for my PhD in Neuroscience with Monash University (Australia). Here, I analyse the differential genetic expression of two regions of the developing cortical plate derived from bulk RNA Seq using R to better understand what drives folding of the cerebral cortex.
See doi: 10.1093/cercor/bhaa171 for the full published study.

Project

Short Python-PPTX Tutorial

A short python-pptx tutorial that focuses on giving a more streamlined approach to working with this library to connect Python to PowerPoint to automate presentations.

Project

Advanced Seaborn (SNS) Techniques

A short Seaborn (Python) tutorial showing some more advanced plotting and labelling techniques that are usually not commonly covered in online courses or tutorials, but are quite handy for reporting purposes.