Marco Vieto Vega
Data Scientist
Transforming data into insights and building solutions that make a difference
Featured Projects
Explore my latest data science and analytics projects
![Project image for Senticheck: Bluesky Data Collection and Analysis [WIP]](/portfolio/assets/img/project6.jpg)
Senticheck: Bluesky Data Collection and Analysis [WIP]
Bluesky data collection and sentiment analysis project using Python, Airflow, and NLP. Currently in progress.

Forest Cover Type Classification
Predicting forest cover types using cartographic variables and machine learning with Apache Spark (PySpark).
Latest Blog Posts
Insights and thoughts on data science and technology

Freezing in Airflow, Flowing with FastAPI
A personal reflection on debugging, redesign, and progress. This post shares how a stubborn Airflow issue led to a cleaner architecture using FastAPI, a working sentiment dashboard, and a refreshed portfolio site.

Bronze, Silver, Gold: Organising SentiCheck’s Data
A look into Medallion Architecture and how SentiCheck applies the Bronze, Silver, and soon Gold layers to organise data, automate Apache Airflow workflows, and prepare it for NLP analysis.

Building Senticheck: Collecting Data, Cleaning Text, and Seeing Progress
In this post, I share how I pulled real posts from Bluesky, cleaned the text, stored everything in a database, and made progress on my portfolio site.