// 003

CNN HEADLINE SCRAPER

Date
December 2025
Type
Machine Learning · Data
Stack
Python · TF-IDF · MATLAB

Developed a Python script that scrapes CNN headlines with BeautifulSoup and applies natural language processing and machine learning to identify emerging news trends.
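
The scraping step might look roughly like the sketch below. It is an illustration under stated assumptions, not the project's actual code: the CSS selector, the `fetch_html` and `extract_headlines` helper names, and the sample HTML are all hypothetical, since the real CNN markup and script are not shown here. The parsing is demonstrated on an inline HTML string so it runs without network access.

```python
from typing import List
from urllib.request import Request, urlopen

from bs4 import BeautifulSoup

# Hypothetical selector: the real CNN markup is an assumption and changes over time.
HEADLINE_SELECTOR = "span.container__headline-text"


def fetch_html(url: str, timeout: float = 10.0) -> str:
    """Download a page as text. Defined for completeness; not called in the demo."""
    request = Request(url, headers={"User-Agent": "headline-scraper-demo"})
    with urlopen(request, timeout=timeout) as response:
        return response.read().decode("utf-8", errors="replace")


def extract_headlines(html: str, selector: str = HEADLINE_SELECTOR) -> List[str]:
    """Return unique, whitespace-normalized headline strings in page order."""
    soup = BeautifulSoup(html, "html.parser")
    seen = set()
    headlines = []
    for node in soup.select(selector):
        # Collapse runs of whitespace and trim the ends.
        text = " ".join(node.get_text().split())
        if text and text not in seen:
            seen.add(text)
            headlines.append(text)
    return headlines


# Invented sample markup standing in for a downloaded page.
SAMPLE_HTML = """
<div><span class="container__headline-text">  Markets rally   after rate cut </span></div>
<div><span class="container__headline-text">Storm nears the coast</span></div>
<div><span class="container__headline-text">Storm nears the coast</span></div>
<div><span class="other">Not a headline</span></div>
"""

headlines = extract_headlines(SAMPLE_HTML)
print(headlines)
```

Separating the download from the parsing keeps the parser testable on saved pages, which helps when the site's markup changes.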

Headlines were vectorized with TF-IDF and classified with logistic regression, reaching 79% accuracy. Statistical analysis in MATLAB was used to validate the results.
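
A TF-IDF plus logistic regression classifier can be sketched with scikit-learn as follows. The library choice, the hyperparameters, the topic labels, and the toy headlines are all assumptions made for illustration. The project's real dataset, label scheme, and settings are not described here, and this toy example says nothing about the reported accuracy.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline

# Toy headlines and labels invented for illustration only.
train_headlines = [
    "Stocks fall as inflation fears grow",
    "Central bank raises interest rates again",
    "Markets rally after strong jobs report",
    "Oil prices climb on supply concerns",
    "Hurricane makes landfall on the Gulf Coast",
    "Wildfire forces thousands to evacuate",
    "Flooding closes roads after heavy rain",
    "Winter storm brings record snowfall",
]
train_labels = [
    "economy", "economy", "economy", "economy",
    "weather", "weather", "weather", "weather",
]

# TF-IDF turns each headline into a sparse weighted term vector;
# logistic regression then learns a linear decision boundary over those vectors.
model = make_pipeline(
    TfidfVectorizer(lowercase=True, stop_words="english", ngram_range=(1, 2)),
    LogisticRegression(max_iter=1000),
)
model.fit(train_headlines, train_labels)

new_headlines = ["Stocks slide as interest rates rise"]
predictions = model.predict(new_headlines)
probabilities = model.predict_proba(new_headlines)

for headline, label, probs in zip(new_headlines, predictions, probabilities):
    print(headline, "->", label, dict(zip(model.classes_, probs.round(3))))
```

Wrapping both steps in one pipeline means the vectorizer is fitted only on training data, which avoids leaking vocabulary statistics from a test split when accuracy is measured.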


Python · BeautifulSoup · TF-IDF · Logistic Regression · MATLAB · NLP