projects.

Text classification

This project explores text classification using traditional machine learning and large language models using the Google Gemini API. It classifies tweets into categories such as natural disasters or non-disaster-related content.

GitHub: text_classification


afcharts-py

The afcharts Python package helps make accessible Matplotlib and Plotly charts following Government Analysis Function data visualisation guidance. I’m a contributor to this package.

GitHub: afcharts-py


Police data

This project showcases a data pipeline that queries the UK Police API, transforms the returned JSON data, and loads it as converted CSV files into a database that can be queried.

GitHub: police_data


Local government spending

This project uses a webscraper to extract files from the internet, load the raw data into a database, transform using dbt, and provide business-logic-applied marts to end users; a medallion architecture in miniature.

GitHub: local_gov_spending