Text classification
This project explores text classification using traditional machine learning and large language models using the Google Gemini API. It classifies tweets into categories such as natural disasters or non-disaster-related content.
GitHub: text_classification
afcharts-py
The afcharts Python package helps make accessible Matplotlib and Plotly charts following Government Analysis Function data visualisation guidance. I’m a contributor to this package.
GitHub: afcharts-py
Police data
This project showcases a data pipeline that queries the UK Police API, transforms the returned JSON data, and loads it as converted CSV files into a database that can be queried.
GitHub: police_data
Local government spending
This project uses a webscraper to extract files from the internet, load the raw data into a database, transform using dbt, and provide business-logic-applied marts to end users; a medallion architecture in miniature.
GitHub: local_gov_spending