Posts by Collection

portfolio

publications

AstronomicAL: an interactive dashboard for visualisation, integration and classification of data with Active Learning

Published in Journal of Open Source Software, 2021

AstronomicAL is a human-in-the-loop interactive labelling and training dashboard that allows users to create reliable datasets and robust classifiers using active learning. This technique prioritises data that offer high information gain, leading to improved performance using substantially less data. The system allows users to visualise and integrate data from different sources and to deal with incorrect or missing labels and imbalanced class sizes. AstronomicAL enables experts to visualise domain-specific plots and key information, drawn from a variety of data sources, relating both to the broader context and to the details of a point of interest, ensuring reliable labels. In addition, AstronomicAL provides functionality to explore all aspects of the training process, including custom models and query strategies. This makes the software a tool for experimenting with both domain-specific classifications and more general-purpose machine learning strategies. We illustrate the system with an astronomical dataset due to the field’s immediate need; however, AstronomicAL has been designed for datasets from any discipline. Finally, by exporting a simple configuration file, entire layouts, models, and assigned labels can be shared with the community. This allows for complete transparency and makes reproducing results straightforward.


talks

Presentation on MLOps and Kubeflow


Lecture on MLOps and Kubeflow presented to first-year Interactive AI CDT students on the Interactive AI Team Project unit. This talk aims to give an accessible introduction to the need for MLOps by drawing parallels with the development of DevOps. By looking at the effect of increasing scale (team size, customer base or project size), students will see how their current development practices may not be sustainable when applied to ML projects at industry scale. Finally, there is a brief overview of Kubeflow and its main components, showing why they are useful and how they can potentially address these issues.

Computer Science, AI & Me


Webinar presented to over 100 sixth-form students. The presentation began by telling the students about my academic life and how I went from a widening participation background to studying for a PhD at Bristol. This led to an introduction to what Computer Science is (and is not) like at university. The final part gave the students the (very brief) foundations of what Machine Learning and AI really are. Unfortunately, the adoption of these tools has led to a great deal of exaggeration and overuse of certain buzzwords throughout the industry, making it seem like companies are doing highly complicated and ground-breaking things when most of the time they’re doing nothing more than the Maths the students use in their A-Level studies.

teaching