Creative AI - GenAI for Creative Expression
A presentation on state-of-the-art generative models and how GenAI is enabling new forms of creative expression, illustrated with a collection of personal and professional projects that I have worked on.
Generative AI, Midjourney, Public Speaking, Creative Technology
Hex Engine - Procedural World Generator
Procedurally generate beautiful 3D hexagonal landscapes with realistic terrain generation. Runs in the browser with ThreeJS and React.
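To give a feel for the hexagonal layout, here is a minimal sketch in plain Python rather than the project's ThreeJS code. It places pointy-top hex tiles using axial coordinates and assigns each a height. The function names, the tile size, and the sine-based height function are all assumptions made for illustration; the height function is only a stand-in for whatever terrain generation the engine actually uses.

```python
import math

def hex_disc(radius):
    """Return axial (q, r) coordinates of every tile within `radius` of the centre."""
    cells = []
    for q in range(-radius, radius + 1):
        r_min = max(-radius, -q - radius)
        r_max = min(radius, -q + radius)
        for r in range(r_min, r_max + 1):
            cells.append((q, r))
    return cells

def axial_to_world(q, r, size=1.0):
    """Convert axial coordinates to the (x, z) centre of a pointy-top hex tile."""
    x = size * math.sqrt(3.0) * (q + r / 2.0)
    z = size * 1.5 * r
    return x, z

def height_at(x, z, scale=0.35, amplitude=2.0):
    """Hypothetical stand-in height function: smooth, deterministic, in [-amplitude, amplitude]."""
    return amplitude * 0.5 * (math.sin(x * scale) + math.cos(z * scale))

# Build a small map: one (x, y, z) position per tile, with y as the terrain height.
tiles = []
for q, r in hex_disc(2):
    x, z = axial_to_world(q, r)
    tiles.append((x, height_at(x, z), z))
```

Axial coordinates keep neighbour lookups simple (each tile has six neighbours at fixed offsets), which is one common reason to prefer them over row/column offsets for hex maps.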
NextJS, ReactJS, ThreeJS
DDPM Study - Denoising Diffusion Models
Implementation of a Denoising Diffusion Probabilistic Model (DDPM) for generating MNIST handwritten digits, created as a self-study exercise to understand diffusion models through hands-on practice.
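As a rough illustration of the forward (noising) half of a DDPM, here is a minimal sketch in plain Python. It is not the project's PyTorch code: the linear schedule values (1e-4 to 0.02 over 1000 steps), the function names, and the random stand-in image are assumptions for illustration. It uses the closed form x_t = sqrt(alpha_bar_t) * x_0 + sqrt(1 - alpha_bar_t) * noise, which lets a training loop jump straight to any timestep.

```python
import math
import random

def linear_beta_schedule(num_steps, beta_start=1e-4, beta_end=0.02):
    """Noise variances beta_t, linearly spaced between beta_start and beta_end."""
    step = (beta_end - beta_start) / (num_steps - 1)
    return [beta_start + i * step for i in range(num_steps)]

def cumulative_alphas(betas):
    """alpha_bar_t = product of (1 - beta_s) for all s up to and including t."""
    alpha_bars, running = [], 1.0
    for beta in betas:
        running *= 1.0 - beta
        alpha_bars.append(running)
    return alpha_bars

def q_sample(x0, t, alpha_bars, rng):
    """Sample a noised image x_t directly from x_0; also return the noise used."""
    signal = math.sqrt(alpha_bars[t])
    sigma = math.sqrt(1.0 - alpha_bars[t])
    noise = [rng.gauss(0.0, 1.0) for _ in x0]
    xt = [signal * x + sigma * n for x, n in zip(x0, noise)]
    return xt, noise

betas = linear_beta_schedule(1000)
alpha_bars = cumulative_alphas(betas)

rng = random.Random(0)
# Stand-in for a flattened 28x28 MNIST digit scaled to [-1, 1].
x0 = [rng.uniform(-1.0, 1.0) for _ in range(28 * 28)]
xt, noise = q_sample(x0, 500, alpha_bars, rng)
```

In a full DDPM, a network would be trained to predict `noise` from `xt` and `t`; that part is omitted here.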
Python, PyTorch, Diffusion Models, Generative AI, Generative Diffusion
Vox Populi - AI Hackathon (🥉 Winner)
Developed a 3D sandbox game enabling dynamic speech interaction with AI agents, built in 24 hours. Built a custom game engine in ThreeJS to support autonomous AI agents and voice-driven gameplay.
NextJS, ReactJS, ThreeJS, ElevenLabs Conversational AI, ElevenLabs TTS API, GPT-4o-mini API
LlamaXpat - Interactive Career Atlas | Meta Hackathon
Created an AI-powered career exploration platform combining interactive maps with LLM-driven country insights, built within a 24-hour hackathon. Implemented the Llama-2.1 model through Groq for fast, contextual career advice based on the selected location.
NextJS, ReactJS, MapBox, Llama-2.1, Groq API
"Guess Who" - Conversational AI Guessing Game (🥇 Winner)
Reimagined the classic "Guess Who" game using Speechmatics' conversational AI Flow API. Built interactive web interface with real-time speech recognition and character identification. Engineered dynamic game logic combining speech recognition with AI character reasoning.
NextJS, ReactJS, Speechmatics FLOW API, GPT-4o-mini
AI Generated Music Video
One of the first fully AI-directed music videos created just days after public release of txt2vid technologies. Produced with 100% AI-generated visuals and music using cutting-edge generative models, with techniques developed for maintaining character consistency between scenes.
Midjourney (Generative Diffusion), Luma (img2vid), Suno (txt2Music), Photoshop, Premiere
Brain Tumor Categorical Classifier
Uses Convolutional Neural Networks (CNNs) to analyze MRI scans and classify them according to tumor type. Built with TensorFlow, Keras, and a pre-trained EfficientNetB0 architecture with advanced image preprocessing. Reported results: validation accuracy 63%, test accuracy 69%, F1-score 0.857.
Python, TensorFlow, Keras, OpenCV (cv2), NumPy, Pandas, Matplotlib, scikit-learn, PIL
GenAI Augmented Product Design & Marketing Campaign
Established proof-of-concept for AI-assisted product design that improved efficiency and creativity using local Stable Diffusion and Photoshop. Led campaign gathering designer feedback on AI integration and produced marketing content demonstrating AI-augmented design capabilities.
Stable Diffusion, Photoshop, auto-photoshop-stable-diffusion-plugin (Open-source), Digital Painting
Hacking my Roomba to avoid cables
We hack some electronics and use cutting-edge multi-modal models to teach my Roomba to avoid cables.
OpenCV (cv2), Arduino (C++), Hardware Hacking, Python, Replicate, LLaVa
Augmented Reality, Real-Time Sudoku Solver
Trained OCR model on MNIST dataset to recognize handwritten digits in Sudoku puzzles. Features real-time grid detection with perspective transformation, efficient backtracking solver with threading, and AR interface that overlays solutions onto the original puzzle in live camera view.
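The backtracking step can be sketched as follows. This is a minimal, single-threaded illustration, not the project's solver: the function names and the example puzzle are assumptions, and the grid detection, OCR, and threading described above are left out. Empty cells are represented as 0.

```python
def is_valid(grid, row, col, digit):
    """Check that `digit` does not already appear in the row, column, or 3x3 box."""
    if digit in grid[row]:
        return False
    if any(grid[r][col] == digit for r in range(9)):
        return False
    box_row, box_col = 3 * (row // 3), 3 * (col // 3)
    for r in range(box_row, box_row + 3):
        for c in range(box_col, box_col + 3):
            if grid[r][c] == digit:
                return False
    return True

def solve(grid):
    """Fill `grid` in place; return True if a complete solution was found."""
    for row in range(9):
        for col in range(9):
            if grid[row][col] == 0:
                for digit in range(1, 10):
                    if is_valid(grid, row, col, digit):
                        grid[row][col] = digit
                        if solve(grid):
                            return True
                        grid[row][col] = 0  # undo the guess and try the next digit
                return False  # no digit fits this cell, so backtrack
    return True  # no empty cells remain

# Example puzzle (an assumption for illustration); 0 marks an empty cell.
puzzle = [
    [5, 3, 0, 0, 7, 0, 0, 0, 0],
    [6, 0, 0, 1, 9, 5, 0, 0, 0],
    [0, 9, 8, 0, 0, 0, 0, 6, 0],
    [8, 0, 0, 0, 6, 0, 0, 0, 3],
    [4, 0, 0, 8, 0, 3, 0, 0, 1],
    [7, 0, 0, 0, 2, 0, 0, 0, 6],
    [0, 6, 0, 0, 0, 0, 2, 8, 0],
    [0, 0, 0, 4, 1, 9, 0, 0, 5],
    [0, 0, 0, 0, 8, 0, 0, 7, 9],
]
givens = [row[:] for row in puzzle]
solved = solve(puzzle)
```

In a live-camera setting, running a solver like this off the main thread keeps the video feed responsive while the search runs.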
Python, TensorFlow, OpenCV (cv2), matplotlib, NumPy, PIL
ArcanaGPT (Figma Prototype)
LLM-enabled Tarot webapp 🃏 Assets generated with Midjourney, interface designed and prototyped in Figma.
Figma, UI/UX Design, Prototyping, LLM Integration, Design Thinking
Ideas to Domains (GPT-3.5 + GoDaddy)
A webapp that uses GPT-3.5 and the GoDaddy API to generate clever domain names based on your idea description.
GPT-3.5, React, GoDaddy API, NextJS, API Integration
WhatsApp Language Extension
Augmenting WhatsApp Web with language features, specifically an auto-translate feature, with scope to add other features in the future.
Chrome Extension, JavaScript, OpenAI API, TailwindCSS
Satori - AI-Generated Meditations
Generate bespoke, fully voiced meditations using GPT-4o and AWS Polly.
GPT-4o, AWS Polly, Next.js, React, Vercel, Text-to-Speech
Versius - Surgical Robotics
My usability and clinical data science work on the next generation of surgical robotics.
Surgical Robotics, Usability Research, Data Science, Healthcare Technology
GenPersona - AI-Generated Personas
Generate the perfect user personas for your project using GPT-4o.
GPT-4o, Next.js, React, UX Research, UI Design
LinkedInfluencer.app
WebApp that uses BLIP-2 and GPT-3.5 to generate parody LinkedIn posts.
BLIP-2, GPT-3.5, Next.js, React, Computer Vision, UX Design
Wordle Bot
Using Python, some complex algorithms, and a webdriver to hack a real-world, online Wordle game.
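One common building block for a bot like this is filtering the candidate word list by the colour feedback from each guess. The sketch below is an assumption about how that step could look, not the project's actual algorithm; the function names, the pattern encoding ("g" green, "y" yellow, "b" grey), and the tiny word list are hypothetical, and the Selenium automation is omitted.

```python
from collections import Counter

def feedback(guess, answer):
    """Return Wordle-style feedback for `guess` against `answer` as a string of g/y/b."""
    result = ["b"] * len(guess)
    remaining = Counter()
    # First pass: mark greens and count the answer letters that were not matched.
    for i, (g, a) in enumerate(zip(guess, answer)):
        if g == a:
            result[i] = "g"
        else:
            remaining[a] += 1
    # Second pass: mark yellows, consuming unmatched answer letters one at a time.
    for i, g in enumerate(guess):
        if result[i] != "g" and remaining[g] > 0:
            result[i] = "y"
            remaining[g] -= 1
    return "".join(result)

def filter_candidates(candidates, guess, pattern):
    """Keep only words that would have produced `pattern` for `guess`."""
    return [word for word in candidates if feedback(guess, word) == pattern]

# Hypothetical word list and one round of play.
words = ["crane", "crate", "trace", "slate"]
pattern = feedback("crane", "crate")
survivors = filter_candidates(words, "crane", pattern)
```

The two-pass feedback matters for repeated letters: a letter is only marked yellow while unmatched copies of it remain in the answer.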
Python, Selenium, Algorithm Design, Web Automation
AI-Powered Art Workflow
Combining Midjourney, Stable Diffusion, and Photoshop GenAI models to rapidly create digital art.
Midjourney, Stable Diffusion, Photoshop, Digital Art, AI Art
Data Exploration & Visualisation Tutorial
A walkthrough tutorial demonstrating how to explore and visualise data scraped from IMDB.com (ep. 2 of 2).
Python, Data Visualization, Pandas, Educational Content, Data Analysis
Web Scraping Tutorial
A walkthrough tutorial demonstrating how to scrape series data from IMDB.com (ep. 1 of 2).
Python, Web Scraping, BeautifulSoup, Requests, Educational Content
Pokemon Sprites Data Visualisation
Visualising Pokemon data using sprite art.
Python, Data Visualization, Matplotlib, Pandas, Creative Coding
Eye Disease Classifier (ML)
Training a DNN to diagnose my own retina scan.
Deep Learning, TensorFlow, Medical Imaging, CNN, Healthcare AI
AI-Generated & Narrated Short Story
A demo that combines cutting-edge LLM, Image Diffusion, and text-to-voice models to tell an entirely original horror story.
LLM, Text-to-Speech, Stable Diffusion, Storytelling, Creative AI
ControlNet Portraits
Using Stable Diffusion to create portraits of myself.
Stable Diffusion, ControlNet, AI Art, Portrait Generation