Here are four essential resources for an AI Data Scientist in the making:
- The class repo
- My complete Curriculum for an AI Engineer
- All my Live Events
- My program to join the Proficient AI Engineer Directory
And if you wish: please connect with me on LinkedIn, follow me on X, and subscribe to me on YouTube! All the multi-modal forms of me 😂..
Keep in touch
I’ll only ever contact you occasionally, and
I’ll always aim to add value with every email.
The field of Data Science is an extraordinarily exciting place to be. I couldn’t be happier to have been invited back by the O’Reilly and Pearson teams to run a live event all about it!
I’ve designed this course for people in tech who are considering a career as an AI Data Scientist.
I’ll be taking everyone on an interactive, immersive whirlwind tour. We’ll journey from the foundations of ML with scikit-learn, to neural networks with PyTorch, to Transformers / LLMs with frontier APIs and open-source models with Hugging Face.
By the end, we’ll have created an autonomous Agentic AI solution, using RAG with GPT-4o, DeepSeek and a Chroma vector datastore. We’ll use Structured Outputs, a traditional Random Forest model, and a shockingly powerful QLoRA fine-tuned Llama 3.1 LLM that outperforms frontier models. We will deploy to the serverless AI platform Modal and we’ll make a sharp UI with the new version 5 of Gradio released last week.
If that sounds like a whole ton of AI buzzwords, it’s because it is! And, we will get them all done in under 5 hours! It will be an action-packed afternoon – and it will be a lot of fun, to boot.
Segment 1: Teaser
FOUNDATION: Explaining LLMs
For those new to Data Science, I made these high-level videos to lay out what ‘parameters’ are in GPT, and to look behind the curtain of the extraordinary phenomenon that is “next token prediction”.
BACKGROUND PAPERS
- Turing’s seminal 1950 paper that introduced the Turing Test
- the 2017 paper “Attention Is All You Need” that introduced the Transformer
Frontier models – Web interface
- ChatGPT (latest models GPT-4o and o1) from OpenAI
- Claude (latest model Claude 3.5 Sonnet) from Anthropic
- Gemini Advanced (latest model Gemini 2.0 Flash) from Google
- DeepSeek (latest models DeepSeek R1 and V3) from DeepSeek AI
- Le Chat from French AI powerhouse Mistral
- Chat with Command R+ from Cohere
- Meta.ai (model is Llama 3) from Meta
- Perplexity (latest model is Perplexity Pro) from Perplexity.ai
Here’s a video on the latest GPT-4.5:
Frontier models – API
- GPT API from OpenAI
- Claude API from Anthropic
- Gemini API from Google
- DeepSeek API from DeepSeek AI
I mention the Vellum leaderboard, which has a very useful table of API costs about halfway up the page.
Segment 2: Traditional ML
Tools of the trade
Segments 3 and 4: Deep Learning, Transformers, LLMs
Tools of the trade
Segment 5: Career Paths and Wrap
My first piece of advice for exploring a career as an AI Data Scientist: connect with me on LinkedIn here! I love building up an AI Data Scientist community and I welcome connections. Message me if you have any specific questions or if you’d like to sound me out.
If you want to take your learning to the next level, you could consider my 8-week immersive course on mastering LLM engineering. By the end, you’ll be able to build your own Agentic LLM solutions that outperform frontier models.
Just one more thing.. or two
You’ll find my code and write-up for simulating myself with my 240,000 text message history here. I’d love to hear if you try this for yourself!
And this is fun: here’s a walk-through of building an Arena for LLMs to battle over games of Connect Four! I take you on the journey of building the arena in 10 mins, then we put it to work.
Finally, here’s an entertaining LLM game I wrote a few months ago that has LLMs battle against each other, with a write-up and the code.
That’s it! Hopefully the Live Event helped you figure out if you’d like to pursue a career as an AI Data Scientist. I hope you enjoy the journey, and please do message me if I can help.

