Author: Gabriele Sgroi, PhD
-
Using Qwen2.5-7B-Instruct powered code agents to create a local, open source, multi-agentic RAG system
80 min read -
Empowering Phi-3.5-vision with Wikipedia knowledge for augmented Visual Question Answering.
20 min read -
Get started with multimodal conversational models using the open-source LLaVA model.
21 min read -
Powering up LLaMa 2 with retrieval augmented generation to seek and use information from Wikipedia.
14 min read -
A technique to increase control over the images generated by pre-trained text-to-image diffusion models.
9 min read -
-
Using genetic algorithms to solve the Lunar Lander Continuous environment with a sparse reward.
12 min read -
A tutorial on multiclass text classification using Hugging Face transformers.
6 min read