Eduardo Alvarez, Author at Towards Data Science

Created with Nightcafe - Image property of Author

Meta Llama 3 Optimized CPU Inference with Hugging Face and PyTorch

Artificial Intelligence

Learn how to reduce model latency when deploying Meta* Llama 3 on CPUs

Eduardo Alvarez

April 19, 2024

8 min read

Image Property of Author - Create with Nightcafe

Improving LLM Inference Latency on CPUs with Model Quantization

Artificial Intelligence

Discover how to significantly improve inference latency on CPUs using quantization techniques for mixed, int8,…

Eduardo Alvarez

February 29, 2024

10 min read

Created with Nightcafe - Property of Author

Retrieval Augmented Generation (RAG) Inference Engines with LangChain on CPUs

Data Science

Exploring scale, fidelity, and latency in AI applications with RAG

Eduardo Alvarez

December 5, 2023

15 min read

Running Falcon Inference on a CPU with Hugging Face Pipelines

Machine Learning

Learn how to run inference with 7-billion and 40-billion Falcon on a 4th Gen Xeon…

Eduardo Alvarez

June 6, 2023

6 min read

How to Build ML Applications on the AWS Cloud with Kubernetes and oneAPI

Machine Learning

Learn the basics of Kubernetes and Intel AI Analytics Toolkit for building distributed ML Apps

Eduardo Alvarez

March 17, 2023

14 min read

A Detailed Guide for Building Hardware Accelerated MLOps Pipelines in SageMaker

Machine Learning

SageMaker is a fully managed machine learning service on the AWS cloud. The motivation behind…

Eduardo Alvarez

December 13, 2022

8 min read

Guide for Creating Custom Accelerated-AI Images for SageMaker with oneAPI and Docker

Machine Learning

AWS provides out-of-box machine-learning images for SageMaker, but what happens when you want to deploy…

Eduardo Alvarez

December 13, 2022

6 min read

Guide to Building AWS Lambda Functions from ECR Images to Manage SageMaker Inference Endpoints

Machine Learning

We breakdown the process of building a lambda function for machine-learning API endpoints

Eduardo Alvarez

December 13, 2022

6 min read

A.I. Mathematician? A Simplified Look at DeepMind’s AlphaTensor

Artificial Intelligence

Google’s DeepMind research group recently developed “AlphaTensor.” Is this this the beginning of AI driven…

Eduardo Alvarez

October 6, 2022

6 min read

Author: Eduardo Alvarez