LLM
Scaling Llama 2 (7 - 70B) Fine-tuning on Multi-Node GPUs with Ray on Databricks
Scale up fine-tuning and batch inference of LLMs such as Llama 2 (including the 7B, 13B, and 70B variants) across multiple nodes without having to worry about the complexity of distributed systems.
machine-learning, llama2, databricks
Super Quick: Retrieval Augmented Generation Using Ollama
Unlocking the Power of Ollama Infrastructure for Local Execution of Open Source Models and Interacting with PDFs
ollama, llama2, llm
Structured Data from LLMs — Langchain and Pydantic Output Parser
openai, llm, chatgpt
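The core idea behind a Pydantic output parser is to validate an LLM's raw JSON response against a typed schema. A minimal sketch, assuming Pydantic only (the `Person` model and the simulated response below are illustrative, not from the article):

```python
import json
from pydantic import BaseModel

# Hypothetical schema the LLM is prompted to fill
class Person(BaseModel):
    name: str
    age: int

# Simulated raw LLM response; in practice this comes from the model call
raw = '{"name": "Ada", "age": 36}'

# Validation happens on construction: wrong types or missing fields raise
person = Person(**json.loads(raw))
print(person.name, person.age)
```

With LangChain, `PydanticOutputParser` additionally generates format instructions for the prompt from the same model class; the validation step is the same.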
The People You Need at Your Company for LLM Capabilities
artificial-intelligence, llm
Your Local LLM using FastAPI
FastAPI is a modern, fast, and easy-to-use web framework for building APIs with Python. It is based on standard Python type hints and supports features such as data validation, automatic documentation…
api, python, llm
Using Chains and Agents for LLM Application Development
Step-by-step guide to using chains and agents in LangChain
artificial-intelligence, llm, llm-models
Fine-Tuning Llama 2.0 with Single GPU Magic
Efficient-Tuning your own Language Model
artificial-intelligence, llm, llama-2
How to Run the LLaMA Web UI on Colab or Locally?
artificial-intelligence, llm, llama-2
Llama 2: A New LLM Family Has Arrived
Use Llama 2 LLMs with Hugging Face and Transformers
artificial-intelligence, llm, llama-2
How to Use Llama 2 with an API on AWS to Power Your AI Apps
artificial-intelligence, llm, llama-2