Posts in: Blog

FEB 28, 2024

Finetuning Mistral-7B with LoRA and DeepSpeed

By Kevin Musgrave, Agnieszka Ciborowska

How to Finetune Mistral-7B using HuggingFace + Determined

LEARN MORE

FEB 27, 2024

AI News #12

By Kevin Musgrave

Highlights include Sora, Stable Diffusion 3, and Mistral Large.

LEARN MORE

FEB 22, 2024

Determined v0.28.1

By Bill Boggs

Highlights of the latest release.

LEARN MORE

FEB 14, 2024

Determined v0.28.0

By Wesley Turner

Highlights of the latest release.

LEARN MORE

FEB 12, 2024

Self-Discover, Grandmaster-Level Chess without Search, and DeepSeekMath7B

By Isha Ghodgaonkar

Short summaries of self composing reasoning structures, LLMs for math and chess, plus other highlights from the week.

LEARN MORE

FEB 05, 2024

MESA and Co-Designing Model Architectures with Hardware

By Kevin Musgrave

Short summaries of MESA and Co-Designing Model Architectures with Hardware, plus other highlights from the week.

LEARN MORE

JAN 31, 2024

Announcing Determined 0.27.1

By Isha Ghodgaonkar

We are excited to announce the 0.27.1 release of the Determined deep learning training platform!

LEARN MORE

JAN 31, 2024

Finetuning an LLM using HuggingFace + Determined

By Kevin Musgrave, Agnieszka Ciborowska

How to Finetune a TinyLlama-1.1B Model on Text-to-SQL

LEARN MORE

JAN 29, 2024

MambaByte, Multimodal Pathway, and CrossMAE

By Kevin Musgrave

Short summaries of MambaByte, Multimodal Pathway, and CrossMAE, plus other highlights from the week.

LEARN MORE

JAN 19, 2024

VMamba, Sleeper Agents, and AlphaGeometry

By Isha Ghodgaonkar

Visual Mamba for image processing, deceptive LLMs, and a geometry model from DeepMind caught our eye last week.

LEARN MORE