Blogs

FEB 05, 2024

MESA and Co-Designing Model Architectures with Hardware

By Kevin Musgrave

Short summaries of MESA and Co-Designing Model Architectures with Hardware, plus other highlights from the week.

LEARN MORE

JAN 31, 2024

Announcing Determined 0.27.1

By Isha Ghodgaonkar

We are excited to announce the 0.27.1 release of the Determined deep learning training platform!

LEARN MORE

JAN 31, 2024

Finetuning an LLM using HuggingFace + Determined

By Kevin Musgrave, Agnieszka Ciborowska

How to Finetune a TinyLlama-1.1B Model on Text-to-SQL

LEARN MORE

JAN 29, 2024

MambaByte, Multimodal Pathway, and CrossMAE

By Kevin Musgrave

Short summaries of MambaByte, Multimodal Pathway, and CrossMAE, plus other highlights from the week.

LEARN MORE

JAN 19, 2024

VMamba, Sleeper Agents, and AlphaGeometry

By Isha Ghodgaonkar

Visual Mamba for image processing, deceptive LLMs, and a geometry model from DeepMind caught our eye last week.

LEARN MORE

JAN 17, 2024

How Multimodal LLMs Work

By Kevin Musgrave

Flamingo, BLIP-2, and LLaVA explained in simple terms.

LEARN MORE

JAN 12, 2024

Unsloth, V-star, and TOFU

By Isha Ghodgaonkar

A new open source library for faster LLM finetuning, a multimodal guided visual search algorithm, and a new unlearning task for LLMs caught our eye last week.

LEARN MORE

JAN 08, 2024

Mobile ALOHA, AppAgent, Virtual Token Counter, and Time Vectors

By Kevin Musgrave

Here’s what happened in AI the past few weeks.

LEARN MORE

DEC 19, 2023

NeurIPS 2023 - What's the buzz?

By Isha Ghodgaonkar, Liam Li

What we took away from attending NeurIPS ‘23 last week.

LEARN MORE

DEC 11, 2023

Generative Powers of 10, SeamlessExpressive, Mistral 8x7B, and Magicoder

By Isha Ghodgaonkar

Here’s what happened in AI last week.

LEARN MORE