Blogs

MAY 22, 2024

Mistral 7B vs. Llama-2 7B: Lightning Round using GenAI studio

By Isha Ghodgaonkar

Let’s compare how Mistral-7b-instruct-v0.2 and Llama 2-7B-chat perform on some basic LLM prompts.

LEARN MORE

MAY 15, 2024

Activation Memory: What is it?

By Garrett Goon, Kevin Musgrave

An introduction to activation memory, and how it affects GPU memory consumption during model training.

LEARN MORE

MAY 01, 2024

3D Diffuse Glioma Segmentation for Early Cancer Detection: Spotlight Demo

By Isha Ghodgaonkar, Alejandro Morales Martinez

Using HPE’s AI platform to develop an early brain cancer detection machine learning model.

LEARN MORE

APR 29, 2024

AI News #21

By Kevin Musgrave

Highlights include CatLIP, SpaceByte, and LayerSkip

LEARN MORE

APR 24, 2024

From a pre-trained model to an AI assistant: Finetuning Gemma-2B using DPO

By Agnieszka Ciborowska, Kevin Musgrave

LLM Alignment using Direct Preference Optimization, an alternative to RLHF

LEARN MORE

APR 22, 2024

AI News #20

By Isha Ghodgaonkar

Highlights include Llama 3, TR-DPO, Video2Game, Dynamic Typography, and new leaderboards.

LEARN MORE

APR 19, 2024

Determined v0.31.0

By Keita Nonaka

Highlights of the latest release.

LEARN MORE

APR 15, 2024

AI News #19

By Kevin Musgrave

Highlights include Infini-attention, OSWorld, Visual Task Vectors, and Scaling Laws for Data Filtering.

LEARN MORE

APR 11, 2024

Determined v0.30.0

By Wesley Turner

Highlights of the latest release.

LEARN MORE

APR 08, 2024

AI News #18

By Kevin Musgrave

Highlights include Mixture of Depths, ReFT, and Many-shot Jailbreaking.

LEARN MORE