September 19, 2024
We are regularly dazzled by the advancements in large language models (LLMs), particularly the ones with a massive number of parameters. However, running inference with 70B+ parameter models is cost prohibitive for most organizations. As a result, smaller language models (SLMs) have been gaining traction as a more cost-effective way to run inference workloads. That said, it is not always feasible to pretrain SLMs from scratch, given the major challenges around data collection, pretraining pipelines, and more. A popular alternative has been to start with a larger LLM and distill it into a smaller model. Pruning and distillation are two of the most popular techniques in this area. Recently, NVIDIA released two models called Minitron-8B and Minitron-4B, based on distilled versions of Llama 3.1 405B. Minitron focuses on reducing the size of AI models through pruning and distillation, making them more efficient without sacrificing too much accuracy. Pruning...
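
To make the two techniques concrete, here is a minimal sketch in PyTorch of what pruning and distillation look like in their simplest forms: unstructured magnitude pruning and a soft-label distillation loss. This is an illustrative assumption on my part, not Minitron's actual recipe, which uses structured width/depth pruning and a more elaborate distillation pipeline.

```python
import torch
import torch.nn.functional as F

def magnitude_prune(weight: torch.Tensor, sparsity: float = 0.5) -> torch.Tensor:
    """Zero out the smallest-magnitude weights (simple unstructured pruning).

    Illustrative only; Minitron prunes whole widths/depths, not individual weights.
    """
    k = int(weight.numel() * sparsity)
    if k == 0:
        return weight
    # Threshold at the k-th smallest absolute value, then zero everything below it.
    threshold = weight.abs().flatten().kthvalue(k).values
    return torch.where(weight.abs() > threshold, weight, torch.zeros_like(weight))

def distillation_loss(student_logits: torch.Tensor,
                      teacher_logits: torch.Tensor,
                      temperature: float = 2.0) -> torch.Tensor:
    """Soft-label knowledge distillation loss (Hinton et al., 2015 style).

    The student matches the teacher's temperature-softened output distribution.
    """
    t = temperature
    student_log_probs = F.log_softmax(student_logits / t, dim=-1)
    teacher_probs = F.softmax(teacher_logits / t, dim=-1)
    # batchmean KL divergence, scaled by T^2 to keep gradient magnitudes comparable.
    return F.kl_div(student_log_probs, teacher_probs, reduction="batchmean") * (t * t)
```

In practice, the pruned student is initialized from the teacher's surviving weights and then trained with the distillation loss (often mixed with the standard next-token loss) to recover most of the teacher's accuracy at a fraction of the size.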