The problem is most of these models need like a terabyte of VRAM… And consumers have about 8-24GB.
Old news pal! 😄
12GB of VRAM is still an upgrade away for most people, and a 4-bit quantized 13B model is barely going to be a tech demo. When open-source AI is proclaimed to be near/on par with/better than GPT-4, they're talking about nothing other than their biggest models in a prime environment.
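For anyone wondering where these numbers come from, here's a rough back-of-envelope sketch. The weights of an N-billion-parameter model at b bits per weight take about N × b / 8 GB, and you need headroom on top for the KV cache and activations (the 1.2× factor below is an assumption, not a hard rule — real overhead varies with context length):

```python
def approx_vram_gb(params_billions: float, bits_per_weight: int,
                   overhead: float = 1.2) -> float:
    """Rough VRAM estimate in GB: weight bytes times a fudge
    factor for KV cache/activations (assumed 1.2x, varies a lot)."""
    weight_gb = params_billions * bits_per_weight / 8
    return weight_gb * overhead

# 13B at 4-bit: ~6.5 GB of weights, ~7.8 GB with overhead,
# which is why it squeezes into 12GB but not into 8GB cards.
print(round(approx_vram_gb(13, 4), 1))
# Same math at fp16 shows why the big models blow past consumer cards:
# 175B at 16-bit is ~350 GB of weights alone.
print(round(approx_vram_gb(175, 16, overhead=1.0)))
```

Plug in your own model size and quantization level — the point is just that quantization buys roughly a 4× reduction from fp16, not a miracle.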
Sure, but not for standard cloud instances that are very affordable for companies wanting to get away from OpenAI.
Holy shit a terabyte?
This specific one says it’ll run on 24GB actually. But some are just crazy big.
There are smaller models that can run on most laptops.
https://www.maginative.com/article/stability-ai-releases-stable-lm-3b-a-small-high-performance-language-model-for-smart-devices/
In benchmarks this looks like it is not far off GPT-3.5.
It’s not even close — less than half of GPT-3.5’s 85.5% on ARC. Some larger open models are competitive on HellaSwag, TruthfulQA, and MMLU, but ARC is still a major struggle for small models.
3Bs are kind of pointless right now because the machines with processors capable of running them at a usable speed probably have enough memory to run a 7B anyway.