Can you self-host AI at parity with ChatGPT?

wuphysics87 · 5 months ago

Can you self-host AI at parity with ChatGPT?

JoYo · edit-2 5 months ago

It’s all dependent on VRAM. If you can load a distilled model onto your GPU without maxing out your VRAM, it will run just as fast as any server farm.

RX 580x

It looks like your video card only has 8 GB of VRAM. That will be your bottleneck.
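
For a rough feel of what fits in 8 GB, here is a back-of-the-envelope sketch. It assumes the weights dominate memory use (parameter count times bits per weight) and adds a flat 20% on top for the KV cache and runtime buffers. That 20% is my own placeholder, not a measured number, and real usage grows with context length. The function names are made up for illustration.

```python
def estimate_vram_gb(params_billion, bits_per_weight, overhead_fraction=0.2):
    """Rough VRAM estimate in GB (1 GB = 1e9 bytes).

    overhead_fraction is an assumed flat allowance for KV cache and
    runtime buffers; it is a placeholder, not a measured value.
    """
    weight_bytes = params_billion * 1e9 * bits_per_weight / 8
    return weight_bytes * (1 + overhead_fraction) / 1e9


def fits_in_vram(params_billion, bits_per_weight, vram_gb, overhead_fraction=0.2):
    """True if the estimated footprint is at or below the card's VRAM."""
    needed = estimate_vram_gb(params_billion, bits_per_weight, overhead_fraction)
    return needed <= vram_gb


if __name__ == "__main__":
    # Example model sizes and quantization levels, checked against an 8 GB card.
    for params, bits in [(7, 4), (7, 16), (14, 4)]:
        needed = estimate_vram_gb(params, bits)
        verdict = "fits" if fits_in_vram(params, bits, 8) else "does not fit"
        print(f"{params}B at {bits}-bit: ~{needed:.1f} GB, {verdict} in 8 GB")
```

Under those assumptions a 7B model quantized to 4 bits fits comfortably, while the same model at 16 bits does not.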

@floquant@lemmy.dbzer0.com · 4 months ago

Also no ROCm support afaik, so it’s running completely on the CPU.

JoYo · 4 months ago

Yeah, that’ll do it too. I’ve got a 6800 XT, which isn’t technically supported, but it works well.