Consider this hypothetical scenario: if you were given $100,000 to build a PC/server to run open-source LLMs like LLaMA 3 for single-user purposes, what would you build?
Depends on what you’re doing with it, but prompt/context processing is much faster on Nvidia GPUs than on Apple chips. The gap narrows if you reuse the same prompt prefix every time, since a cached prefix doesn’t need to be reprocessed.
The time to first token is a lot faster on datacenter GPUs, especially as context length increases, and consumer GPUs don’t have enough VRAM.