Berlin Buzzwords 2025

Rafał Kuć

Software engineer, trainer, consultant and author from time to time - some would say that he is an all in one battle weapon concentrated on information retrieval, performance and user search experience. However he also likes all the other cool stuff that is happening in the IT world. Likes to share his knowledge by giving talks at various meet ups and conferences.


Company/Organisation

Authologic

Twitter

https://x.com/kucrafal

LinkedIn

https://www.linkedin.com/in/rafalkuc/


Session

06-16
10:40
20min
Which GPU for Local LLMs?
Radu Gheorghe, Rafał Kuć

You’re using local LLMs. For example, to power RAG. You want to deploy them in production, but you don’t know where: which type of GPU? How large should it be? Should you use a larger model but quantize more aggressively?

Our benchmark results and their interpretation will give you some answers.

Kesselhaus