Advice - Getting started with LLMs

its_me_xiphos@beehaw.org · 5 months ago

Advice - Getting started with LLMs

halcyon@slrpnk.net · 5 months ago

How much do you know about natural language processing? If you aren’t already familiar, you’ll probably want to start with some basics like tokenizing, lemmatizing, stemming, identifying stop words, determining parts of speech, spell checking, and vectorizing. Putting together a clean normalized training set will require some or most of these, and it should help give you some context of what you’re putting in.

I’m most familiar with python’s natural language toolkit for most of those, sklearn does also have some text tools (vectorizer for sure), or you could jump right into keras/tensorflow.

After that, look into the concept of transformer models - this tutorial does cover some of the basic cleanup steps, although I’d still want to understand them better than just copy pasting their code/regexes:

https://machinelearningmastery.com/building-transformer-models-with-attention-crash-course-build-a-neural-machine-translator-in-12-days/

https://machinelearningmastery.com/what-are-large-language-models/

🐝bownage [they/he]@beehaw.org · 5 months ago

Good recommendations! I’d suggest doing some spacy tutorials as well, regarding the topics in the first paragraph. But arguably it’s possible nowadays to just start at transformers without any NLP knowledge, e.g. using huggingface’s AutoTrain or something similar. I wouldn’t recommend it, but you definitely could.