LLM
Some LLMs do not release their model architecture and weights to users.
What is an LLM?
Llama 2 70B model
2 files: the parameters and the code that runs them
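A rough size estimate for the parameters, as a sketch only. It assumes the "70B" in the model name means 70 billion parameters and that each parameter is stored as a 2-byte float16 value; the storage precision is an assumption, not something these notes state.

```python
# Assumption: 70 billion parameters, each stored as float16 (2 bytes).
NUM_PARAMS = 70_000_000_000
BYTES_PER_PARAM = 2


def params_file_size_gb(num_params: int, bytes_per_param: int) -> float:
    """Return the approximate parameter storage size in GB (10^9 bytes)."""
    return num_params * bytes_per_param / 1e9


size_gb = params_file_size_gb(NUM_PARAMS, BYTES_PER_PARAM)
print(f"{size_gb:.0f} GB")
```

Under these assumptions the parameters take far more space than the code that runs them.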
How to get the parameters?
What the neural network does is try to predict the next word in a sequence. The parameters are dispersed throughout the network; the neurons are connected to each other and fire in a certain way.
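To make "predict the next word" concrete, here is a toy sketch that uses word-pair counts instead of a neural network. It is not how an LLM is implemented; the corpus and the function names `train_bigram` and `predict_next` are made up for illustration.

```python
from collections import Counter, defaultdict


def train_bigram(text: str) -> dict:
    """Count, for every word, which words follow it in the training text."""
    words = text.split()
    counts = defaultdict(Counter)
    for prev, nxt in zip(words, words[1:]):
        counts[prev][nxt] += 1
    return counts


def predict_next(counts: dict, word: str):
    """Return the most frequent follower of `word`, or None if it was never seen."""
    followers = counts.get(word)
    if not followers:
        return None
    return followers.most_common(1)[0][0]


corpus = "the cat sat on the mat and the cat slept"
model = train_bigram(corpus)
prediction = predict_next(model, "the")
print(prediction)
```

Here the "parameters" are just the counts in `model`. A real LLM replaces the count table with a neural network whose parameters are learned during training.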
Prediction has a strong relationship with compression.
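One way to see the link between prediction and compression: an ideal code spends about -log2 p bits on a symbol the model predicted with probability p, so better predictions mean fewer bits. The sketch below compares a uniform guess with probabilities fitted to the text; the example string is arbitrary.

```python
import math
from collections import Counter


def bits_needed(text: str, prob: dict) -> float:
    """Ideal code length in bits: the sum of -log2 p(ch) over the text."""
    return sum(-math.log2(prob[ch]) for ch in text)


text = "aaaaaaab"
alphabet = sorted(set(text))

# Model 1: no knowledge, every symbol is equally likely.
uniform = {ch: 1 / len(alphabet) for ch in alphabet}

# Model 2: probabilities fitted to the symbol frequencies in the text.
counts = Counter(text)
fitted = {ch: n / len(text) for ch, n in counts.items()}

uniform_bits = bits_needed(text, uniform)
fitted_bits = bits_needed(text, fitted)
print(uniform_bits, fitted_bits)
```

The fitted model predicts "a" with high probability, so it needs fewer bits than the uniform one to encode the same text.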
An LLM produces text in the correct form and fills it with its knowledge. It does not reproduce a verbatim copy of the text it was trained on.
How does it work?
Training stages
pre-training
fine-tuning
stage 3 (optional)
LLM scaling laws:
Tool use and multimodality. Some LLMs, such as GPT, can now use different tools to help answer questions: a browser, a calculator, a Python interpreter.
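A minimal sketch of how tool use could be wired up, assuming the model emits a plain-text request of the form `tool_name: argument`. The request format, the `TOOLS` registry, and both function names are hypothetical; this is not the mechanism of any particular product.

```python
import operator

# Hypothetical operator table for a very small calculator tool.
OPS = {
    "+": operator.add,
    "-": operator.sub,
    "*": operator.mul,
    "/": operator.truediv,
}


def calculator(expression: str) -> str:
    """Evaluate 'a op b' with two numbers and one operator, separated by spaces."""
    left, op, right = expression.split()
    return str(OPS[op](float(left), float(right)))


# Hypothetical registry: tool name -> function that handles the argument.
TOOLS = {"calculator": calculator}


def run_tool_request(request: str) -> str:
    """Dispatch a 'tool_name: argument' request to the matching tool."""
    name, _, argument = request.partition(":")
    tool = TOOLS.get(name.strip())
    if tool is None:
        return f"unknown tool: {name.strip()}"
    return tool(argument.strip())


result = run_tool_request("calculator: 12 * 7")
print(result)
```

In a real system the tool's output would be fed back to the model as text so that it can continue its answer.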
Future directions of development in LLMs
experts in certain domains