text-generation-webui/LLaMA-model.md at f4aa11cef67d10cbd1ce3050bcbd624db6b9c7f7

mirror of https://github.com/oobabooga/text-generation-webui.git synced 2024-11-22 16:17:57 +01:00

oobabooga 8705eba830 Remove universal llama tokenizer support

Instead replace it with a warning if the tokenizer files look off

2023-07-04 19:43:19 -07:00

1.8 KiB

Raw Blame History

LLaMA is a Large Language Model developed by Meta AI.

It was trained on more tokens than previous models. The result is that the smallest version with 7 billion parameters has similar performance to GPT-3 with 175 billion parameters.

This guide will cover usage through the official transformers implementation. For 4-bit mode, head over to GPTQ models (4 bit mode) .

Getting the weights

Option 1: pre-converted weights

Torrent: https://github.com/oobabooga/text-generation-webui/pull/530#issuecomment-1484235789
Direct download: https://huggingface.co/Neko-Institute-of-Science

⚠️ The tokenizers for the Torrent source above and also for many LLaMA fine-tunes available on Hugging Face may be outdated, in particular the files called tokenizer_config.json and special_tokens_map.json. Here you can find those files: https://huggingface.co/oobabooga/llama-tokenizer

Option 2: convert the weights yourself

Install the protobuf library:

pip install protobuf==3.20.1

Use the script below to convert the model in .pth format that you, a fellow academic, downloaded using Meta's official link.

If you have transformers installed in place:

python -m transformers.models.llama.convert_llama_weights_to_hf --input_dir /path/to/LLaMA --model_size 7B --output_dir /tmp/outputs/llama-7b

Otherwise download convert_llama_weights_to_hf.py first and run:

python convert_llama_weights_to_hf.py --input_dir /path/to/LLaMA --model_size 7B --output_dir /tmp/outputs/llama-7b

Move the llama-7b folder inside your text-generation-webui/models folder.

Starting the web UI

python server.py --model llama-7b

1.8 KiB Raw Blame History

Getting the weights

Option 1: pre-converted weights

Option 2: convert the weights yourself

Starting the web UI

1.8 KiB

Raw Blame History