Skip to content
Snippets Groups Projects
Commit dc9012a7 authored by Eden_WangZx's avatar Eden_WangZx
Browse files

16384

parent faa0166a
No related branches found
No related merge requests found
......@@ -64,7 +64,7 @@ class Llama3_70B_ZeroShotModel(ShopBenchBaseModel):
# dtype="half", # note: bfloat16 is not supported on nvidia-T4 GPUs
enforce_eager=True,
quantization="AWQ",
max_model_len=8000,
max_model_len=16384,
enable_prefix_caching=True
)
self.tokenizer = self.llm.get_tokenizer()
......
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment