Skip to content
Snippets Groups Projects
Commit b98c297b authored by Eden_WangZx's avatar Eden_WangZx
Browse files

12288

parent b62a158b
No related branches found
No related merge requests found
......@@ -64,7 +64,7 @@ class Llama3_70B_ZeroShotModel(ShopBenchBaseModel):
# dtype="half", # note: bfloat16 is not supported on nvidia-T4 GPUs
enforce_eager=True,
quantization="AWQ",
max_model_len=16384,
max_model_len=12288,
enable_prefix_caching=True
)
self.tokenizer = self.llm.get_tokenizer()
......
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment