Asher committed · Commit 146ac84 · Parent: c864579
doc: minor fix.
README.md CHANGED
```diff
@@ -294,7 +294,7 @@ You can build and run vLLM from source after merging this pull request into your
 
 ### Model Context Length Support
 
-The Hunyuan A13B model supports a maximum context length of **256K tokens (262,144
+The Hunyuan A13B model supports a maximum context length of **256K tokens (262,144 tokens)**. However, due to GPU memory constraints on most hardware setups, the default configuration in `config.json` limits the context length to **32K tokens** to prevent out-of-memory (OOM) errors.
 
 #### Extending Context Length to 256K
 
```
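The added paragraph is effectively a how-to hint: the shipped `config.json` caps the window at 32K, so reaching the full 256K requires overriding that limit at load time. As a minimal sketch (an assumption for illustration, not part of this commit), one way to do that with vLLM's Python API; the model path, parallelism degree, and prompt are placeholders:

```python
# Sketch (assumption, not from this commit): override the 32K default from
# config.json by passing an explicit max_model_len when constructing the
# vLLM engine. Serving at 262,144 tokens needs enough GPU memory for the
# KV cache; otherwise engine initialization fails with an OOM error.
from vllm import LLM, SamplingParams

llm = LLM(
    model="tencent/Hunyuan-A13B-Instruct",  # placeholder model path
    max_model_len=262144,                   # 256K tokens (256 * 1024)
    tensor_parallel_size=8,                 # assumption: shard across 8 GPUs
)

outputs = llm.generate(
    "Summarize the following document: ...",
    SamplingParams(max_tokens=256),
)
print(outputs[0].outputs[0].text)
```

The same override is available on the CLI via vLLM's `--max-model-len` flag when launching a server, so editing `config.json` itself is not required.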