Update README.md
Browse files
README.md
CHANGED
|
@@ -108,10 +108,4 @@ Tested on Google Colab Tesla T4 with:
|
|
| 108 |
|
| 109 |
## Original Model
|
| 110 |
|
| 111 |
-
For standard PyTorch/Transformers usage, see the original model: [cointegrated/rubert-tiny2](https://huggingface.co/cointegrated/rubert-tiny2)
|
| 112 |
-
|
| 113 |
-
This vLLM version is optimized for deployment scenarios requiring:
|
| 114 |
-
- High throughput batch processing
|
| 115 |
-
- Low latency inference
|
| 116 |
-
- OpenAI API compatibility
|
| 117 |
-
- Production-grade serving infrastructure
|
|
|
|
| 108 |
|
| 109 |
## Original Model
|
| 110 |
|
| 111 |
+
For standard PyTorch/Transformers usage, see the original model: [cointegrated/rubert-tiny2](https://huggingface.co/cointegrated/rubert-tiny2)
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|