wang.yuqi d2a7938582
[Frontend][1/N] Improve all pooling task | Support FP16 Embedding Base64 (Still uses fp32 by default). (#26414)
Signed-off-by: wang.yuqi <noooop@126.com>
Co-authored-by: Maximilien de Bayser <maxdebayser@gmail.com>
Co-authored-by: Cyrus Leung <cyrus.tl.leung@gmail.com>
2025-10-13 19:06:43 +00:00

Pooling models

Cohere rerank usage

python examples/online_serving/pooling/cohere_rerank_client.py

Embedding embed_dtype usage

python examples/online_serving/pooling/embedding_embed_dtype_client.py
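
As a rough illustration of what a client has to do with a base64-encoded embedding, the sketch below decodes a payload into a list of floats using only the standard library. The helper names (`decode_embedding`, `encode_embedding`), the dtype-name mapping, and the little-endian byte order are assumptions made for this example and are not taken from the example script; check the server's response format before relying on them.

```python
import base64
import struct

# Assumed mapping from a dtype name to a struct format code (hypothetical).
# "e" is IEEE 754 half precision (2 bytes), "f" is single precision (4 bytes).
_FORMATS = {"float32": "f", "float16": "e"}


def decode_embedding(payload: str, embed_dtype: str = "float32") -> list[float]:
    """Decode a base64 string into floats, assuming little-endian packing."""
    code = _FORMATS[embed_dtype]
    raw = base64.b64decode(payload)
    item_size = struct.calcsize("<" + code)
    if len(raw) % item_size != 0:
        raise ValueError("payload length does not match the element size")
    count = len(raw) // item_size
    return list(struct.unpack(f"<{count}{code}", raw))


def encode_embedding(values: list[float], embed_dtype: str = "float32") -> str:
    """Inverse of decode_embedding, used here only to build a test payload."""
    code = _FORMATS[embed_dtype]
    packed = struct.pack(f"<{len(values)}{code}", *values)
    return base64.b64encode(packed).decode("ascii")


if __name__ == "__main__":
    vector = [0.5, -1.25, 2.0]  # values exactly representable in fp16
    for dtype in ("float32", "float16"):
        payload = encode_embedding(vector, dtype)
        print(dtype, len(payload), decode_embedding(payload, dtype))
```

The fp16 payload is half the size of the fp32 one, at the cost of reduced precision for values that are not exactly representable in half precision.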

Jina AI rerank usage

python examples/online_serving/pooling/jinaai_rerank_client.py

Named Entity Recognition (NER) usage

python examples/online_serving/pooling/ner_client.py

OpenAI chat embedding for multimodal usage

python examples/online_serving/pooling/openai_chat_embedding_client_for_multimodal.py

OpenAI classification usage

python examples/online_serving/pooling/openai_classification_client.py

OpenAI embedding usage

python examples/online_serving/pooling/openai_embedding_client.py

OpenAI embedding Matryoshka dimensions usage

python examples/online_serving/pooling/openai_embedding_matryoshka_fy.py
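
For intuition about what a reduced Matryoshka dimension means, the sketch below shows the usual client-side equivalent: keep the first `dimensions` components of a full embedding and re-normalize to unit length. The function name `truncate_embedding` is hypothetical, and this is not necessarily how the server computes its result; it only illustrates the idea under the assumption that the model was trained so that prefixes of the embedding remain meaningful.

```python
import math


def truncate_embedding(vector: list[float], dimensions: int) -> list[float]:
    """Keep the leading `dimensions` components and rescale to unit L2 norm."""
    if not 0 < dimensions <= len(vector):
        raise ValueError("dimensions must be between 1 and len(vector)")
    head = vector[:dimensions]
    norm = math.sqrt(sum(x * x for x in head))
    if norm == 0.0:
        # An all-zero prefix cannot be normalized; return it unchanged.
        return list(head)
    return [x / norm for x in head]


if __name__ == "__main__":
    full = [3.0, 4.0, 12.0]
    short = truncate_embedding(full, 2)
    print(short, math.sqrt(sum(x * x for x in short)))
```

Re-normalizing matters when the truncated vectors are compared with a dot product, since the prefix of a unit vector is generally shorter than unit length.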

Openai pooling usage

python examples/online_serving/pooling/openai_pooling_client.py