vllm.entrypoints.pooling.classify.protocol ¶
ClassificationRequest module-attribute ¶
ClassificationRequest: TypeAlias = (
ClassificationCompletionRequest
| ClassificationChatRequest
)
ClassificationChatRequest ¶
Bases: PoolingBasicRequestMixin
Source code in vllm/entrypoints/pooling/classify/protocol.py
activation class-attribute instance-attribute ¶
activation: bool | None = Field(
default=None,
description="activation will be deprecated, please use use_activation instead.",
)
add_generation_prompt class-attribute instance-attribute ¶
add_generation_prompt: bool = Field(
default=False,
description="If true, the generation prompt will be added to the chat template. This is a parameter used by chat template in tokenizer config of the model.",
)
add_special_tokens class-attribute instance-attribute ¶
add_special_tokens: bool = Field(
default=False,
description="If true, special tokens (e.g. BOS) will be added to the prompt on top of what is added by the chat template. For most models, the chat template takes care of adding the special tokens so this should be set to false (as is the default).",
)
chat_template class-attribute instance-attribute ¶
chat_template: str | None = Field(
default=None,
description="A Jinja template to use for this conversion. As of transformers v4.44, default chat template is no longer allowed, so you must provide a chat template if the tokenizer does not define one.",
)
chat_template_kwargs class-attribute instance-attribute ¶
chat_template_kwargs: dict[str, Any] | None = Field(
default=None,
description="Additional keyword args to pass to the template renderer. Will be accessible by the chat template.",
)
mm_processor_kwargs class-attribute instance-attribute ¶
mm_processor_kwargs: dict[str, Any] | None = Field(
default=None,
description="Additional kwargs to pass to the HF processor.",
)
softmax class-attribute instance-attribute ¶
softmax: bool | None = Field(
default=None,
description="softmax will be deprecated, please use use_activation instead.",
)
ClassificationCompletionRequest ¶
Bases: PoolingBasicRequestMixin, CompletionRequestMixin
Source code in vllm/entrypoints/pooling/classify/protocol.py
activation class-attribute instance-attribute ¶
activation: bool | None = Field(
default=None,
description="activation will be deprecated, please use use_activation instead.",
)
softmax class-attribute instance-attribute ¶
softmax: bool | None = Field(
default=None,
description="softmax will be deprecated, please use use_activation instead.",
)
ClassificationData ¶
Bases: OpenAIBaseModel
Source code in vllm/entrypoints/pooling/classify/protocol.py
ClassificationResponse ¶
Bases: OpenAIBaseModel
Source code in vllm/entrypoints/pooling/classify/protocol.py
created class-attribute instance-attribute ¶
id class-attribute instance-attribute ¶
id: str = Field(
default_factory=lambda: f"classify-{random_uuid()}"
)