class documentation
class TorchCLIPModel(fout.TorchImageModel, fom.PromptMixin): (source)
Constructor: TorchCLIPModel(config)
Torch implementation of CLIP from https://github.com/openai/CLIP.
Parameters | |
config | a TorchCLIPModelConfig |
Method | __init__ |
Undocumented |
Method | embed |
Generates an embedding for the given text prompt. |
Method | embed |
Generates an embedding for the given text prompts. |
Property | can |
Whether this instance can generate prompt embeddings. |
Method | _download |
Undocumented |
Method | _embed |
Undocumented |
Method | _get |
Undocumented |
Method | _get |
Undocumented |
Method | _load |
Undocumented |
Method | _predict |
Applies a forward pass to the given iterable of data and returns the raw model output with no processing applied. |
Instance Variable | _text |
Undocumented |
Instance Variable | _tokenizer |
Undocumented |
Inherited from TorchImageModel
:
Method | __enter__ |
Undocumented |
Method | __exit__ |
Undocumented |
Method | predict |
Performs prediction on the given image. |
Method | predict |
Performs prediction on the given batch of images. |
Method | preprocess |
Undocumented |
Instance Variable | config |
Undocumented |
Property | classes |
The list of class labels for the model, if known. |
Property | device |
The torch:torch.torch.device that the model is using. |
Property | has |
Whether this instance can generate logits. |
Property | mask |
The mask targets for the model, if any. |
Property | media |
The media type processed by the model. |
Property | num |
The number of classes for the model, if known. |
Property | preprocess |
Whether to apply preprocessing transforms for inference, if any. |
Property | ragged |
Whether transforms may return tensors of different sizes. If True, then passing ragged lists of images to predict_all may not be not allowed. |
Property | skeleton |
The keypoint skeleton for the model, if any. |
Property | transforms |
A torchvision.transforms function that will be applied to each input before prediction, if any. |
Property | using |
Whether the model is using GPU. |
Property | using |
Whether the model is using half precision. |
Method | _build |
Undocumented |
Method | _build |
Undocumented |
Method | _forward |
Undocumented |
Method | _load |
Undocumented |
Method | _parse |
Undocumented |
Method | _parse |
Undocumented |
Method | _parse |
Undocumented |
Instance Variable | _benchmark |
Undocumented |
Instance Variable | _classes |
Undocumented |
Instance Variable | _device |
Undocumented |
Instance Variable | _mask |
Undocumented |
Instance Variable | _model |
Undocumented |
Instance Variable | _no |
Undocumented |
Instance Variable | _output |
Undocumented |
Instance Variable | _preprocess |
Undocumented |
Instance Variable | _ragged |
Undocumented |
Instance Variable | _skeleton |
Undocumented |
Instance Variable | _transforms |
Undocumented |
Instance Variable | _using |
Undocumented |
Instance Variable | _using |
Undocumented |
Inherited from TorchEmbeddingsMixin
(via TorchImageModel
):
Method | embed |
Generates an embedding for the given data. |
Method | embed |
Generates embeddings for the given iterable of data. |
Method | get |
Returns the embeddings generated by the last forward pass of the model. |
Property | has |
Whether this instance has embeddings. |
Instance Variable | _as |
Undocumented |
Instance Variable | _embeddings |
Undocumented |
Inherited from LogitsMixin
(via TorchImageModel
, TorchEmbeddingsMixin
, EmbeddingsMixin
, TorchModelMixin
):
Method | store |
Undocumented |
Property | store |
Whether the model should store logits in its predictions. |
Instance Variable | _store |
Undocumented |
Generates an embedding for the given text prompts.
Parameters | |
prompts | an iterable of text strings |
Returns | |
a num_prompts x num_dims array of prompt embeddings |
Applies a forward pass to the given iterable of data and returns the raw model output with no processing applied.
Parameters | |
imgs | Undocumented |
args | an iterable of data. See predict_all for details |
Returns | |
the raw output of the model |