Models
bert
Bert Model.
- class mindnlp.models.bert.BertAttention(config)[source]
Bases:
CellBert Attention
- construct(input_tensor, attention_mask=None, head_mask=None)[source]
Defines the computation to be performed. This method must be overridden by all subclasses.
Note
It is not supported currently that inputs contain both tuple and non-tuple types at same time.
- Parameters:
args (tuple) – Tuple of variable parameters.
kwargs (dict) – Dictionary of variable keyword parameters.
- Returns:
Tensor, returns the computed result.
- class mindnlp.models.bert.BertConfig(vocab_size=30522, hidden_size=768, num_hidden_layers=12, num_attention_heads=12, intermediate_size=3072, hidden_act='gelu', hidden_dropout_prob=0.1, attention_probs_dropout_prob=0.1, max_position_embeddings=512, type_vocab_size=2, initializer_range=0.02, layer_norm_eps=1e-12, classifier_dropout=None, **kwargs)[source]
Bases:
PreTrainedConfigConfiguration for BERT-base
- class mindnlp.models.bert.BertEmbeddings(config)[source]
Bases:
CellEmbeddings for BERT, include word, position and token_type
- construct(input_ids, token_type_ids=None, position_ids=None)[source]
Defines the computation to be performed. This method must be overridden by all subclasses.
Note
It is not supported currently that inputs contain both tuple and non-tuple types at same time.
- Parameters:
args (tuple) – Tuple of variable parameters.
kwargs (dict) – Dictionary of variable keyword parameters.
- Returns:
Tensor, returns the computed result.
- class mindnlp.models.bert.BertEncoder(config)[source]
Bases:
CellBert Encoder
- construct(hidden_states, attention_mask=None, head_mask=None)[source]
Defines the computation to be performed. This method must be overridden by all subclasses.
Note
It is not supported currently that inputs contain both tuple and non-tuple types at same time.
- Parameters:
args (tuple) – Tuple of variable parameters.
kwargs (dict) – Dictionary of variable keyword parameters.
- Returns:
Tensor, returns the computed result.
- class mindnlp.models.bert.BertForPretraining(config, *args, **kwargs)[source]
Bases:
BertPreTrainedModelBert For Pretraining
- construct(input_ids, attention_mask=None, token_type_ids=None, position_ids=None, head_mask=None, masked_lm_positions=None)[source]
Defines the computation to be performed. This method must be overridden by all subclasses.
Note
It is not supported currently that inputs contain both tuple and non-tuple types at same time.
- Parameters:
args (tuple) – Tuple of variable parameters.
kwargs (dict) – Dictionary of variable keyword parameters.
- Returns:
Tensor, returns the computed result.
- class mindnlp.models.bert.BertForSequenceClassification(config)[source]
Bases:
BertPreTrainedModelBert Model for classification tasks
- construct(input_ids, attention_mask=None, token_type_ids=None, position_ids=None, head_mask=None, labels=None)[source]
Defines the computation to be performed. This method must be overridden by all subclasses.
Note
It is not supported currently that inputs contain both tuple and non-tuple types at same time.
- Parameters:
args (tuple) – Tuple of variable parameters.
kwargs (dict) – Dictionary of variable keyword parameters.
- Returns:
Tensor, returns the computed result.
- class mindnlp.models.bert.BertIntermediate(config)[source]
Bases:
CellBert Intermediate
- construct(hidden_states)[source]
Defines the computation to be performed. This method must be overridden by all subclasses.
Note
It is not supported currently that inputs contain both tuple and non-tuple types at same time.
- Parameters:
args (tuple) – Tuple of variable parameters.
kwargs (dict) – Dictionary of variable keyword parameters.
- Returns:
Tensor, returns the computed result.
- class mindnlp.models.bert.BertLMPredictionHead(config)[source]
Bases:
CellBert LM Prediction Head
- construct(hidden_states, masked_lm_positions)[source]
Defines the computation to be performed. This method must be overridden by all subclasses.
Note
It is not supported currently that inputs contain both tuple and non-tuple types at same time.
- Parameters:
args (tuple) – Tuple of variable parameters.
kwargs (dict) – Dictionary of variable keyword parameters.
- Returns:
Tensor, returns the computed result.
- class mindnlp.models.bert.BertLayer(config)[source]
Bases:
CellBert Layer
- construct(hidden_states, attention_mask=None, head_mask=None)[source]
Defines the computation to be performed. This method must be overridden by all subclasses.
Note
It is not supported currently that inputs contain both tuple and non-tuple types at same time.
- Parameters:
args (tuple) – Tuple of variable parameters.
kwargs (dict) – Dictionary of variable keyword parameters.
- Returns:
Tensor, returns the computed result.
- class mindnlp.models.bert.BertModel(config, add_pooling_layer=True)[source]
Bases:
BertPreTrainedModelBert Model
- construct(input_ids, attention_mask=None, token_type_ids=None, position_ids=None, head_mask=None)[source]
Defines the computation to be performed. This method must be overridden by all subclasses.
Note
It is not supported currently that inputs contain both tuple and non-tuple types at same time.
- Parameters:
args (tuple) – Tuple of variable parameters.
kwargs (dict) – Dictionary of variable keyword parameters.
- Returns:
Tensor, returns the computed result.
gpt
GPT Model.
- class mindnlp.models.gpt.Attention(nx, n_positions, config, scale=False)[source]
Bases:
CellGPT Attention
- construct(x, attention_mask=None, head_mask=None)[source]
Defines the computation to be performed. This method must be overridden by all subclasses.
Note
It is not supported currently that inputs contain both tuple and non-tuple types at same time.
- Parameters:
args (tuple) – Tuple of variable parameters.
kwargs (dict) – Dictionary of variable keyword parameters.
- Returns:
Tensor, returns the computed result.
- class mindnlp.models.gpt.Block(n_positions, config, scale=False)[source]
Bases:
CellGPT Block
- construct(x, attention_mask=None, head_mask=None)[source]
Defines the computation to be performed. This method must be overridden by all subclasses.
Note
It is not supported currently that inputs contain both tuple and non-tuple types at same time.
- Parameters:
args (tuple) – Tuple of variable parameters.
kwargs (dict) – Dictionary of variable keyword parameters.
- Returns:
Tensor, returns the computed result.
- class mindnlp.models.gpt.GPTConfig(vocab_size=40478, n_positions=512, n_embd=768, hidden_size=768, n_layer=12, n_head=12, afn='gelu_new', resid_pdrop=0.1, embd_pdrop=0.1, attn_pdrop=0.1, layer_norm_epsilon=1e-05, initializer_range=0.02, summary_type='cls_index', summary_use_proj=True, summary_activation=None, summary_proj_to_labels=True, summary_first_dropout=0.1, **kwargs)[source]
Bases:
PreTrainedConfigGPT config
- class mindnlp.models.gpt.GPTDoubleHeadsModel(config)[source]
Bases:
GPTPreTrainedModelOpenAI GPT Model transformer with a language modeling and a multiple-choice classification head on top e.g. for RocStories/SWAG tasks. The two heads are two linear layers. The language modeling head has its weights tied to the input embeddings, the classification head takes as input the input of a specified classification token index in the input sequence).
- construct(input_ids=None, attention_mask=None, token_type_ids=None, position_ids=None, head_mask=None, inputs_embeds=None, mc_token_ids=None, labels=None, mc_labels=None)[source]
Defines the computation to be performed. This method must be overridden by all subclasses.
Note
It is not supported currently that inputs contain both tuple and non-tuple types at same time.
- Parameters:
args (tuple) – Tuple of variable parameters.
kwargs (dict) – Dictionary of variable keyword parameters.
- Returns:
Tensor, returns the computed result.
- class mindnlp.models.gpt.GPTForSequenceClassification(config)[source]
Bases:
GPTPreTrainedModelThe Original GPT Model transformer with a sequence classification head on top (linear layer). GPTForSequenceClassification uses the last token in order to do the classification, as other causal models (e.g. GPT-2) do. Since it does classification on the last token, it requires to know the position of the last token. If a pad_token_id is defined in the configuration, it finds the last token that is not a padding token in each row. If no pad_token_id is defined, it simply takes the last value in each row of the batch. Since it cannot guess the padding tokens when inputs_embeds are passed instead of input_ids, it does the same (take the last value in each row of the batch).
- construct(input_ids=None, attention_mask=None, token_type_ids=None, position_ids=None, head_mask=None, inputs_embeds=None, labels=None)[source]
- labels (torch.LongTensor of shape (batch_size,), optional):
Labels for computing the sequence classification/regression loss. Indices should be in [0, …,config.num_labels - 1]. If config.num_labels == 1 a regression loss is computed (Mean-Square loss), If config.num_labels > 1 a classification loss is computed (Cross-Entropy).
- class mindnlp.models.gpt.GPTLMHeadModel(config)[source]
Bases:
GPTPreTrainedModelGPT Model transformer with a language modeling head on top (linear layer with weights tied to the input embeddings).
- construct(input_ids=None, attention_mask=None, token_type_ids=None, position_ids=None, head_mask=None, inputs_embeds=None, labels=None)[source]
Defines the computation to be performed. This method must be overridden by all subclasses.
Note
It is not supported currently that inputs contain both tuple and non-tuple types at same time.
- Parameters:
args (tuple) – Tuple of variable parameters.
kwargs (dict) – Dictionary of variable keyword parameters.
- Returns:
Tensor, returns the computed result.
- class mindnlp.models.gpt.GPTModel(config)[source]
Bases:
GPTPreTrainedModelThe bare GPT transformer model outputting raw hidden-states without any specific head on top
- construct(input_ids=None, attention_mask=None, token_type_ids=None, position_ids=None, head_mask=None, inputs_embeds=None)[source]
Defines the computation to be performed. This method must be overridden by all subclasses.
Note
It is not supported currently that inputs contain both tuple and non-tuple types at same time.
- Parameters:
args (tuple) – Tuple of variable parameters.
kwargs (dict) – Dictionary of variable keyword parameters.
- Returns:
Tensor, returns the computed result.
- class mindnlp.models.gpt.MLP(n_state, config)[source]
Bases:
CellGPT MLP
- construct(x)[source]
Defines the computation to be performed. This method must be overridden by all subclasses.
Note
It is not supported currently that inputs contain both tuple and non-tuple types at same time.
- Parameters:
args (tuple) – Tuple of variable parameters.
kwargs (dict) – Dictionary of variable keyword parameters.
- Returns:
Tensor, returns the computed result.
gpt2
GPT2 Models init
- class mindnlp.models.gpt2.GPT2Attention(config, is_cross_attention=False, layer_idx=None)[source]
Bases:
Cellgpt2 Attention
- construct(hidden_states: Tuple[Tensor], layer_past: Optional[Tuple[Tensor]] = None, attention_mask: Optional[Tensor] = None, head_mask: Optional[Tensor] = None, encoder_hidden_states: Optional[Tensor] = None, encoder_attention_mask: Optional[Tensor] = None, use_cache: Optional[bool] = False)[source]
Defines the computation to be performed. This method must be overridden by all subclasses.
Note
It is not supported currently that inputs contain both tuple and non-tuple types at same time.
- Parameters:
args (tuple) – Tuple of variable parameters.
kwargs (dict) – Dictionary of variable keyword parameters.
- Returns:
Tensor, returns the computed result.
- class mindnlp.models.gpt2.GPT2Config(vocab_size=50257, n_positions=1024, n_embd=768, n_layer=12, n_head=12, n_inner=None, activation_function='gelu_new', resid_pdrop=0.1, embd_pdrop=0.1, attn_pdrop=0.1, layer_norm_epsilon=1e-05, initializer_range=0.02, summary_type='cls_index', summary_use_proj=True, summary_activation=None, summary_proj_to_labels=True, summary_first_dropout=0.1, scale_attn_weights=True, use_cache=True, bos_token_id=50256, eos_token_id=50256, scale_attn_by_inverse_layer_idx=False, reorder_and_upcast_attn=False, **kwargs)[source]
Bases:
PreTrainedConfigConfiguration for gpt2-base
- class mindnlp.models.gpt2.GPT2DoubleHeadsModel(config)[source]
Bases:
GPT2PreTrainedModelGPT2 Double Heads Model
- construct(input_ids, past_key_values=None, attention_mask=None, token_type_ids=None, position_ids=None, head_mask=None, inputs_embeds=None, mc_token_ids=None, labels=None, mc_labels=None)[source]
Defines the computation to be performed. This method must be overridden by all subclasses.
Note
It is not supported currently that inputs contain both tuple and non-tuple types at same time.
- Parameters:
args (tuple) – Tuple of variable parameters.
kwargs (dict) – Dictionary of variable keyword parameters.
- Returns:
Tensor, returns the computed result.
- class mindnlp.models.gpt2.GPT2ForSequenceClassification(config)[source]
Bases:
GPT2PreTrainedModelgpt2 For Sequence Classification
- construct(input_ids: Tensor, past_key_values: Optional[Tuple[Tuple[Tensor]]] = None, attention_mask: Optional[Tensor] = None, token_type_ids: Optional[Tensor] = None, position_ids: Optional[Tensor] = None, head_mask: Optional[Tensor] = None, inputs_embeds: Optional[Tensor] = None, labels: Optional[Tensor] = None)[source]
Defines the computation to be performed. This method must be overridden by all subclasses.
Note
It is not supported currently that inputs contain both tuple and non-tuple types at same time.
- Parameters:
args (tuple) – Tuple of variable parameters.
kwargs (dict) – Dictionary of variable keyword parameters.
- Returns:
Tensor, returns the computed result.
- class mindnlp.models.gpt2.GPT2ForTokenClassification(config)[source]
Bases:
GPT2PreTrainedModelGPT2 For Token Classification
- construct(input_ids=None, past_key_values=None, attention_mask=None, token_type_ids=None, position_ids=None, head_mask=None, inputs_embeds=None, labels=None)[source]
Defines the computation to be performed. This method must be overridden by all subclasses.
Note
It is not supported currently that inputs contain both tuple and non-tuple types at same time.
- Parameters:
args (tuple) – Tuple of variable parameters.
kwargs (dict) – Dictionary of variable keyword parameters.
- Returns:
Tensor, returns the computed result.
- class mindnlp.models.gpt2.GPT2LMHeadModel(config, **kwargs)[source]
Bases:
GPT2PreTrainedModelgpt2 LMHead Model
- construct(input_ids: Tensor, past_key_values: Optional[Tuple[Tuple[Tensor]]] = None, attention_mask: Optional[Tensor] = None, token_type_ids: Optional[Tensor] = None, position_ids: Optional[Tensor] = None, head_mask: Optional[Tensor] = None, inputs_embeds: Optional[Tensor] = None, encoder_hidden_states: Optional[Tensor] = None, encoder_attention_mask: Optional[Tensor] = None, labels: Optional[Tensor] = None)[source]
Defines the computation to be performed. This method must be overridden by all subclasses.
Note
It is not supported currently that inputs contain both tuple and non-tuple types at same time.
- Parameters:
args (tuple) – Tuple of variable parameters.
kwargs (dict) – Dictionary of variable keyword parameters.
- Returns:
Tensor, returns the computed result.
- class mindnlp.models.gpt2.GPT2MLP(intermediate_size, config)[source]
Bases:
Cellgpt2 MLP
- construct(hidden_states: Tuple[Tensor])[source]
Defines the computation to be performed. This method must be overridden by all subclasses.
Note
It is not supported currently that inputs contain both tuple and non-tuple types at same time.
- Parameters:
args (tuple) – Tuple of variable parameters.
kwargs (dict) – Dictionary of variable keyword parameters.
- Returns:
Tensor, returns the computed result.
- class mindnlp.models.gpt2.GPT2Model(config)[source]
Bases:
GPT2PreTrainedModelgpt2 Model
- construct(input_ids: Tensor, past_key_values: Optional[Tuple[Tuple[Tensor]]] = None, attention_mask: Optional[Tensor] = None, token_type_ids: Optional[Tensor] = None, position_ids: Optional[Tensor] = None, head_mask: Optional[Tensor] = None, inputs_embeds: Optional[Tensor] = None, encoder_hidden_states: Optional[Tensor] = None, encoder_attention_mask: Optional[Tensor] = None)[source]
Defines the computation to be performed. This method must be overridden by all subclasses.
Note
It is not supported currently that inputs contain both tuple and non-tuple types at same time.
- Parameters:
args (tuple) – Tuple of variable parameters.
kwargs (dict) – Dictionary of variable keyword parameters.
- Returns:
Tensor, returns the computed result.
- class mindnlp.models.gpt2.GPT2PreTrainedModel(config)[source]
Bases:
PreTrainedModelAn abstract class to handle weights initialization and a simple interface for downloading and loading pretrained models.
- config_class
alias of
GPT2Config
- convert_torch_to_mindspore(**kwargs)
torch to mindspore.
t5
T5 Model init
- class mindnlp.models.t5.T5Attention(config: T5Config, has_relative_attention_bias=False)[source]
Bases:
Cell- construct(hidden_states, mask=None, key_value_states=None, position_bias=None, past_key_value=None, layer_head_mask=None, query_length=None, use_cache=False, output_attentions=False)[source]
Self-attention (if key_value_states is None) or attention over source sentence (provided by key_value_states).
- class mindnlp.models.t5.T5Block(config, has_relative_attention_bias=False)[source]
Bases:
Cell- construct(hidden_states, attention_mask=None, position_bias=None, encoder_hidden_states=None, encoder_attention_mask=None, encoder_decoder_position_bias=None, layer_head_mask=None, cross_attn_layer_head_mask=None, past_key_value=None, use_cache=False, output_attentions=False)[source]
Defines the computation to be performed. This method must be overridden by all subclasses.
Note
It is not supported currently that inputs contain both tuple and non-tuple types at same time.
- Parameters:
args (tuple) – Tuple of variable parameters.
kwargs (dict) – Dictionary of variable keyword parameters.
- Returns:
Tensor, returns the computed result.
- class mindnlp.models.t5.T5Config(vocab_size=32128, d_model=512, d_kv=64, d_ff=2048, num_layers=6, num_decoder_layers=None, num_heads=8, relative_attention_num_buckets=32, relative_attention_max_distance=128, dropout_rate=0.1, layer_norm_epsilon=1e-06, initializer_factor=1.0, feed_forward_proj='relu', is_encoder_decoder=True, use_cache=True, pad_token_id=0, eos_token_id=1, **kwargs)[source]
Bases:
PreTrainedConfigConfiguration for T5
- class mindnlp.models.t5.T5DenseActDense(config: T5Config)[source]
Bases:
Cell- construct(hidden_states)[source]
Defines the computation to be performed. This method must be overridden by all subclasses.
Note
It is not supported currently that inputs contain both tuple and non-tuple types at same time.
- Parameters:
args (tuple) – Tuple of variable parameters.
kwargs (dict) – Dictionary of variable keyword parameters.
- Returns:
Tensor, returns the computed result.
- class mindnlp.models.t5.T5DenseGatedActDense(config: T5Config)[source]
Bases:
Cell- construct(hidden_states)[source]
Defines the computation to be performed. This method must be overridden by all subclasses.
Note
It is not supported currently that inputs contain both tuple and non-tuple types at same time.
- Parameters:
args (tuple) – Tuple of variable parameters.
kwargs (dict) – Dictionary of variable keyword parameters.
- Returns:
Tensor, returns the computed result.
- class mindnlp.models.t5.T5EncoderModel(config: T5Config)[source]
Bases:
T5PreTrainedModel- construct(input_ids=None, attention_mask=None, head_mask=None, inputs_embeds=None, output_attentions=None, output_hidden_states=None, return_dict=None)[source]
Defines the computation to be performed. This method must be overridden by all subclasses.
Note
It is not supported currently that inputs contain both tuple and non-tuple types at same time.
- Parameters:
args (tuple) – Tuple of variable parameters.
kwargs (dict) – Dictionary of variable keyword parameters.
- Returns:
Tensor, returns the computed result.
- class mindnlp.models.t5.T5ForConditionalGeneration(config: T5Config)[source]
Bases:
T5PreTrainedModel- construct(input_ids=None, attention_mask=None, decoder_input_ids=None, decoder_attention_mask=None, head_mask=None, decoder_head_mask=None, cross_attn_head_mask=None, encoder_outputs=None, past_key_values=None, inputs_embeds=None, decoder_inputs_embeds=None, labels=None, use_cache=None, output_attentions=None, output_hidden_states=None, return_dict=None)[source]
Defines the computation to be performed. This method must be overridden by all subclasses.
Note
It is not supported currently that inputs contain both tuple and non-tuple types at same time.
- Parameters:
args (tuple) – Tuple of variable parameters.
kwargs (dict) – Dictionary of variable keyword parameters.
- Returns:
Tensor, returns the computed result.
- get_input_embeddings()[source]
Returns the model’s input embeddings.
- Returns:
A mindspore cell mapping vocabulary to hidden states.
- Return type:
nn.Cell
- prepare_decoder_input_ids_from_labels(labels: Tensor)[source]
prepare decoder input ids from labels
- class mindnlp.models.t5.T5LayerCrossAttention(config)[source]
Bases:
Cell- construct(hidden_states, key_value_states, attention_mask=None, position_bias=None, layer_head_mask=None, past_key_value=None, use_cache=False, query_length=None, output_attentions=False)[source]
Defines the computation to be performed. This method must be overridden by all subclasses.
Note
It is not supported currently that inputs contain both tuple and non-tuple types at same time.
- Parameters:
args (tuple) – Tuple of variable parameters.
kwargs (dict) – Dictionary of variable keyword parameters.
- Returns:
Tensor, returns the computed result.
- class mindnlp.models.t5.T5LayerFF(config: T5Config)[source]
Bases:
Cell- construct(hidden_states)[source]
Defines the computation to be performed. This method must be overridden by all subclasses.
Note
It is not supported currently that inputs contain both tuple and non-tuple types at same time.
- Parameters:
args (tuple) – Tuple of variable parameters.
kwargs (dict) – Dictionary of variable keyword parameters.
- Returns:
Tensor, returns the computed result.
- class mindnlp.models.t5.T5LayerNorm(hidden_size, eps=1e-06)[source]
Bases:
Cell- construct(hidden_states)[source]
Defines the computation to be performed. This method must be overridden by all subclasses.
Note
It is not supported currently that inputs contain both tuple and non-tuple types at same time.
- Parameters:
args (tuple) – Tuple of variable parameters.
kwargs (dict) – Dictionary of variable keyword parameters.
- Returns:
Tensor, returns the computed result.
- class mindnlp.models.t5.T5LayerSelfAttention(config, has_relative_attention_bias=False)[source]
Bases:
Cell- construct(hidden_states, attention_mask=None, position_bias=None, layer_head_mask=None, past_key_value=None, use_cache=False, output_attentions=False)[source]
Defines the computation to be performed. This method must be overridden by all subclasses.
Note
It is not supported currently that inputs contain both tuple and non-tuple types at same time.
- Parameters:
args (tuple) – Tuple of variable parameters.
kwargs (dict) – Dictionary of variable keyword parameters.
- Returns:
Tensor, returns the computed result.
- class mindnlp.models.t5.T5Model(config: T5Config)[source]
Bases:
T5PreTrainedModel- construct(input_ids=None, attention_mask=None, decoder_input_ids=None, decoder_attention_mask=None, head_mask=None, decoder_head_mask=None, cross_attn_head_mask=None, encoder_outputs=None, past_key_values=None, inputs_embeds=None, decoder_inputs_embeds=None, use_cache=None, output_attentions=None, output_hidden_states=None, return_dict=None)[source]
Defines the computation to be performed. This method must be overridden by all subclasses.
Note
It is not supported currently that inputs contain both tuple and non-tuple types at same time.
- Parameters:
args (tuple) – Tuple of variable parameters.
kwargs (dict) – Dictionary of variable keyword parameters.
- Returns:
Tensor, returns the computed result.
- class mindnlp.models.t5.T5PreTrainedModel(config)[source]
Bases:
PreTrainedModelAn abstract class to handle weights initialization and a simple interface for downloading and loading pretrained models.
- convert_torch_to_mindspore(**kwargs)
torch to mindspore.
- get_input_embeddings()[source]
Returns the model’s input embeddings.
- Returns:
A mindspore cell mapping vocabulary to hidden states.
- Return type:
nn.Cell
- class mindnlp.models.t5.T5Stack(config, embed_tokens=None)[source]
Bases:
T5PreTrainedModel- construct(input_ids=None, attention_mask=None, encoder_hidden_states=None, encoder_attention_mask=None, inputs_embeds=None, head_mask=None, cross_attn_head_mask=None, past_key_values=None, use_cache=None, output_attentions=None, output_hidden_states=None, return_dict=None)[source]
Defines the computation to be performed. This method must be overridden by all subclasses.
Note
It is not supported currently that inputs contain both tuple and non-tuple types at same time.
- Parameters:
args (tuple) – Tuple of variable parameters.
kwargs (dict) – Dictionary of variable keyword parameters.
- Returns:
Tensor, returns the computed result.
ernie
Ernie Model init
- class mindnlp.models.ernie.ErnieConfig(vocab_size: int = 30522, hidden_size: int = 768, num_hidden_layers: int = 12, num_attention_heads: int = 12, task_id=0, intermediate_size: int = 3072, hidden_act: str = 'gelu', hidden_dropout_prob: float = 0.1, attention_probs_dropout_prob: float = 0.1, max_position_embeddings: int = 512, task_type_vocab_size: int = 3, type_vocab_size: int = 16, initializer_range: float = 0.02, pad_token_id: int = 0, pool_act: str = 'tanh', fuse: bool = False, layer_norm_eps=1e-12, use_cache=False, use_task_id=True, enable_recompute=False, **kwargs)[source]
Bases:
PreTrainedConfigConfiguration for Ernie.
- class mindnlp.models.ernie.ErnieEmbeddings(config: ErnieConfig, embedding_table)[source]
Bases:
CellErnie Embeddings for word, position and token_type embeddings.
- construct(input_ids: Optional[Tensor] = None, token_type_ids: Optional[Tensor] = None, position_ids: Optional[Tensor] = None, task_type_ids: Optional[Tensor] = None, inputs_embeds: Optional[Tensor] = None, past_key_values_length: int = 0)[source]
Defines the computation to be performed. This method must be overridden by all subclasses.
Note
It is not supported currently that inputs contain both tuple and non-tuple types at same time.
- Parameters:
args (tuple) – Tuple of variable parameters.
kwargs (dict) – Dictionary of variable keyword parameters.
- Returns:
Tensor, returns the computed result.
- class mindnlp.models.ernie.ErnieModel(config: ErnieConfig)[source]
Bases:
ErniePretrainedModelErnie model.
- construct(input_ids: Optional[Tensor] = None, token_type_ids: Optional[Tensor] = None, position_ids: Optional[Tensor] = None, attention_mask: Optional[Tensor] = None, task_type_ids: Optional[Tensor] = None, past_key_values: Optional[Tuple[Tuple[Tensor]]] = None, inputs_embeds: Optional[Tensor] = None, output_hidden_states: Optional[bool] = None, output_attentions: Optional[bool] = None, return_dict: Optional[bool] = None)[source]
Defines the computation to be performed. This method must be overridden by all subclasses.
Note
It is not supported currently that inputs contain both tuple and non-tuple types at same time.
- Parameters:
args (tuple) – Tuple of variable parameters.
kwargs (dict) – Dictionary of variable keyword parameters.
- Returns:
Tensor, returns the computed result.
- class mindnlp.models.ernie.ErniePooler(config: ErnieConfig, weight_init)[source]
Bases:
CellErnie Pooler.
- construct(hidden_states)[source]
Defines the computation to be performed. This method must be overridden by all subclasses.
Note
It is not supported currently that inputs contain both tuple and non-tuple types at same time.
- Parameters:
args (tuple) – Tuple of variable parameters.
kwargs (dict) – Dictionary of variable keyword parameters.
- Returns:
Tensor, returns the computed result.
- class mindnlp.models.ernie.UIE(config: ErnieConfig)[source]
Bases:
ErniePretrainedModelUIE model based on Ernie.
- construct(input_ids: Optional[Tensor] = None, token_type_ids: Optional[Tensor] = None, position_ids: Optional[Tensor] = None, attention_mask: Optional[Tensor] = None, inputs_embeds: Optional[Tensor] = None, return_dict: Optional[Tensor] = None)[source]
Defines the computation to be performed. This method must be overridden by all subclasses.
Note
It is not supported currently that inputs contain both tuple and non-tuple types at same time.
- Parameters:
args (tuple) – Tuple of variable parameters.
kwargs (dict) – Dictionary of variable keyword parameters.
- Returns:
Tensor, returns the computed result.
- class mindnlp.models.ernie.UIEConfig(vocab_size: int = 40000, hidden_size: int = 768, num_hidden_layers: int = 12, num_attention_heads: int = 12, task_id=0, intermediate_size: int = 3072, hidden_act: str = 'gelu', hidden_dropout_prob: float = 0.1, attention_probs_dropout_prob: float = 0.1, max_position_embeddings: int = 2048, task_type_vocab_size: int = 3, type_vocab_size: int = 4, initializer_range: float = 0.02, pad_token_id: int = 0, pool_act: str = 'tanh', fuse: bool = False, layer_norm_eps=1e-12, use_cache=False, use_task_id=True, enable_recompute=False, **kwargs)[source]
Bases:
PreTrainedConfigConfiguration for UIE.
roberta
Roberta model.
- class mindnlp.models.roberta.RobertaClassificationHead(config)[source]
Bases:
Cell- construct(features)[source]
Defines the computation to be performed. This method must be overridden by all subclasses.
Note
It is not supported currently that inputs contain both tuple and non-tuple types at same time.
- Parameters:
args (tuple) – Tuple of variable parameters.
kwargs (dict) – Dictionary of variable keyword parameters.
- Returns:
Tensor, returns the computed result.
- class mindnlp.models.roberta.RobertaConfig(vocab_size=30522, hidden_size=768, num_hidden_layers=12, num_attention_heads=12, intermediate_size=3072, hidden_act='gelu', hidden_dropout_prob=0.1, attention_probs_dropout_prob=0.1, max_position_embeddings=512, type_vocab_size=2, initializer_range=0.02, layer_norm_eps=1e-12, classifier_dropout=None, **kwargs)[source]
Bases:
BertConfigRoberta Config.
- class mindnlp.models.roberta.RobertaEmbeddings(config)[source]
Bases:
BertEmbeddingsRoberta embeddings
- construct(input_ids, token_type_ids=None, position_ids=None)[source]
Defines the computation to be performed. This method must be overridden by all subclasses.
Note
It is not supported currently that inputs contain both tuple and non-tuple types at same time.
- Parameters:
args (tuple) – Tuple of variable parameters.
kwargs (dict) – Dictionary of variable keyword parameters.
- Returns:
Tensor, returns the computed result.
- class mindnlp.models.roberta.RobertaForMaskedLM(config, *args, **kwargs)[source]
Bases:
RobertaPreTrainedModel- construct(input_ids, attention_mask=None, token_type_ids=None, position_ids=None, head_mask=None, masked_lm_labels=None)[source]
Defines the computation to be performed. This method must be overridden by all subclasses.
Note
It is not supported currently that inputs contain both tuple and non-tuple types at same time.
- Parameters:
args (tuple) – Tuple of variable parameters.
kwargs (dict) – Dictionary of variable keyword parameters.
- Returns:
Tensor, returns the computed result.
- class mindnlp.models.roberta.RobertaForMultipleChoice(config, *args, **kwargs)[source]
Bases:
RobertaPreTrainedModel- construct(input_ids, token_type_ids=None, attention_mask=None, labels=None, position_ids=None, head_mask=None)[source]
Defines the computation to be performed. This method must be overridden by all subclasses.
Note
It is not supported currently that inputs contain both tuple and non-tuple types at same time.
- Parameters:
args (tuple) – Tuple of variable parameters.
kwargs (dict) – Dictionary of variable keyword parameters.
- Returns:
Tensor, returns the computed result.
- class mindnlp.models.roberta.RobertaForSequenceClassification(config, *args, **kwargs)[source]
Bases:
RobertaPreTrainedModel- construct(input_ids, attention_mask=None, token_type_ids=None, position_ids=None, head_mask=None, labels=None)[source]
Defines the computation to be performed. This method must be overridden by all subclasses.
Note
It is not supported currently that inputs contain both tuple and non-tuple types at same time.
- Parameters:
args (tuple) – Tuple of variable parameters.
kwargs (dict) – Dictionary of variable keyword parameters.
- Returns:
Tensor, returns the computed result.
- class mindnlp.models.roberta.RobertaLMHead(config)[source]
Bases:
Cell- construct(features)[source]
Defines the computation to be performed. This method must be overridden by all subclasses.
Note
It is not supported currently that inputs contain both tuple and non-tuple types at same time.
- Parameters:
args (tuple) – Tuple of variable parameters.
kwargs (dict) – Dictionary of variable keyword parameters.
- Returns:
Tensor, returns the computed result.
- class mindnlp.models.roberta.RobertaModel(config, add_pooling_layer=True)[source]
Bases:
BertModelRoberta Model
- class mindnlp.models.roberta.RobertaPreTrainedModel(config)[source]
Bases:
BertPreTrainedModelRoberta Pretrained Model.
- config_class
alias of
RobertaConfig