How to add a dense layer on top of SentenceTransformer?

January 15, 2024

The tutorial "Train and Fine-Tune Sentence Transformers Models" shows how to create a SentenceTransformer by combining a word embedding module with a pooling layer:

from sentence_transformers import SentenceTransformer, models

## Step 1: use an existing language model
word_embedding_model = models.Transformer('distilroberta-base')

## Step 2: use a pool function over the token embeddings
pooling_model = models.Pooling(word_embedding_model.get_word_embedding_dimension())

## Join steps 1 and 2 using the modules argument
model = SentenceTransformer(modules=[word_embedding_model, pooling_model])

# model.encode("Hi there")  # => works fine

And then they say:

If necessary, additional layers can be added, for example, dense, bag of words, and convolutional.

I tried to add a dense layer on top of the model, but I’m getting an error:

import torch
from sentence_transformers import SentenceTransformer, models

## Step 1: use an existing language model
word_embedding_model = models.Transformer('distilroberta-base')

## Step 2: use a pool function over the token embeddings
pooling_model = models.Pooling(word_embedding_model.get_word_embedding_dimension())

##  My Dense Layer
dense_layer = torch.nn.Linear(pooling_model.get_sentence_embedding_dimension(), 128)

## Join steps 1 and 2 using the modules argument
model = SentenceTransformer(modules=[word_embedding_model, pooling_model, dense_layer])

And when I run model.encode("hi there") I get:

TypeError: linear(): argument 'input' (position 1) must be Tensor, not dict

I found the same error here but using BertModel.from_pretrained, not models.Transformer. The suggested answer (passing the argument return_dict=False) doesn’t work:

word_embedding_model = models.Transformer('distilroberta-base', return_dict=False)

TypeError: Transformer.__init__() got an unexpected keyword argument 'return_dict'

Any ideas how to add a dense layer correctly?

Solution:

According to the documentation, replace this line:

dense_layer = torch.nn.Linear(pooling_model.get_sentence_embedding_dimension(), 128)

with the following:

dense_layer = models.Dense(pooling_model.get_sentence_embedding_dimension(), 128)

sentence-transformers

by MR

Published January 15, 2024
