Newest 'bert-language-model' Questions

0 votes

1 answer

35 views

How can I decide how many epochs to train for when re-training a model on the full dataset without a validation set?

I have a BERT model that I want to fine-tune. Initially, I use a training dataset, which I split into a training and validation set. During fine-tuning, I monitor the validation loss to ensure that ...

Rishi Garg

1

asked Mar 31 at 15:40

2 votes

1 answer

81 views

How to Identify Similar Code Parts Using CodeBERT Embeddings?

I'm using CodeBERT to compare how similar two pieces of code are. For example: # Code 1 def calculate_area(radius): return 3.14 * radius * radius # Code 2 def compute_circle_area(r): return 3.14159 * ...

Nep

21

asked Mar 20 at 14:30

0 votes

0 answers

25 views

How to change last layer in finetuned model?

When fine-tuning HuBERT for phoneme detection, I started from a fine-tuned ASR HuBERT model, removed the last two layers, and added a linear layer sized to the phoneme vocab_size in the config. What is ...

Ngoc Anh

1

asked Feb 24 at 8:47

0 votes

0 answers

49 views

How many obs per class are necessary? - transfer learning w. BERT fine-tuning

I seek advice on a classification problem in industry. The rows in a dataset must be classified/labeled--it lacks a target/column (labels have dot-separated levels like 'x.x.x.x.x.x.x')--during every ...

Johan

226

asked Feb 13 at 17:54

0 votes

0 answers

28 views

How to detect out-of-vocabulary words in a prompt

I need to detect words an LLM has no knowledge about, so that a RAG-based definition of each such word can be added to the prompt, e.g.: What is the best way to achieve slubalisme using the new fabridocium product?, ...

aguadoe

168

asked Jan 29 at 18:15

0 votes

1 answer

117 views

Why is my BERT model producing NaN loss during training for multi-label classification on imbalanced data?

I’m running into a frustrating issue while training a BERT-based multi-label text classification model on an imbalanced dataset. After a few epochs, the training loss suddenly becomes NaN, and I can’t ...

Erhan Arslan

26

asked Jan 28 at 13:03

0 votes

1 answer

123 views

torch.OutOfMemoryError: CUDA out of memory. (Google Colab)

I tried to adapt the mBERT model to existing code. However, I keep getting the following error even after trying different solutions: torch.OutOfMemoryError: CUDA out of memory. Tried to allocate 20....

MarMarhoun

55

asked Jan 24 at 23:41

0 votes

0 answers

47 views

Is it possible to feed embeddings generated by BERT to an LSTM-based autoencoder to get the latent space?

I've just learned how BERT produces embeddings. I might not understand it fully. I was thinking of doing a project that leverages those embeddings and feeds them to an autoencoder to generate latent ...

Nik Imran

1

asked Jan 24 at 3:28

0 votes

0 answers

16 views

Is it possible to evaluate Machine Translations using Sentence BERT?

I'm not referring to BERTScore. BERTScore uses token-level word embeddings: you compute the pairwise cosine similarity of the word embeddings and obtain scores using greedy matching. I'm referring to Sentence ...

Yuirike

41

asked Jan 23 at 13:18
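
The token-level greedy matching that the excerpt above attributes to BERTScore can be sketched roughly as follows. This is only an illustration of the matching step, not the official BERTScore implementation: the 2-dimensional vectors are made-up stand-ins for real BERT token embeddings, and the function names `cosine` and `greedy_match_score` are hypothetical.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def greedy_match_score(cand_emb, ref_emb):
    """Greedy-matching precision, recall and F1 over token embeddings.

    cand_emb, ref_emb: lists of token vectors (hypothetical toy inputs here;
    in practice these would come from a BERT-style encoder).
    """
    # Pairwise similarity matrix: rows = candidate tokens, columns = reference tokens.
    sim = [[cosine(c, r) for r in ref_emb] for c in cand_emb]

    # Precision: each candidate token is matched to its most similar reference token.
    precision = sum(max(row) for row in sim) / len(cand_emb)

    # Recall: each reference token is matched to its most similar candidate token.
    recall = sum(
        max(sim[i][j] for i in range(len(cand_emb)))
        for j in range(len(ref_emb))
    ) / len(ref_emb)

    total = precision + recall
    f1 = 2 * precision * recall / total if total > 0 else 0.0
    return precision, recall, f1


# Toy embeddings (assumed values, for illustration only).
candidate = [[1.0, 0.0], [0.0, 1.0]]
reference = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]

p, r, f = greedy_match_score(candidate, reference)
print(f"precision={p:.4f} recall={r:.4f} f1={f:.4f}")
```

A sentence-level approach, by contrast, would compare a single pooled vector per sentence, so there is no token matching step at all.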

0 votes

0 answers

49 views

Introducing additional layers (dropout and dense layers) after BERT's output

I'm working on a BERT-based model for fake news detection. While applying additional layers (because the BERT model alone does not reach good accuracy), like dropout and fully connected ...

Abrar Hussain

25

asked Jan 10 at 10:34

1 vote

1 answer

231 views

ValueError: Exception encountered when calling layer 'tf_bert_model' (type TFBertModel)

I have been trying to run TFBertModel from Transformers, but it keeps throwing this error: ValueError Traceback (most recent call last) Cell In[9], line 1 ----> 1 ...

Faiz khan

13

asked Dec 26, 2024 at 15:53

1 vote

0 answers

140 views

Why does my PyTorch DataLoader only use one CPU core despite setting num_workers>1?

I am trying to fine-tune BERT for a multi-label classification task (Jigsaw toxic comments). I created a custom dataset and DataLoader as follows: class CustomDataSet(Dataset): def __init__(...

Hyppolite

57

asked Dec 26, 2024 at 13:21

0 votes

0 answers

17 views

TypeError: Tuple indices must be integers or slices, not dict

I am trying to add a learning rate scheduler to a finBERT model. However, I am getting the error "TypeError: tuple indices must be integers or slices, not dict" while training. #...

B W

1

asked Dec 10, 2024 at 18:27

0 votes

0 answers

24 views

PackagesNotFound error even when verified packages as installed

I am trying to follow this tutorial for BERT topic modeling: https://jpcompartir.github.io/BertopicR/ library(reticulate) reticulate::install_miniconda() library(BertopicR) BertopicR::...

coolhand

2,109

asked Dec 9, 2024 at 3:06

1 vote

2 answers

65 views

dropout(): argument 'input' (position 1) must be Tensor, not str BERT Issue

I was trying to run some epochs to train my sentiment analysis model; at the very last pass, training stopped with the error in the title. The code is attached here: Sentiment classifier: # Build ...

Laura Valentini

11

asked Dec 7, 2024 at 9:47
