
All Questions

0 votes
1 answer
159 views

topic modeling from quotes

Based on the following link: quotes, and with the help of the following code (the site relies on JavaScript, so I disabled it first): import selenium from selenium import webdriver from selenium....
2 votes
1 answer
118 views

BERTopic: add legend to term score decline

I plot the term score decline for a topic model I created on Google Colab with BERTopic. Great function. Works neat! But I need to add a legend. This parameter is not specified in the topic_model....
Simone
  • 625
-1 votes
1 answer
1k views

BERTopic classifying over a quarter of documents in outlier topic -1

I am running BERTopic with default options: import pandas as pd from sentence_transformers import SentenceTransformer import time import pickle from bertopic import BERTopic llm_mod = "all-...
RM-
  • 1,018
1 vote
0 answers
198 views

BERTopic Visualization in dark

I want to change the default visualizations within BERTopic to display a dark theme rather than a white or bright theme. Basically I'm trying to do: import plotly.io as pio pio.templates.default ...
RobjSky
  • 31
4 votes
3 answers
714 views

Jupyter keeps crashing when using BERTopic's fit_transform()

topics, probs = topic_model.fit_transform(docs) Whenever I run fit_transform like in the line above, my Jupyter notebook keeps dying, and I don't know why. I am using Python 3.9.15 on a macOS 13.4.1 ...
Jethro R. Lee
1 vote
0 answers
437 views

What if I have too many documents labelled in the -1 cluster in BERTopic?

I'm generating topics using BERTopic on a multilingual dataset (mainly Russian and English). I'm reducing the number of topics to 140. After generating topics, I'm analyzing their quality using the ...
ApaarBawa
1 vote
2 answers
3k views

Evaluating a BERTopic model based on classification metrics

I am unable to find a solution to a problem I have with checking coherence scores for my topic models created using BERTopic. I am new to using these methods for NLP and especially new to using Python....
Tim Mooney
3 votes
0 answers
373 views

How long does importing bertopic usually take?

I have been trying this code, and it has been running for about 24 hours and is still running. Is this normal or should I be doing something else? from bertopic import BERTopic Device name LAPTOP-...
Ahmed Ayman
0 votes
1 answer
3k views

Trouble installing BERTopic's dependency 'bertopic.dimensionality'

I'm trying to run the following code from the BERTopic documentation: from bertopic import BERTopic from bertopic.dimensionality import BaseDimensionalityReduction # Fit BERTopic without actually ...
jvqp
  • 269
0 votes
1 answer
423 views

How can I get the coordinate information of model.visualize_topics() function - BERTopic

I am trying to get the coordinate information of the docs placed on the graph by model.visualize_topics() for my BERTopic topic analysis project. Is there any way to see the source code of the ...
selen arslan
0 votes
1 answer
907 views

BERTopic / umap crashing with cosine metric

I'm using the Python bertopic library for topic modeling of Polish tweets in a straightforward way, i.e., from sentence_transformers import SentenceTransformer from bertopic import BERTopic model = ...
Julek Sienkiewicz
3 votes
1 answer
1k views

BERTopic: assign topics to data frame

I have built a topic model with BERTopic. After getting the topics, how can I assign them to the dataset? My main aim is to convert unsupervised topic modelling to supervised multi-label classification ...
Bhola Bsm
3 votes
1 answer
1k views

How to get the topic-probs matrix in BERTopic modeling

I ran BERTopic to get topics for 3,500 documents. How could I get the topic-probs matrix for each document and export them to csv? When I export them, I want to export the identifier of each document ...
JJD
  • 31
0 votes
1 answer
332 views

BERTopic Model - Visualization ignores 0th index

The BERTopic model produced the topics below: As you can see from the above, the model is fine-tuned to generate fewer outliers; the outlier topic '-1' has a count of 3 and appears last. While ...
Woody
  • 939
0 votes
0 answers
489 views

NLP Text Classification Model with defined categories / context / intent

This is more of a guideline question rather than a technical query. I am looking to create a classification model that classifies documents based on a specific list of strings. However, as it turns ...
Kausty
  • 859
