All Questions
17 questions
0
votes
1
answer
159
views
topic modeling from quotes
Based on the following link: quotes
with the help of the following code (this site is based on JavaScript, so I disabled it first)
import selenium
from selenium import webdriver
from selenium....
2
votes
1
answer
118
views
BERTopic: add legend to term score decline
I plot the term score decline for a topic model I created on Google Colab with BERTopic. Great function. Works neat! But I need to add a legend. This parameter is not specified in the topic_model....
-1
votes
1
answer
1k
views
BERTopic classifying over a quarter of documents in outlier topic -1
I am running BERTopic with default options
import pandas as pd
from sentence_transformers import SentenceTransformer
import time
import pickle
from bertopic import BERTopic
llm_mod = "all-...
1
vote
0
answers
198
views
BERTopic Visualization in dark
I want to change the default visualizations within BERTopic to display a dark theme rather than a white or bright theme.
Basically I'm trying to do:
import plotly.io as pio
pio.templates.default ...
4
votes
3
answers
714
views
Jupyter keeps crashing when using BERTopic's fit_transform()
topics, probs = topic_model.fit_transform(docs)
Whenever I run fit_transform like in the line above, my Jupyter notebook keeps dying, and I don't know why. I am using Python 3.9.15 on a macOS 13.4.1 ...
1
vote
0
answers
437
views
What if I have too many documents labelled in -1 cluster in bertopic?
I'm generating topics using BERTopic on a multilingual dataset (mainly Russian and English). I'm reducing the number of topics to 140. After generating topics, I'm analyzing their quality using the ...
1
vote
2
answers
3k
views
Evaluating a BERTopic model based on classification metrics
I am unable to find a solution to a problem I have with checking coherence scores for my topic models created using BERTopic. I am new to using these methods for NLP and especially new to using Python....
3
votes
0
answers
373
views
How long does importing bertopic usually take?
I have been trying this code; it has been running for about 24 hours and is still running. Is this normal, or should I be doing something else?
from bertopic import BERTopic
Device name LAPTOP-...
0
votes
1
answer
3k
views
Trouble installing BERTopic's dependency 'bertopic.dimensionality'
I'm trying to run the following code from the BERTopic documentation:
from bertopic import BERTopic
from bertopic.dimensionality import BaseDimensionalityReduction
# Fit BERTopic without actually ...
0
votes
1
answer
423
views
How can I get the coordinate information of model.visualize_topics() function - BERTopic
I am trying to get the coordinate information for the docs placed on the graph by model.visualize_topics() for my BERTopic topic analysis project. Is there any way to see the source code of the ...
0
votes
1
answer
907
views
BERTopic / umap crashing with cosine metric
I'm using the Python bertopic library for topic modeling of Polish tweets in a straightforward way, i.e.,
from sentence_transformers import SentenceTransformer
from bertopic import BERTopic
model = ...
3
votes
1
answer
1k
views
Bertopic assign topics to data frame
I have built a topic model with BERTopic.
After getting the topics, how can I assign them to the dataset?
My main aim is to convert unsupervised topic modelling to supervised multi-label classification ...
3
votes
1
answer
1k
views
How to get topic-probs matrix in bertopic modeling
I ran BERTopic to get topics for 3,500 documents. How could I get the topic-probs matrix for each document and export them to csv? When I export them, I want to export the identifier of each document ...
0
votes
1
answer
332
views
BerTopic Model - Visualization ignores 0th index
The BERTopic model produced the topics below:
As you can see from the above, the model is fine-tuned to generate fewer outliers ('-1'), which has a count of 3 and appears last.
While ...
0
votes
0
answers
489
views
NLP Text Classification Model with defined categories / context / intent
This is more of a guideline question rather than a technical query. I am looking to create a classification model that classifies documents based on a specific list of strings. However, as it turns ...