Questions tagged [inference]
The inference tag has no summary.
32 questions
2
votes
0
answers
32
views
How to improve fine-tuning for task dependency extraction?
I'm trying to fine-tune a LLaMA 3.1 Instruct model to adapt it to a specific industrial domain. The goal is to have the model extract direct dependencies between tasks from a list of operational steps ...
1
vote
0
answers
39
views
Fine-tuning Llama 3 to generate task dependencies (industrial planning)
I'm working on fine-tuning a language model (Meta-Llama-3-8B-Instruct) to generate a dependency graph for industrial tasks. The idea is: given a list of unordered tasks, the model should output a ...
1
vote
0
answers
130
views
ML models that train on graphs but infer without any edges (edge prediction task)
I'm exploring a machine learning research direction and I'm looking for ideas or pointers to existing models/projects that fit the following setup:
The model is trained on graphs with edge information ...
4
votes
1
answer
131
views
Which model is the best suitable for generating edges?
I'm trying to develop a model who'd be able to generate dependencies between industrial tasks. In order to do that, i went for the GNN solution : i have nodes = tasks, dependencies = edges, and have ...
8
votes
1
answer
216
views
How to correctly perform link prediction inference on a new, unseen graph?"
I'm working on an industrial AI use case where I train a Graph Neural Network (GCN) for link prediction — specifically, to predict successor tasks in project planning graphs (e.g., for construction or ...
1
vote
0
answers
25
views
Different inference times by factor 10 for two models with similar architecture and training data
I am currently trying to replicate or come close to the results achieved in the Multispider paper, which is a multilingual Text-to-SQL benchmark. I have downloaded the model used by the authors of the ...
2
votes
1
answer
360
views
Detection of musical instruments using Yamnet
My goal is to detect musical instruments with AI (machine learning).
I'm currently using the Yamnet model to make inferences, but it has a very wide range of categories, for example, "Growling&...
1
vote
1
answer
904
views
How to Use Multiple Adapters with a Pretrained Model in Hugging Face Transformers for Inference?
I have a pretrained Llama-2 model in the models_hf directory and two fine-tuned adapters: a summarization adapter in ...
0
votes
0
answers
83
views
What is the distinction between clinical significance and statistical significance?
https://en.m.wikipedia.org/wiki/Clinical_significance
The Link says
"Jacobson-Truax is common method of calculating clinical significance. It involves calculating a Reliability Change Index (RCI)....
1
vote
0
answers
294
views
What is the best way to combine cross-validation and bootstrapping for one application?
We intend to model data with non-parametric covariate splines and we would like to understand the uncertainty of the parameter estimates/response estimates.
Currently, we use cross-validation to model ...
1
vote
1
answer
173
views
How can I deal with a computationally expensive simulator method in Sequential Monte Carlo/Approximate Bayesian Computation?
I am doing Approximate Bayesian Computation with Sequential Monte Carlo with PyMC in a way that is similar to what is described in this example of the ...
0
votes
0
answers
54
views
Statistical significance on aggregate data to show that the groups are different?
I am working with performance data for three groups for each region. The denominator for the groups is the number of people who are identified as low performers. For region A, Group-1 low performer %= ...
0
votes
1
answer
126
views
Best practices for serving user-specific large models in a web application?
First execuse any naive statement you may find below, i'm a newcomer to the field.
How do web applications that integrate fine-tuning of large machine learning/deep learning models handle the storage ...
1
vote
1
answer
820
views
Inference Process in Autoregressive Transformer Architecture
I'm abit confused about how the inference/prediction process works in transformer.
For simplicity suppose it is a transformer for translation. My understanding is that when training, the whole input ...
0
votes
1
answer
99
views
What are the requirements for a word list to be used for Bayesian inference?
Intro
I need an input file of 5 letter English words to train my Bayesian model to infer the stochastic dependency between each position. For instance, is the probability of a letter at the position 5 ...