Skip to main content

Questions tagged [papers]

For questions related to artificial intelligence research papers. So, you should use this tag if you want someone to clarify something in a research paper.

0 votes
0 answers
12 views

There has been a lot of criticism of ARC-AGI 3 online. A common argument is that humans have a perceptual advantage since we observe a rendered grid, while LLMs only receive a JSON representation. ...
GardenPots's user avatar
0 votes
1 answer
34 views

With reference to this paper Towards Adaptive Prefix Tuning for Parameter-Efficient Language Model Fine-tuning, I have following difficulties in understanding the implementation: • "$h_{i−1}$ is ...
Ritil's user avatar
  • 101
4 votes
1 answer
303 views

"Cognitive Computation" appears to be a serious peer-reviewed Springer journal. This paper was published therein in 2024: Hassija, Vikas/Vinay Chamola/Atmesh Mahapatra/Abhinandan Singal/...
Pascal's user avatar
  • 43
1 vote
1 answer
96 views

I have a question about the token compression in the native sparse attention in https://arxiv.org/pdf/2502.11089. When we compute the attention of $q_t$ and $K^{\sim cmp }_t$, is $K^{\sim cmp }_t$ ...
HIH's user avatar
  • 121
1 vote
1 answer
165 views

What paper introduced settings like system prompt, temperature, top P, top K, repetition penalty, etc. that are used in transformers? Was in the original Attention is All You Need paper?
user avatar
1 vote
1 answer
213 views

The original TRPO paper stated an algorithm that used optimization of the following surrogate objective: $$ L_\pi(\tilde{\pi})=\eta(\pi)+\sum_s \rho_\pi(s) \sum_a \tilde{\pi}(a \mid s) A_\pi(s, a) $$ ...
Rubi Shnol's user avatar
2 votes
1 answer
227 views

The original TRPO paper stated an algorithm that used optimization of the following surrogate objective: $$ L_\pi(\tilde{\pi})=\eta(\pi)+\sum_s \rho_\pi(s) \sum_a \tilde{\pi}(a \mid s) A_\pi(s, a) $$ ...
Rubi Shnol's user avatar
1 vote
1 answer
217 views

I saw OpenAI discontinue their GPT detection tool even though is was performing fairly well according to this paper: https://doi.org/10.48550/arXiv.2306.05524 Why to discontinue it, is it really ...
Atrox's user avatar
  • 61
1 vote
2 answers
165 views

An MDP is ergodic if the Markov chain induced by any policy is ergodic, which means any state is reachable from any other state by following a suitable policy. [Source] The part after "which ...
Simon's user avatar
  • 273
0 votes
0 answers
18 views

You can find the afore-mentioned text here. I'm asking a clarification of Definition 17, which is as follows: Definition 17 (Effect Identifiability) The causal effect of an action $do(X=x)$ on a set ...
AyamGorengPedes's user avatar
2 votes
1 answer
116 views

I am thinking about making use of ByteNet (https://arxiv.org/abs/1610.10099) architecture for a project, and would like to get a better understanding of how the model works. I've read through the ...
Philippa Richter's user avatar
1 vote
1 answer
86 views

This comes from the paper, Reinforcement Learning and Control as Probabilistic Inference: Tutorial and Review. I don't know the why the following derivation is true. The paper only briefly explains ...
yeebo xie's user avatar
0 votes
1 answer
48 views

I have read a news article relating to recent research into graph theory-based fraud detection using machine learning, but cannot find the source code for the paper discussed. The paper can be found ...
Rasai Stewart's user avatar
2 votes
0 answers
134 views

I was studying AI when a question came to my mind. I have to write a 6-pages essay on a chosen topic in AI, and I think it can be interesting to reflect on the measure in which machines should have a ...
Amanda Wealth's user avatar
1 vote
2 answers
2k views

When I start to learn about RAG, I can't understand the difference between the RAG-Sequence Model and RAG-Token Model. First of all, I see that for RAG-Sequence Model we use just one document to ...
LAILA EL OUEDEGHYRY's user avatar

15 30 50 per page
1
2 3 4 5
23