0 votes
1 answer
117 views

I'm trying to manually verify that an old Electrum bitcoin wallet of mine contains keys that were derived from its seed as expected. One step in the key derivation requires that I hash the seed via ...
AnarchyMyDear's user avatar
-1 votes
1 answer
88 views

I have one map, Map<Long, List<Set<Long>>> map1; for example: Map<City_Id, List<Set<Subtier_Cities_ids>>>. There is another map, Map<Long, Set<Long>> map2; ...
Klaus's user avatar
  • 77
-2 votes
1 answer
262 views

I tried to use dask delayed to speed up a loop whose iteration is done by the map function. The problem is that after dd.compute(), the result list has an extra level of brackets, so I cannot get a proper dataframe. ...
user19858347's user avatar
0 votes
0 answers
201 views

Hi, I am in the process of writing Python code to input, search, and edit data in Excel. My chosen format for the UI is PySimpleGUI and my xlsx package is openpyxl. I have been coding for approximately 3 ...
Wesley Ryman's user avatar
2 votes
3 answers
381 views

This is the first question I have asked here; I hope I am doing this correctly! I have a dataset with millions of observations. Each row is a drug prescription picked up by different individuals on different ...
Margherita Bigossi's user avatar
2 votes
1 answer
92 views

I'm reaching the maximum recursion depth. I've been trying to use np.tensordot(), but I couldn't really get an insight into how to use it in this case. def stopping_condtion(a,V,V_old,eps): return ...
Max's user avatar
  • 437
1 vote
2 answers
214 views

I have looked this over and am wondering where my math issue is. I believe that it should be calculating correctly, but the floats do not round up (e.g. .75 to 1) to add to the count for births/deaths. I ...
Lee's user avatar
  • 11
0 votes
2 answers
123 views

I found this sample in a book, and this is the first time that I have seen this notation. Obviously it's a thousand times shorter than writing a switch, but what is it? When I do typeof(status) it returns ...
Karleen-Bx's user avatar
0 votes
1 answer
403 views

While studying MDPs via different sources, I came across two different formulas for the value update in the Value-Iteration algorithm. The first one is (the one on Wikipedia and in a couple of books): . And ...
jaja360's user avatar
  • 55
3 votes
1 answer
3k views

We know that policy iteration gives us the policy directly and hence is faster. But can anyone explain this with some examples?
shmi's user avatar
  • 33
1 vote
1 answer
1k views

I am currently studying dynamic programming in reinforcement learning, where I came across two concepts: Value-Iteration and Policy-Iteration. To understand them, I am implementing the gridworld ...
POOJA GUPTA's user avatar
  • 2,355
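As a rough illustration of the value-iteration concept mentioned in the excerpt above, here is a minimal Python sketch. The 4-state corridor, the reward of 1 for entering the last state, the discount factor 0.9, and the function name `value_iteration` are all assumptions made for illustration; none of them come from the question, whose gridworld is not shown.

```python
def value_iteration(n_states=4, gamma=0.9, eps=1e-9):
    """Value iteration on a hypothetical 1-D corridor (illustrative only).

    States are 0..n_states-1 and the last state is terminal.
    Actions move one step left or right (clipped at the left wall),
    and entering the terminal state yields a reward of 1.
    """
    terminal = n_states - 1
    V = [0.0] * n_states  # the terminal state's value stays 0
    while True:
        delta = 0.0
        for s in range(terminal):
            candidates = []
            for step in (-1, 1):
                s_next = min(max(s + step, 0), terminal)
                reward = 1.0 if s_next == terminal else 0.0
                candidates.append(reward + gamma * V[s_next])
            best = max(candidates)  # Bellman optimality backup
            delta = max(delta, abs(best - V[s]))
            V[s] = best  # in-place update within the sweep
        if delta < eps:
            return V


values = value_iteration()
```

With these assumed settings the values grow toward the rewarding end of the corridor, which is the behaviour a gridworld implementation should reproduce on a larger scale.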
0 votes
6 answers
958 views

Assuming a list of dictionaries, the goal is to iterate through all the distinct values in all the dictionaries. Example: d1={'a':1, 'c':3, 'e':5} d2={'b':2, 'e':5, 'f':6} l=[d1,d2] The iteration ...
Krzysztof Słowiński's user avatar
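One possible way to iterate over the distinct values described in the excerpt above is sketched below. The helper name `distinct_values` is hypothetical, and the sketch assumes the values are hashable and that first-seen order is acceptable; the truncated question may have had other requirements.

```python
def distinct_values(dicts):
    """Yield each distinct value across all dictionaries, in first-seen order."""
    seen = set()
    for d in dicts:
        for value in d.values():
            if value not in seen:  # requires hashable values
                seen.add(value)
                yield value


d1 = {'a': 1, 'c': 3, 'e': 5}
d2 = {'b': 2, 'e': 5, 'f': 6}
l = [d1, d2]

result = list(distinct_values(l))
```

A generator is used so that the values can be consumed lazily; if order does not matter, building a single set from all the values would be shorter.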
2 votes
1 answer
1k views

I am taking a Reinforcement Learning class and I didn’t understand how to combine the concepts of policy iteration/value iteration with Monte Carlo (and also TD/SARSA/Q-learning). In the table below, ...
Johan's user avatar
  • 899
1 vote
0 answers
569 views

I am performing prioritized sweeping, for which I have a matrix of 1000*1000 cells (a gridworld). I have to access its cells repeatedly in a while-true loop for assignment (I am not essentially ...
SH_V95's user avatar
  • 161
4 votes
2 answers
13k views

I can find only theory or Python examples, neither of which is satisfactory for a beginner. I just need a simple example to understand the step-by-step iterations. Could anyone please show me ...
Ahasan Ratul's user avatar
