1 vote
0 answers
42 views

I'm trying to use Modin with a Dask LocalCluster to parallelize pandas DataFrame operations in a Django application (Python 3.13). Even with processes=False (thread-based workers, same process), the ...
Atul Jaiswal
1 vote
0 answers
37 views

I am new to parallel processing with Dask. I have two columns of IPv4 addresses in a loaded multi-partition DataFrame, and I cannot seem to find a good method to encode them in order to train a ...
Vasile Olteanu
1 vote
0 answers
26 views

I'm converting HDF5 to OME-Zarr with Dask. Right now I'm using a small dataset with shape (150, 3768, 2008), approximately 4.5 GB in size. My target chunks are (64, 64, 64). I'm running ...
Tobias Schleiss-Andreassen
3 votes
1 answer
91 views

I am using Dask for some processing. The client starts successfully, but I am seeing zero workers. This is how I am creating the client: client = Client("tls://localhost:xxxx") This is the ...
martian muonhunter
3 votes
1 answer
75 views

I have the following code that passes an array to a task and submits it to a Dask cluster. The Dask cluster is running in Docker with several Dask workers. Docker starts with: scheduler: docker run -d \ -...
eric feng
3 votes
0 answers
91 views

I am trying to analyze the 30-day standardized precipitation index for a multi-state region of the southeastern US for the year 2016. I'm using xclim to process a direct pull of gridded daily ...
helpmeplease
0 votes
0 answers
57 views

I am analysing some data using Dask distributed on a SLURM cluster, working from a Jupyter notebook. I change my codebase frequently and run jobs. Recently, a lot of my jobs started to crash. ...
Yatharth
0 votes
0 answers
75 views

I maintain a production Dask cluster. Every few weeks or so I need to restart the scheduler because it becomes progressively slower over time. The dashboard can take well over a minute to display the ...
Z4NG
1 vote
1 answer
57 views

I already have code using a thread pool, tkinter, and matplotlib to process signals that are written to a file by another process. The synchronization between the two processes is done by reading ...
Ayan Banerjee
0 votes
1 answer
95 views

I am trying to deploy a Dask cluster with 0 workers and 1 scheduler, and to scale the workers up as the workload requires. I found that adaptive deployment is the correct way to do this. I am using ...
Arun Kumar
1 vote
0 answers
119 views

I am new to Dask. While attempting to run concat on a list of DataFrames, I noticed it is consuming more time, resources, and tasks than expected. Here are the details of my run: Scheduler (same as ...
sandeysh
0 votes
1 answer
296 views

I am trying to run a Dask Scheduler and Workers on a remote cluster using SLURMRunner from dask-jobqueue. I want to bind the Dask dashboard to 0.0.0.0 (so it’s accessible via port forwarding) and ...
user1834164
0 votes
0 answers
135 views

I'm trying out some things with Dask for the first time, and while I had it running a few weeks ago, I now find that I can't get the LocalCluster initiated. I've cut it off after running for 30 minutes at ...
MKJ
0 votes
0 answers
136 views

I am trying to get this code to work and then use it to train various models on two GPUs: from dask_cuda import LocalCUDACluster from dask.distributed import Client if __name__ == "__main__...
Danilo Caputo
1 vote
1 answer
81 views

I am trying to learn Dask, and have created the following toy example of a delayed pipeline.
+-----+  +-----+  +-----+
| baz +--+ bar +--+ foo |
+-----+  +-----+  +-----+
So baz has a dependency on ...
Steve Lorimer
