Hottest 'distributed-computing' Answers - Software Engineering Stack Exchange

7 votes

What difference and relation are between fault tolerance and (high) availability?

The basic concepts are orthogonal, however, they are related. One has to do with the availability of your application, and the other has to do with the correctness of your application. Remember, ...

Berin Loritsch

46.5k

answered Dec 29, 2019 at 16:15

7 votes

Accepted

How can adding redunancy adversely affect performance

Scenario A: You have a system that solves problem X. Scenario B: You have a system that solves problem X and has to ensure that it is always synchronized with the redundant backup. It is pretty clear ...

Jörg W Mittag

105k

answered Dec 20, 2020 at 20:27

6 votes

Accepted

How can we keep sight of business flows in event driven architectures?

The problem is that it can be hard to see such a flow as it's not explicit in any program text. Often the only way to figure out this flow is from monitoring a live system. There are two separate ...

Joeri Sebrechts

13k

answered Oct 3, 2018 at 8:14

6 votes

Accepted

How to prevent concurrency problems when using the repository pattern?

The repository pattern does not intend to solve concurrency issues. It only provides a convenient way to work with persistent entities. In principle you'd use your repository pattern in ...

Christophe

82.3k

answered Jan 10, 2020 at 19:04

6 votes

Accepted

Giving multiple components access to a single database

Indeed, option 2 (direct DB access) makes the ownership of the shared entities, and the responsibility for their invariant unclear. In the long run, maintenance risks increase. A typical example is an ...

Christophe

82.3k

answered Nov 21, 2022 at 8:17

5 votes

Accepted

How do atomic updates work at scale?

First things first, based on this question it appears you do not at all understand the concept of atomicity. I’d recommend doing a bit more work to understand that before moving on to distributed ...

Telastyn

110k

answered Jul 28, 2020 at 1:04

4 votes

Accepted

How to design a high-scale, reliable, distributed and periodic task (cronjob) execution service?

This is a very broad question, the answer would be by designing such a system. I think however you are more interested in what techniques can be used to minimise the impact of a node failing, and ...

Kain0_0

16.6k

answered Apr 23, 2019 at 7:00

4 votes

Accepted

Accepting the UUID collision risk based on number of clients

The whole point of UUIDs is that the risk of collisions can be safely ignored. A conflict solution is not needed. If you look at your log files and see a message "Fatal Error: UUID collision detected"...

gnasher729

49.4k

answered Jan 15, 2019 at 12:08

4 votes

How can adding redunancy adversely affect performance

Two simple examples: Adding an index to a database table is a common way to introduce redundancy, with the intention of speeding up read operations on that table. However, if a table is more ...

Doc Brown

221k

answered Dec 20, 2020 at 19:45

4 votes

Is sequential consistency equivalent to performing memory accesses by a processes in program order and performing each memory access atomically?

No, your definition is not quite equivalent to the definition of sequential consistency but would be closer to strict consistency. There are two relevant aspects: (a) there are multiple processors/...

amon

136k

answered Feb 9, 2021 at 22:38

4 votes

Long-running compute-intensive tasks in APIs: background workers?

Move the problem solution to the client by providing separate HTTP API endpoints to submit processing requests and to collect results Fixed that for you. Doing this gives you a number of advantages: ...

Philip Kendall

26.1k

answered May 31, 2022 at 12:57

4 votes

Giving multiple components access to a single database

Sharing a database between multiple application is known as an integration database. Not only does it have to consider the requirements of all of the applications integrating through it (which results ...

Thomas Owens♦

85.9k

answered Nov 21, 2022 at 11:49

3 votes

Docker and GPU-based computations. Feasible?

Containerization is completely orthogonal to “high load” or “parallelization”. Containerization also does not imply any virtualization, and is better interpreted as sandboxing. So why do people use ...

amon

136k

answered May 6, 2018 at 18:04

3 votes

Is shared disk architecture scaling up or scaling out?

Shared disk is vertically scaled approach for disk. As Robert Harvey points out, you are scaling horizontally for memory and CPU but the disk is one (or a few) component. There's a simple way to ...

JimmyJames

31.1k

answered Nov 25, 2019 at 14:29

3 votes

Does the producer-consumer problem appear in both shared memory and distributed memory architectures?

You are right in thinking that the problems of races and deadlocks in producer-consumer stem from shared variables and data structures. Any system that allows multiple accessors (processes, threads, ...

Erik Eidt

34.8k

answered Dec 19, 2019 at 18:42

3 votes

Multiple sources of truth - Optimistic concurrency & Eventual consistency

There are multiple ways to go about solving the issue of having distributed entities. Avoiding it You could make sure the customer entity only exist in one place and have the other parts of the system ...

Frederik Banke

311

answered Sep 11, 2020 at 6:37

2 votes

Error handling in distributed system

Appending to a persistent log on A should suffice. This copes with reboots and network partitions to achieve eventual consistency, or to signal breakage which prevents such convergence. With amortized ...

J_H

7,997

answered Nov 23, 2017 at 20:31

2 votes

Patterns for maintaining consistency in a distributed, event sourced system?

Sounds like you could implement a business process (saga in context of Domain Driven Design) for the user registration where the user is treated like a CRDT. Resources https://doc.akka.io/docs/akka/...

SemanticBeeng

121

answered Feb 3, 2018 at 15:16

2 votes

Distributed training of many small ML models

There's no particular right or wrong way to do this because this depends on your projects, and on whether you can exploit the structure of your data for efficiency. E.g. for a one-off project, you ...

amon

136k

answered Sep 18, 2018 at 11:57

2 votes

Develop a distributed pointer in C++

The answer is yes and no: Yes, you can create a distributed pointer that could be based on an IP address and a port. It would be implemented using the remote proxy design pattern but with a pointer ...

Christophe

82.3k

answered Nov 13, 2019 at 8:03

2 votes

Develop a distributed pointer in C++

Yes, C++ provides you with awesome powers, and itty bitty living spaces. Break your problem down first. You need a Connection object responsible for handling the nitty gritty of communication. It ...

Kain0_0

16.6k

answered Nov 13, 2019 at 0:46

2 votes

Accepted

Consistency and Availability in distributed hashing Key value store

Let me rephrase your question - how is CAP theorem applicable for a raft based system. For the context: CAP says that in case partitioning is happening, then you have to pick either consistency or ...

AndrewR

196

answered Mar 3, 2022 at 21:34

2 votes

Architecting a distributed file processing system with leadership election

Well, the "leader election" problem is quite well known and the most commonly used app for solving it is probably Apache Zookeeper. Google it and you'll find plenty of documents about that. If you ...

Pawel Gorczynski

239

answered Dec 20, 2018 at 20:11

2 votes

Do persistent/transient communication and temporal decoupling/coupling mean the same?

Do persistent communication and temporal decoupling mean the same? No, but those concepts are related: temporal decoupling requires the messages/exchanged data between processes to be kept (=persisted)...

Doc Brown

221k

answered Dec 17, 2019 at 7:13

2 votes

What is the difference between masking and tolerating failures?

From what I understand both are different in respect to the level of abtractions involved: "Masked" means here: Lower levels "mask" failure transparently for higher levels of the system. Failure on a ...

Thomas Junk

9,623

answered Dec 25, 2019 at 8:12

2 votes

How do atomic updates work at scale?

How does it work? Database atomicity (aka the A of ACID) is an appearance of atomicity. The general idea for an update is: keep the old value where it is unchanged as long as the transaction is not ...

Christophe

82.3k

answered Jul 28, 2020 at 7:43

2 votes

Implementing the microservice pattern

You seem to be conflating a number of somewhat unrelated concepts: Automated deployment (different from continuous deployment) Containerization Microservices Of those, it sounds like what you really ...

Philip Kendall

26.1k

answered Nov 13, 2020 at 8:40

2 votes

Ordering of analytical events

Assuming your front end is not malicious and not purposefully manipulating the time, you can follow this procedure: Modify the message so that it contains two timestamps: (A) The time the event ...

John Wu

27k

answered Dec 4, 2020 at 18:40

2 votes

How can adding redunancy adversely affect performance

First, as others have pointed out, in general doing more, costs more, so in most cases adding more work results in an increased cost (aka performance reduction). Secondly, you are missing the most ...

jmoreno

11.2k

answered Dec 21, 2020 at 2:06

2 votes

Accepted

What are the approaches for joining data in distributed processing

About the name of such a system I can't say anything - I'd guess that it's not a specific name but a function of the system, which I would informally call event consolidation. Regarding the behavior ...

Hans-Martin Mosner

18.6k

answered Apr 21, 2022 at 11:08

Stack Exchange Network

Tag Info

Hot answers tagged distributed-computing

What difference and relation are between fault tolerance and (high) availability?

How can adding redunancy adversely affect performance

How can we keep sight of business flows in event driven architectures?

How to prevent concurrency problems when using the repository pattern?

Giving multiple components access to a single database

How do atomic updates work at scale?

How to design a high-scale, reliable, distributed and periodic task (cronjob) execution service?

Accepting the UUID collision risk based on number of clients

How can adding redunancy adversely affect performance

Is sequential consistency equivalent to performing memory accesses by a processes in program order and performing each memory access atomically?

Long-running compute-intensive tasks in APIs: background workers?

Giving multiple components access to a single database

Docker and GPU-based computations. Feasible?

Is shared disk architecture scaling up or scaling out?

Does the producer-consumer problem appear in both shared memory and distributed memory architectures?

Multiple sources of truth - Optimistic concurrency & Eventual consistency

Error handling in distributed system

Patterns for maintaining consistency in a distributed, event sourced system?

Distributed training of many small ML models

Develop a distributed pointer in C++

Develop a distributed pointer in C++

Consistency and Availability in distributed hashing Key value store

Architecting a distributed file processing system with leadership election

Do persistent/transient communication and temporal decoupling/coupling mean the same?

What is the difference between masking and tolerating failures?

How do atomic updates work at scale?

Implementing the microservice pattern

Ordering of analytical events

How can adding redunancy adversely affect performance

What are the approaches for joining data in distributed processing

Tag Info

Hot answers tagged distributed-computing

Related Tags