Tall vs distributed array

Pey on 11 May 2018
Commented: Pey on 15 May 2018
I see that MATLAB has both tall and distributed arrays.
Tall arrays divide data into chunks.
Distributed arrays also divide data into chunks!
What's the difference here?
And how is either of these connected to parallel computing?

Answers (1)

Edric Ellis on 14 May 2018
Both tall and distributed arrays are designed for processing large amounts of data, but they have somewhat different capabilities.
distributed arrays are spread across the memory of several MATLAB worker processes, so the largest distributed array you can create is limited by the total amount of physical memory you have. distributed arrays are also more oriented towards dense and sparse linear algebra. They require Parallel Computing Toolbox, and are most effective when used with MATLAB Distributed Computing Server (which lets you distribute the data across multiple machines).
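As a rough illustration of the in-memory, linear-algebra flavour of distributed arrays, here is a minimal sketch. It assumes Parallel Computing Toolbox is installed and that a default parallel pool can be started; the size n and the variable names are made up for the example.

```matlab
% Minimal sketch (assumes Parallel Computing Toolbox is available).
% n, A, b, x and err are illustrative names, not from the original post.
if isempty(gcp('nocreate'))
    parpool;                      % start the default pool of workers
end

n = 500;                          % hypothetical problem size
A = rand(n, 'distributed');       % dense matrix held in the workers' memory
b = sum(A, 2);                    % row sums, so the exact solution is ones(n,1)
x = A \ b;                        % the linear solve runs on the workers

% Only the scalar result is brought back to the client.
err = gather(max(abs(x - 1)));
```

The whole of A has to fit in the combined memory of the workers, which is the size limit described above.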
The data for tall arrays exists on disk, and so their size is not limited by the amount of memory you have available. However, as the name implies, tall arrays can be large only in the first dimension. tall arrays are more geared towards data analytics. tall arrays ship with MATLAB itself, but there is enhanced support in both Parallel Computing Toolbox (which enables parallel processing in a single computer) and MATLAB Distributed Computing Server (which enables parallel processing across a cluster, including Hadoop/Spark clusters).
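For comparison, a minimal sketch of the tall workflow might look like the following. It assumes the sample file airlinesmall.csv that ships with MATLAB; the variable names are illustrative. No toolbox is needed for this to run, and if Parallel Computing Toolbox is present the same code can use a parallel pool.

```matlab
% Minimal sketch of a tall-array calculation on data that lives on disk.
% ds, tt and meanDelay are illustrative names, not from the original post.
ds = datastore('airlinesmall.csv', 'TreatAsMissing', 'NA');
ds.SelectedVariableNames = {'ArrDelay', 'DepDelay'};

tt = tall(ds);                               % tall table; no data is read yet
meanDelay = mean(tt.ArrDelay, 'omitnan');    % deferred, still a tall scalar
meanDelay = gather(meanDelay);               % reads the file in chunks and evaluates
```

Nothing is loaded until gather is called, which is why the array can be much taller than available memory.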
  3 Comments
Edric Ellis on 15 May 2018
The fundamental difference is where the data is held once you've created the array. distributed arrays are more restricted in size because the contents are always in memory, but they are more capable. tall arrays can be much larger - as long as you have the disk space.
Pey on 15 May 2018
Thanks. So if I understood correctly, can I summarize it in this way?
