Documentation

This is machine translation

Translated by Microsoft
Mouseover text to see original. Click the button below to return to the English verison of the page.

Note: This page has been translated by MathWorks. Please click here
To view all translated materals including this page, select Japan from the country navigator on the bottom of this page.

Big Data Processing

Analyze big data sets in parallel using distributed arrays, tall arrays, datastores, or mapreduce, on Spark® and Hadoop® clusters

You can use Parallel Computing Toolbox™ to distribute large arrays in parallel across multiple MATLAB® workers, so that you can run big-data applications that use the combined memory of your cluster. Parallel Computing Toolbox also enables you to execute MATLAB® tall array and datastore calculations in parallel, so that you can analyze big data sets that do not fit in the memory of your cluster. You can use MATLAB Distributed Computing Server™ to run tall array and datastore calculations in parallel on Spark enabled Hadoop clusters. Doing so significantly reduces the execution time of very large data calculations.

  • Distributed Arrays
    Analyze big data sets in parallel using distributed arrays and simultaneous execution.
  • Tall Arrays and Mapreduce
    Analyze big data sets in parallel using MATLAB tall arrays and datastores or mapreduce on Spark and Hadoop clusters, and parallel pools.
Was this topic helpful?