Precision lost when combining int32 integers with single-precision numbers

I have a column of data A composed of int32 numbers, and another column of data B composed of single-precision numbers. When I try to put them into one array C, my single-precision numbers are butchered into integers.
C = [A, B];
Why is MATLAB set up this way? Due to the loss of precision, my final calculated values are way off. It took me quite some time to figure out that this was the reason.

 Accepted Answer

Why is MATLAB set up this way? Because you can't please all of the people, all of the time. Suppose a numeric vector could have elements that are all of different numeric classes. Something like:
X = [pi, single(2.1), uint16(3)]
X = 1×3
3 2 3
whos X
Name      Size            Bytes  Class     Attributes
X         1x3                 6  uint16
etc. X will be a UINT16 by default here. But if the elements could retain their class information (not in the form of a cell array, which DOES retain the class information for each element), then any computation would become IMMENSELY SLOW.
A huge benefit of MATLAB is that it runs blazingly fast when doing double-precision computation, especially on large arrays. But if the code needed to check each element and deal with the class of that number, then it would not be fast at all. And then almost everyone would be unhappy. As such, MATLAB is designed to store all numeric vectors using one class. Concatenation operators decide which class to use, based on some simple rules.
So what happened to you?
In your case, you were combining int32 numbers with singles, by way of concatenation (horzcat)
Y1 = [int32(2), single(3.2)]
Y1 = 1×2
2 3
whos Y1
Name      Size            Bytes  Class    Attributes
Y1        1x2                 8  int32
The rule is that when you concatenate integers and singles together, you get an integer result.
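If the goal is simply to keep the fractional values, converting before you concatenate avoids the truncation entirely. A quick sketch (the variable names are just placeholders for the arrays in the question):

```matlab
A = int32([1 2 3]);          % integer column data
B = single([1.5 2.5 3.5]);   % single-precision column data
% Convert the integers to single (or double) BEFORE concatenating,
% so the [] operator never has to demote B to an integer class.
C = [single(A), B];          % class(C) is 'single'; fractions preserved
```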
If you really want to retain the information about each element, then you needed to use a cell array.
Z = {int32(2), single(3.2)}
Z = 1×2 cell array
{[2]} {[3.2000]}
As you can see, MATLAB now retains all the information you want for each element. The problem is, you can't do numerical operations using cell arrays.
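That said, you can still compute with a cell array's contents by converting element-wise, for example with cellfun. A minimal sketch:

```matlab
Z = {int32(2), single(3.2)};
% Apply double() to each cell so the mixed classes can be
% combined into one double-precision numeric result.
S = sum(cellfun(@double, Z));   % S is 5.2000, a double scalar
```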

More Answers (3)

The general rule is that when you combine numbers of two different types, the result is the type considered more restrictive. Integer is considered more restrictive than float.
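A quick way to see that rule in action (a sketch; the classes shown in the comments are what current MATLAB releases produce):

```matlab
x = [single(1.5), 2.5];        % single combined with a double literal
class(x)                       % 'single' -- single is more restrictive than double
y = [int32(7), single(1.5)];   % integer combined with single
class(y)                       % 'int32'  -- integer is more restrictive than float
```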

I wasn't there when the decision was made, but I suspect it was because in MATLAB, numeric literals are always double floats. That goes back to the origins of MATLAB, before integer types like int32 were even introduced.
Once you're locked in with literals that are always double, it becomes inconvenient to prioritize precision. If the rule were to promote the lower-precision operand to the higher precision, then you would get an automatic explosion in RAM usage every time you did the simplest operations between large integer arrays and literal scalars, e.g.,
A = int16(5000); % integer
C = A + 1;
You could avoid this by remembering to convert your literal scalars, as in
C = A + int16(1)
but not only is that incredibly cumbersome, it would also have forced people to rewrite their old code from before the days when integer types were introduced.
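The memory cost of promoting integers to doubles is easy to see with whos. A quick illustration:

```matlab
A = ones(1e6, 1, 'int8');   % 1,000,000 elements at 1 byte each
B = double(A);              % same values at 8 bytes each
whos A B                    % A is ~1 MB, B is ~8 MB
```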

Hi @Leon,

I read your comments and hope I have interpreted them correctly. In MATLAB, when you create an array that combines different data types, it attempts to promote all elements to a common type that can accommodate all values without loss of information. Since both Int32 and single types occupy 4 bytes, MATLAB defaults to converting the entire array to the type that is capable of representing all elements. In this case, it promotes to Int32, causing your single-precision floating-point numbers to lose their fractional parts and be represented as integers.

Here’s a deeper look at how this works:

1. Data Types in MATLAB
Int32: A 32-bit signed integer that can represent whole numbers from -2,147,483,648 to 2,147,483,647.

Single: A 32-bit floating-point number that can represent a much wider range of values, including fractional values.

2. Array Concatenation Behavior
When concatenating arrays like [A, B], MATLAB checks the types of both arrays. Since A is Int32 and B is single, it opts for Int32 for the entire resulting array C. The conversion effectively truncates any decimal portion of the single-precision numbers in B, leading to inaccuracies in further calculations.

Solutions and Recommendations

To address this issue effectively, consider the following approaches:

1. Explicit Type Conversion: Before concatenating your arrays, convert both arrays to a common type that preserves precision. For example:

C = [int32(A), single(B)];

This ensures that both arrays are treated as single-precision floating points in the resulting array.

2. Using Cell Arrays: If maintaining different data types is essential for your application, consider using cell arrays:

C = {A, B};

This allows you to keep the data types separate but still access them together.

3. Review Data Types Before Operations: Always check the data types using the class() function before performing operations that combine different types. This can prevent unexpected behavior during calculations.
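For instance, a defensive check before concatenating might look like this (a sketch, assuming A and B are the arrays from the question):

```matlab
if ~strcmp(class(A), class(B))
    warning('Mixing %s and %s; converting both to double first.', ...
        class(A), class(B));
    C = [double(A), double(B)];   % promote both so no values are truncated
else
    C = [A, B];                   % same class; concatenation is safe as-is
end
```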

Hope this helps.

9 Comments

it attempts to promote all elements to a common type that can accommodate all values without loss of information
Not true.
A = uint8(11);
B = uint64(1234);
C = [A,B]
C = 1×2
11 255
class(C)
ans = 'uint8'
If the algorithm were "without loss of information" then C would have come out as uint64, since uint64 can accommodate all uint8 values.
types = {'single', 'double', 'uint8', 'int8', 'uint16', 'int16', 'uint32', 'int32', 'uint64', 'int64'};
nt = length(types);
for J = 1 : nt - 1
    tJ = types{J};
    A = cast(pi, tJ);
    for K = J+1 : nt
        tK = types{K};
        B = cast(-123, tK);
        C = [A,B];
        fprintf('%-6s + %-6s = %-6s\n', tJ, tK, class(C));
    end
end
single + double = single
single + uint8  = uint8
single + int8   = int8
single + uint16 = uint16
single + int16  = int16
single + uint32 = uint32
single + int32  = int32
single + uint64 = uint64
single + int64  = int64
double + uint8  = uint8
double + int8   = int8
double + uint16 = uint16
double + int16  = int16
double + uint32 = uint32
double + int32  = int32
double + uint64 = uint64
double + int64  = int64
uint8  + int8   = uint8
uint8  + uint16 = uint8
uint8  + int16  = uint8
uint8  + uint32 = uint8
uint8  + int32  = uint8
uint8  + uint64 = uint8
uint8  + int64  = uint8
int8   + uint16 = int8
int8   + int16  = int8
int8   + uint32 = int8
int8   + int32  = int8
int8   + uint64 = int8
int8   + int64  = int8
uint16 + int16  = uint16
uint16 + uint32 = uint16
uint16 + int32  = uint16
uint16 + uint64 = uint16
uint16 + int64  = uint16
int16  + uint32 = int16
int16  + int32  = int16
int16  + uint64 = int16
int16  + int64  = int16
uint32 + int32  = uint32
uint32 + uint64 = uint32
uint32 + int64  = uint32
int32  + uint64 = int32
int32  + int64  = int32
uint64 + int64  = uint64
So the algorithm is:
  • integer type wins over float type
  • if two integers are combined, the one with the fewest bits wins
  • if two integers with the same bits are combined, uint wins over int
As others have pointed out, this doesn't work; it's what the OP is already doing.
C = [int32(A), single(B)]; % the result will be int32, not single
I should point out, though, that the obvious fix is not without its own problems:
A = intmax('int32') - 64 % a large integer
A = int32 2147483583
B = single(sqrt(2)) % a fractional number
B = single 1.4142
% by default, the output class is the integer class
C = [A B]; % B is rounded
mat2str(C)
ans = '[2147483583 1]'
% you can force the output to be single instead
% but single can't represent all integers across the full range of int32
C = [single(A), single(B)]; % A is rounded
mat2str(C)
ans = '[2147483520 1.41421353816986]'
% but double can
C = [double(A), double(B)]; % A and B should be unchanged
mat2str(C)
ans = '[2147483583 1.41421353816986]'
C = [int32(A), single(B)];
This ensures that both arrays are treated as single-precision floating points in the resulting array.
No it does not. That explicitly converts A to int32 and explicitly converts B to single precision. Then after that the [] operator implicitly converts the single(B) to int32, same as int32(A).
To treat both as floating point you would need
[single(A), single(B)]
to be clear, or just
[single(A), B]
if you want to rely on the fact that B is single precision.
if two integers are combined, the one with the fewest bits wins
if two integers with the same bits are combined, uint wins over int
No. If you concatenate two integer arrays together, the resulting array will be of the class of the left-most integer array.
integerTypes = reshape(["", "u"] + "int" + [8; 16; 32; 64], 1, 8);
results = array2table(repmat("", 8, 8), ...
    VariableNames = integerTypes, ...
    RowNames = integerTypes);
for type1 = integerTypes
    A = ones(1, type1);
    for type2 = integerTypes
        B = ones(1, type2);
        C = [A, B];
        results{type1, type2} = string(class(C));
    end
end
The rows of the results table represent the type of A (the first array being concatenated together) and the variables represent the type of B. You can see that each row's values are always equal to the row name of the table (the type of A.)
results
results = 8×8 table
              int8       int16      int32      int64      uint8      uint16     uint32     uint64
              ________   ________   ________   ________   ________   ________   ________   ________
    int8      "int8"     "int8"     "int8"     "int8"     "int8"     "int8"     "int8"     "int8"
    int16     "int16"    "int16"    "int16"    "int16"    "int16"    "int16"    "int16"    "int16"
    int32     "int32"    "int32"    "int32"    "int32"    "int32"    "int32"    "int32"    "int32"
    int64     "int64"    "int64"    "int64"    "int64"    "int64"    "int64"    "int64"    "int64"
    uint8     "uint8"    "uint8"    "uint8"    "uint8"    "uint8"    "uint8"    "uint8"    "uint8"
    uint16    "uint16"   "uint16"   "uint16"   "uint16"   "uint16"   "uint16"   "uint16"   "uint16"
    uint32    "uint32"   "uint32"   "uint32"   "uint32"   "uint32"   "uint32"   "uint32"   "uint32"
    uint64    "uint64"   "uint64"   "uint64"   "uint64"   "uint64"   "uint64"   "uint64"   "uint64"
Many thanks for all the super helpful comments. Now I know why.
If my priority is processing speed, should I convert everything to single or double? Thanks.
For large enough arrays, single precision is faster. For arrays of only about 1000 x 1000, the difference in speed is quite small.
A = rand(10000,1000);
B = rand(10000,1000);
sA = single(A);
sB = single(B);
tic; C = A + B; t1 = toc
t1 = 0.0074
tic; sC = sA + sB; t2 = toc
t2 = 0.0046
Note: if you happen to be using gpuArray(), then on all NVIDIA systems, single precision is notably faster than double precision. The exact relative speed depends on the details of the GPU model; the best case is 4:1 single to double (only a small number of systems!), and the worst case is 32:1 (the ratio on most systems). A couple of generations ago, 24:1 was the common ratio.
Glad to know single is faster than double. Thanks.


Release: R2024b
Asked: 21 Jun 2025
Commented: 24 Jun 2025
