What's the best way to read an ASCII/txt file containing a few lines of text and then the data I want to load into a matrix?

Question

0 votes

I will have files converted to something like this:

Pad 1:1 /This is what the converter writes out /Not sure if it will have the same number of comments every time /information /more text

0      14.666
2     134.567
3    1567.435
      ...      ...

and so forth. I want to read only the numerical data into a matrix to later work with and prefer it to not be a hack job, and something that will consistently read in many files. Thanks!

- Mark

0 Comments
Show -2 older comments Hide -2 older comments

Sign in to comment.

Sign in to answer this question.

Follow Question

Answer 1

Cedric on 8 Oct 2013

Edited: Cedric on 8 Oct 2013

Open in MATLAB Online

2 votes

If you always have the same number of header lines, use TEXTSCAN and set the parameter 'HeaderLines' to a relevant value, e.g. 2 if you have two lines of header in each file.

In the example that you provided, it seems that you have have a line of text, an empty line, and then numbers, so you should be able to work with something like:

 fid  = fopen( 'myFile.txt', 'r' ) ;
 data = textscan( fid, '%f%f', 'HeaderLines', 2 ) ;
 fclose( fid ) ;

6 Comments
Show 4 older comments Hide 4 older comments

Cedric on 8 Oct 2013

Edited: Cedric on 8 Oct 2013

Open in MATLAB Online

There are several options; I guess that one of the classic approaches is something like

 data  = zeros( 1e6, 2 ) ;                   % Prealloc (see note 1).
 rowId = 0 ;
 fid   = fopen( 'myFile.txt', 'r' ) ;
 while ~feof( fid )
    line = fgetl( fid ) ;
    num  = sscanf( line, '%f %f'  ) ;
    if ~isempty( num )
       rowId = rowId + 1 ;
       data(rowId,:) = num.' ;
    end
 end
 fclose( fid ) ;
 data = data(1:rowId,:) ;                    % Truncate to filled portion.

Note 1 : prealloc for more rows (a million) than what you have in the file. This is not mandatory, but it prevents data to be reallocated each time a valid row is read, which is more efficient. If you don't know if a million is enough but you don't want to prealloc with more, you can implement a mechanism which adds another million each time rowId reach the size of the preallocated array. You would have to bring the following update in the internal IF statement:

    if ~isempty( num )
       rowId = rowId + 1
       if rowId > size( data, 1 )
          data = [data; zeros( 1e6, 2 )] ; 
       end
       data(rowId,:) = num.' ;
    end

If it is not efficient enough, you can read the file while SSCANF returns an empty array or eof(fid), and then read the rest of the file in one shot. It is not the first solution that I gave you because it is a bit more difficult to understand. You would have to implement something like (not tested):

 fid  = fopen( 'myFile.txt', 'r' ) ;
 data = [] ;
 while ~feof( fid ) && isempty( data )
    line = fgetl( fid ) ;
    data = sscanf( line, '%f %f' ).' ;
 end
 if ~feof( fid )
    data = [data; fscanf( fid, '%f %f', [2, Inf] ).'] ;
 end
 fclose( fid ) ;

Mark on 8 Oct 2013

Thank you for the help. I ended up checking for the comment delimiter and using fgetl until it hit the data. From there textscan just read the rest of the text file and only put in numbers. Thanks again

Cedric on 8 Oct 2013

Edited: Cedric on 8 Oct 2013

You're welcome, just be careful not to loose the first line of data; if it was read by FGETL, it won't be read by TEXTSCAN as the file pointer was moved by FGETL after this line. This is why I have the concatenation in my last solution.

Sign in to comment.

What's the best way to read an ASCII/txt file containing a few lines of text and then the data I want to load into a matrix?

0 Comments
Show -2 older comments Hide -2 older comments

Accepted Answer

6 Comments
Show 4 older comments Hide 4 older comments

More Answers (0)

Categories

Products

Tags

Community Treasure Hunt

What's the best way to read an ASCII/txt file containing a few lines of text and then the data I want to load into a matrix?

0 Comments Show -2 older comments Hide -2 older comments

Accepted Answer

6 Comments Show 4 older comments Hide 4 older comments

More Answers (0)

Categories

Products

Tags

See Also

Community Treasure Hunt

0 Comments
Show -2 older comments Hide -2 older comments

6 Comments
Show 4 older comments Hide 4 older comments