Extracting parts of a string
I have a text file with information like this:
FileName; SampleFreq; Test;Modality;Channel;Description;StimIntensity; Position; RecordingTime
C:\Users\G10040419\Desktop\lp export application\Data 139\00000090_1.WAV; 22000; 2;1;1;5 CH Right; 0.00; -10000; 40147.491374
I need to extract the SampleFreq (22000) and the Position (-10000). I tried to use regular expressions, but I cannot find a specific delimiter for these data.
Accepted Answer
More Answers (3)
per isakson
on 15 Jun 2018
Edited: per isakson
on 15 Jun 2018
Is this what you are looking for?
fid = fopen( '00000090Head.txt', 'r' );
cac = textscan( fid, '%*s%f%*f%*f%*f%*s%*f%f%*f', 'Headerlines',1,'Delimiter',';' );
fclose( fid );
Then inspect the result:
>> cac
cac =
1×2 cell array
{183×1 double} {183×1 double}
>> cac{2}(1:3)
ans =
-10000
-9000
-8000
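To make the column mapping explicit, here is a minimal sketch that applies the same format string to the single data line from the question, passed to textscan as a character vector instead of a file identifier. The variable names (txt, sampleFreq, position) are only illustrative, and the header line is left out, so no 'Headerlines' option is needed here.

```matlab
% Data line copied from the question (header line omitted on purpose).
txt = 'C:\Users\G10040419\Desktop\lp export application\Data 139\00000090_1.WAV; 22000; 2;1;1;5 CH Right; 0.00; -10000; 40147.491374';

% One specifier per column; a '*' means "parse the field but do not return it".
fmt = ['%*s' ...  FileName       (skipped)
       '%f'  ...  SampleFreq     (kept)
       '%*f' ...  Test           (skipped)
       '%*f' ...  Modality       (skipped)
       '%*f' ...  Channel        (skipped)
       '%*s' ...  Description    (skipped)
       '%*f' ...  StimIntensity  (skipped)
       '%f'  ...  Position       (kept)
       '%*f'];  % RecordingTime  (skipped)

% textscan also accepts text directly; ';' separates the fields.
C = textscan(txt, fmt, 'Delimiter', ';');

sampleFreq = C{1};   % first kept column
position   = C{2};   % second kept column
```

Only the two fields without a '*' come back, which is why cac above has exactly two cells.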
Ana Maria Alzate
on 15 Jun 2018
5 Comments
per isakson
on 15 Jun 2018
Edited: per isakson
on 15 Jun 2018
Did I make a mistake when counting columns? This format string works here with the sample file. (Do all files have the same format?)
'%*s%f%*f%*f%*f%*s%*f%f%*f'
However, remove one %*f and try
'%*s%f%*f%*f%*f%*s%f%*f'
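One way to double-check the column count without counting by hand is to compare the number of names in the header line with the number of specifiers in the format string. This is only a sketch; the variable names (header, names, specs, kept) are made up for illustration, and the regular expression assumes simple specifiers such as %f, %s, %d with an optional '*'.

```matlab
% Header line copied from the question, and the first format string above.
header = 'FileName; SampleFreq; Test;Modality;Channel;Description;StimIntensity; Position; RecordingTime';
fmt    = '%*s%f%*f%*f%*f%*s%*f%f%*f';

names = strtrim(strsplit(header, ';'));      % column names, blanks trimmed
specs = regexp(fmt, '%\*?[a-z]', 'match');   % one entry per specifier

% The format must describe every column in the file, skipped or not.
assert(numel(names) == numel(specs), 'Format does not match the column count');

% Columns whose specifier has no '*' are the ones textscan returns.
kept = names(cellfun(@isempty, strfind(specs, '*')));
disp(kept)
```

For the header in the question both counts are nine, so the first format string matches it; the shorter one has eight specifiers and would only fit a file with one column fewer.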
Ana Maria Alzate
on 15 Jun 2018
per isakson
on 15 Jun 2018
Edited: per isakson
on 15 Jun 2018
Shifts like this one should not be a problem. (This is the only "shift" I find in the sample file.)

- Do you have problems reading the uploaded sample file?
- Why not upload a file that causes problems?
- Do you get any error or warning messages?
Jan
on 15 Jun 2018
@Ana Maria Alzate: Please do not post comments in the section for answers in the future. There is a section for comments for this job. Thanks.
Ana Maria Alzate
on 18 Jun 2018
Importing the data as strings and then using regular expressions to parse them is inefficient and unnecessary, because the file is neatly formatted in delimited columns and the required data can be read directly as numeric (or char). The textscan command makes it easy to specify how to read those columns, and the format string is much simpler and more intuitive than those regular expressions:
>> fmt = '%*s%f%*d%*d%*d%*s%*f%f%*f';
>> opt = {'HeaderLines',1,'Delimiter',';'};
>> [fid,msg] = fopen('00000090Head.txt','rt');
>> assert(fid>=3,msg)
>> C = textscan(fid,fmt,opt{:});
>> fclose(fid);
>> [C{:}]
ans =
22000 -10000
22000 -9000
22000 -8000
22000 -7000
22000 -6000
22000 -5000
22000 -4500
22000 -4000
22000 -3500
22000 -3000
22000 -3000
22000 -2500
22000 -2000
22000 -1500
22000 -1000
22000 -500
22000 0
22000 500
22000 1000
22000 1500
... lots of lines here
22000 -3000
22000 -2500
22000 -2000
22000 -1500
22000 -1000
22000 -500
22000 0
22000 500
22000 1000
22000 1500
22000 2000
22000 2500
22000 3000
22000 3500
22000 4000
22000 4000
22000 5000