affyprobeseqread
Read data file containing probe sequence information for Affymetrix GeneChip array
Syntax
Struct
= affyprobeseqread(SeqFile
, CDFFile
)
Struct
= affyprobeseqread(SeqFile
, CDFFile
,
...'SeqPath', SeqPathValue
, ...)
Struct
= affyprobeseqread(SeqFile
, CDFFile
,
...'CDFPath', CDFPathValue
, ...)
Struct
= affyprobeseqread(SeqFile
, CDFFile
,
...'SeqOnly', SeqOnlyValue
, ...)
Input Arguments
SeqFile | Character vector or string specifying a file name of a sequence file (tab-separated or FASTA) that contains the following information for a specific type of Affymetrix® GeneChip® array:
The sequence file (tab-separated or FASTA)
must be on the MATLAB® search path or in the Current Folder (unless
you use the |
CDFFile | Either of the following:
Caution Make sure that |
SeqPathValue | Character vector or string specifying a folder or path and folder where
SeqFile is stored. |
CDFPathValue | Character vector or string specifying a folder or path and folder where
CDFFile is stored. |
SeqOnlyValue | Controls the return of a structure, Struct ,
with only one field, SequenceMatrix . Choices are true or false (default). |
Output Arguments
Struct | MATLAB structure containing the following fields:
|
Description
reads
the data from files Struct
= affyprobeseqread(SeqFile
, CDFFile
)SeqFile
and CDFFile
,
and stores the data in the MATLAB structure Struct
,
which contains the following fields.
Field | Description |
---|---|
ProbeSetIDs | Cell array containing the probe set IDs from the Affymetrix CDF library file. |
ProbeIndices | Column vector containing probe indexing information. Probes within a probe set are numbered 0 through N - 1, where N is the number of probes in the probe set. |
SequenceMatrix | An N-by-25 matrix of sequence information for the perfect match (PM) probes on the Affymetrix GeneChip array, where N is the number of probes on the array. Each row corresponds to a probe, and each column corresponds to one of the 25 sequence positions. Nucleotides in the sequences are represented by one of the following integers:
Note Probes without sequence information are represented in Tip You can use the |
calls Struct
= affyprobeseqread(SeqFile
, CDFFile
,
...'PropertyName
', PropertyValue
,
...)affyprobeseqread
with optional
properties that use property name/property value pairs. You can specify
one or more properties in any order. Each PropertyName
must
be enclosed in single quotation marks and is case insensitive. These
property name/property value pairs are as follows:
lets
you specify a path and folder where Struct
= affyprobeseqread(SeqFile
, CDFFile
,
...'SeqPath', SeqPathValue
, ...)SeqFile
is
stored.
lets
you specify a path and folder where Struct
= affyprobeseqread(SeqFile
, CDFFile
,
...'CDFPath', CDFPathValue
, ...)CDFFile
is
stored.
controls
the return of a structure, Struct
= affyprobeseqread(SeqFile
, CDFFile
,
...'SeqOnly', SeqOnlyValue
, ...)Struct
, with
only one field, SequenceMatrix
. Choices are true
or false
(default).
Examples
Read the data from a FASTA file and associated CDF library file, assuming both are located on the MATLAB search path or in the Current Folder.
S1 = affyprobeseqread('HG-U95A_probe_fasta', 'HG_U95A.CDF');
Read the data from a tab-separated file and associated CDF structure, assuming the tab-separated file is located in the specified folder and the CDF structure is in your MATLAB Workspace.
S2 = affyprobeseqread('HG-U95A_probe_tab',hgu95aCDFStruct,... 'seqpath','C:\Affymetrix\SequenceFiles\HGGenome');
Access the nucleotide sequences of the first probe set (rows 1 through 20) in the
SequenceMatrix
field of theS2
structure.seq = int2nt(S2.SequenceMatrix(1:20,:))
Version History
Introduced in R2007a
See Also
affygcrma
| affyinvarsetnorm
| affyread
| celintensityread
| int2nt
| probelibraryinfo
| probesetlookup
| probesetplot
| probesetvalues