Seeley G. Mudd Library | Library Research

A typical "raw" data file looks like this:

01520002274100000000000000000300000000000000100000010000000021231111312
0152000215010000000000000000030000000000000100000000000000013423333332 
01520001531100000000000000000500000100000000000100100000000023213311122
01520001494100000000000000000501100100000000000100100000000011231212143
01520001379100000000000000000300001000000000000000100000001014423233454
01520001532100000000000000000500100100000000000000110000000024134141212
0152000238010000000000000000030000000000000010100010000000003421    121
01520002337100000000000000000300000000000000100001000000000023234333213
01520002163000000010000000000400100000000000000100100000000034211131422
01520002119100000000000000000300000000000000101000100000000022134313415
01520002195100000000000000001400100000000000000000110000000035133323113
01520002125100000000000000001200110000000000000000000000000122212331423
01520001480100000000000000000500000000000000101100000000000011132343413


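Each line is the record for one respondent. The digits encode the study number, the respondent number, and the coded values of that respondent's answers.
A codebook explains what the numbers mean, as in this entry for the variable PROBRX1: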

PROBRX1   R HAS A PROBLEM WITH HEALTH                                     
                                                       Valid     Cum      
Value Label                 Value  Frequency  Percent  Percent  Percent   
                                                                          
NO                              1      1731     68.8     68.8     68.8    
YES                             2       779     30.9     30.9     99.7    
REFUSAL                         7         1       .0       .0     99.8    
MISSING                         9         6       .2       .2    100.0    
                                     -------  -------  -------            
                            Total      2517    100.0    100.0             
Valid cases    2517      Missing cases      0                             
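
To make the link between the raw file and the codebook concrete, here is a minimal Python sketch of how a record might be split into fields and a coded value turned into its label. The value labels are taken from the PROBRX1 entry above. The column positions in ASSUMED_LAYOUT and the helper names (read_field, parse_record, decode) are hypothetical, chosen only for illustration; the real column positions must be looked up in the study's codebook.

```python
# Value labels copied from the PROBRX1 codebook entry.
PROBRX1_LABELS = {"1": "NO", "2": "YES", "7": "REFUSAL", "9": "MISSING"}

# ASSUMPTION: hypothetical 1-based, inclusive column ranges.
# The actual positions are defined by the study's codebook.
ASSUMED_LAYOUT = {
    "study": (1, 4),
    "respondent": (5, 11),
}


def read_field(record, start, end):
    """Return the characters in 1-based columns start..end of a record."""
    return record[start - 1:end]


def parse_record(record, layout=ASSUMED_LAYOUT):
    """Split one raw line into named fields according to a column layout."""
    return {
        name: read_field(record, start, end)
        for name, (start, end) in layout.items()
    }


def decode(value, labels):
    """Translate a coded value into its codebook label."""
    value = value.strip()
    if not value:
        return None  # blank column: no code recorded
    return labels.get(value, "UNKNOWN CODE " + value)


# First record from the sample file.
record = "01520002274100000000000000000300000000000000100000010000000021231111312"
fields = parse_record(record)
print(fields["study"], fields["respondent"])
print(decode("2", PROBRX1_LABELS))
```

Passing the layout as a parameter keeps the slicing logic separate from any one study, so the same functions can be reused once the true column positions are known.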
