[DDI-users] Use of ICPSR Stylesheet

Mary.Decuypere at statcan.ca Mary.Decuypere at statcan.ca
Fri Oct 13 15:23:41 EDT 2006


Skipped content of type multipart/alternative-------------- next part --------------
<?xml version='1.0' encoding='UTF-8'?>
<?xml-stylesheet href="\\lhs4\dism\DDI\Documentation\Stylesheets\codebook2-0.xsl" type="text/xsl"?>
<codeBook version="1.2.2" ID="4440">
  <docDscr>
    <citation>
      <titlStmt>
        <titl>
          Canadian Tobacco Use Monitoring Survey
        </titl>
        <IDNo>
          4440
        </IDNo>
      </titlStmt>
      <rspStmt>
        <AuthEnty affiliation="STATISTICS CANADA">
          Special Surveys Division
        </AuthEnty>
      </rspStmt>
      <prodStmt>
        <prodDate date="2006-09-22">
          2006-09-22
        </prodDate>
        <software version="3.01" date="2004-07-13">
          Nesstar Publisher
        </software>
      </prodStmt>
    </citation>
  </docDscr>
  <stdyDscr>
    <citation>
      <titlStmt>
        <titl>
          2005 CTUMS - Canadian Tobacco Use Monitoring Survey - Cycle 1 - February to June 2005
        </titl>
      </titlStmt>
      <rspStmt>
        <AuthEnty affiliation="STATISTICS CANADA">
          Special Surveys Division
        </AuthEnty>
      </rspStmt>
      <prodStmt>
        <producer abbr="SSD" affiliation="STATISTICS CANADA" role="Federal Government">
          Special Surveys Division
        </producer>
        <prodDate date="2006-09-22">
          2006-09-22
        </prodDate>
        <software version="3.01" date="2004-07-13">
          Nesstar Publisher
        </software>
        <fundAg abbr="HC" role="Survey Sponsor">
          Health Canada
        </fundAg>
      </prodStmt>
      <distStmt>
        <distrbtr abbr="DLI" affiliation="Statistics Canada">
          Data Liberation Initiative
        </distrbtr>
      </distStmt>
    </citation>
    <stdyInfo>
      <subject>
        <keyword>
          Health
        </keyword>
        <keyword>
          Health status indicators
        </keyword>
        <keyword>
          Social behaviour
        </keyword>
      </subject>
      <abstract>
        <![CDATA[The Canadian Tobacco Use Monitoring Survey (CTUMS) has been conducted for Health Canada since 1999, and provides data on tobacco use and related issues. The primary objective of the survey is to track changes in smoking status, especially for populations most at risk, such as the 15 to 24 year olds. The survey allows Health Canada to estimate smoking prevalence by province-sex-age groups on a semi-annual basis.

The Cycle 1 (February to June) data of each year are released separately. The Cycle 2 (July to December) data are released within the annual summary.]]>
      </abstract>
      <sumDscr>
        <nation>
          Canada
        </nation>
        <geogCover>
          <![CDATA[All persons 15 years of age and over living in Canada with the following two exceptions:

1) residents of the Yukon, Northwest Territories and Nunavut, and
2) full-time residents of institutions.]]>
        </geogCover>
        <anlyUnit>
          Persons
        </anlyUnit>
        <universe>
          <![CDATA[All persons 15 years of age and over living in Canada with the following two exceptions:

1) residents of the Yukon, Northwest Territories and Nunavut, and
2) full-time residents of institutions.

Because the survey was conducted using a sample of telephone numbers, households (and thus persons living in households) that do not have telephone land lines were excluded from the sample population. This means that people without telephones and people with cellphones only, were excluded. People without land lines account for less than 6% of the target population. However, the survey estimates have been weighted to include persons without land lines.]]>
        </universe>
      </sumDscr>
    </stdyInfo>
    <method>
      <dataColl>
        <timeMeth>
          <![CDATA[Data collection for this reference period: February 1, 2005 – June 30, 2005 

Responding to this survey is voluntary. Data are collected directly from survey respondents.

The interviews for Cycle 1 were conducted every month from February to June 2005.]]>
        </timeMeth>
        <sampProc>
          <![CDATA[This is a sample survey with a cross-sectional design.

The sample design is a special two-phase stratified random sample of telephone numbers. The two-phase design is used in order to increase the representation in the sample of individuals belonging to the 15 to 19 and 20 to 24 age groups. In the first phase, households are selected using Random Digit Dialing. In the second phase, one or two individuals (or none) are selected based upon household composition.

The CTUMS, Cycle 1, conducted from February to June 2005, collected data from 9,375 respondents.]]>
        </sampProc>
        <collMode>
          <![CDATA[Data were collected using computer-assisted telephone interviewing (CATI). A front-end module contains a set of standard response codes for dealing with all possible call outcomes, as well as the associated scripts to be read by the interviewers. A standard approach set up for introducing the agency, the name and purpose of the survey, the survey sponsors, how the survey results will be used, and the duration of the interview was used. 

The CATI application ensured that only valid question responses were entered and that all the correct flows were followed. Edits were built into the application to check the consistency of responses, identify and correct outliers, and to control who gets asked specific questions. This meant that the data was already quite "clean" at the end of the collection process.]]>
        </collMode>
        <sources/>
        <weight>
          <![CDATA[The main output of the CTUMS are two microdata files, one for the household level information and one for the person level information.

For the microdata file, statistical weights were placed on each record to represent the number of sampled persons that the record represents. One weight was calculated for each household and a separate weight was calculated and provided on a different file, for each person. 

The weighting for the CTUMS consists of several steps: calculation of a basic weight, followed by several adjustments such as non-response, non resolved telephone numbers and an adjustment for households with multiple telephone lines. Additional adjustments were made to account for the number of persons (0, 1 or 2) selected in the household and the over-sampling of the 15 to 24 age group. The last step of weighting consists of an adjustment to the person weights in order to make population estimates consistent with external population counts for persons 15 years and older (calibration). The following external control totals were used:

1) Monthly population totals for each province-stratum, and 

2) For Cycle 1 and Cycle 2:
population totals by province, sex and the following age groups: 15 to 19, 20 to 24, 25 to 34, 35 to 44, 45 to 54, 55 to 64, and 65 and over. These totals were averaged over the survey period.

For the Annual Summary:
population totals by province, sex and the following age groups: 15 to 19, 20 to 24, 25 to 29, 30 to 34, 35 to 39, 40 to 44, 45 to 49, 50 to 54, 55 to 59, 60 to 64, 65 to 69, and 70 and over. These totals were averaged over the survey period. 

The method called generalized regression (GREG) estimation was used to modify the weights to ensure that the survey estimates agreed with the external totals simultaneously along the two dimensions.

In order to supply coefficients of variation (CV) which would be applicable to a wide variety of categorical estimates produced from this microdata file and which could be readily accessed by the user, a set of Approximate Sampling Variability Tables has been produced. These CV tables allow the user to obtain an approximate coefficient of variation based on the size of the estimate calculated from the survey data.

The coefficients of variation in these tables are derived using the variance formula for simple random sampling and incorporating a factor which reflects the multi stage, clustered nature of the sample design. This factor, known as the design effect, was determined by first calculating design effects for a wide range of characteristics and then choosing from among these a conservative value to be used in the CV tables which would then apply to the entire set of characteristics.

The variance estimation method used until 2004 was Jackknife. Starting with Cycle 1 2004, it is Bootstrap.]]>
        </weight>
      </dataColl>
    </method>
    <othrStdyMat>
      <relMat>
        <citation>
          <titlStmt>
            <titl>
              <![CDATA[Questionnaire(s) and reporting guide(s)  (in PDF format) ]]>
            </titl>
          </titlStmt>
          <holdings URI="http://www.statcan.ca/english/sdds/instrument/4440_Q1_V11_E.pdf"/>
        </citation>
      </relMat>
      <relMat>
        <![CDATA[The Canadian Tobacco Use Monitoring Survey (CTUMS) was conducted by Statistics Canada from February to June 2005 with the cooperation and support of Health Canada.  This manual has been produced to facilitate the manipulation of the microdata file of the survey results.]]>
        <citation>
          <titlStmt>
            <titl>
              Canadian Tobacco Use Monitoring Survey, Cycle 1, February to June 2005 - Microdata User Guide (in PDF format)
            </titl>
          </titlStmt>
          <holdings URI="http://www.statcan.ca/english/sdds/document/4440_D1_T1_V11_E.pdf"/>
        </citation>
      </relMat>
      <relPubl>
        <![CDATA[This analysis integrates data for age at smoking initiation, smoking patterns (ever/never smokers), and number of cigarettes smoked by age and gender from 13 Canadian population-based health surveys. Data for cohorts of individuals born between 1910 and 1985 were combined into a single dataset to analyze changes in smoking behaviour.
The proportion of males and females who stated that they were current smokers declined considerably over the years; the proportion labelling themselves as former smokers has increased; and recent cohorts are smoking fewer cigarettes than those born earlier in the century. Although prevalence rates vary across birth cohorts, the pattern of use within each cohort is fairly similar, peaking during the middle years.
Females are now starting to smoke at a much younger age than females born early in the twentieth century. Males are starting at a slightly younger age than in earlier cohorts. Since the first survey, more males than females have consistently stated that they were current smokers, although this gap has closed over time. The most recent survey data suggest that the gender gap for age at initiation and smoking rates is now very modest.
This analysis contributes information about smoking patterns over time to a large integrative framework about population health in Canada: the Population Health Impact of Disease, Injury, and Health Determinants in Canada (PHI). 

]]>
        <citation>
          <titlStmt>
            <titl>
              <![CDATA[82-005-X20020036573   How times have changed! Canadian smoking patterns in the 20th century ]]>
            </titl>
          </titlStmt>
          <holdings URI="http://www.statcan.ca/bsolc/english/bsolc?catno=82-005-X20020036573"/>
        </citation>
      </relPubl>
      <relPubl>
        <![CDATA[The objective of this working paper series is to analyse the comparability of surveys conducted by Statistics Canada on smoking, to highlight the changes in the data among data years and to illustrate their statistical significance. The aim is to clarify any confusion regarding comparability of survey estimates of smoking prevalence and daily cigarette consumption over this period, as well as to provide the user-requested data in a technical but understandable format.

]]>
        <citation>
          <titlStmt>
            <titl>
              <![CDATA[82F0077X  Report on Smoking Prevalence in Canada ]]>
            </titl>
          </titlStmt>
          <holdings URI="http://www.statcan.ca/bsolc/english/bsolc?catno=82F0077X"/>
        </citation>
      </relPubl>
    </othrStdyMat>
  </stdyDscr>
  <fileDscr>
    <fileTxt>
      <fileName ID="F1">
        Person_personne.NSDstat
      </fileName>
      <dimensns>
        <caseQnty>
          9375
        </caseQnty>
        <varQnty>
          154
        </varQnty>
      </dimensns>
      <fileType>
        NSDstat 200203
      </fileType>
    </fileTxt>
  </fileDscr>
  <dataDscr>
    <varGrp ID="VG3F1" type="subject" var="V21 V22 V23 V24 V25 V26 V27 V28 V29 V30 V31 V32 V33 V34 V35 V36 V37 V38 V39 V40 V41 V42 V43 V44 V45 V46 V47 V48 V49 V50 V51 V52 V53 V54 V55 V56 V57 V58 V59 V60 V61 V62 V63 V64 V65 V66 V67 V68 V69 V70 V71 V72 V73 V74 V75 V76 V77 V78 V79 V80 V81 V82 V83 V84 V85 V86 V87 V88 V89 V90 V91 V92 V93 V94 V95 V96 V97 V98 V99 V100 V101 V102 V103 V104 V105 V106 V107 V108 V109 V110 V111 V112 V113 V114 V115 V116 V117 V118 V119 V120 V121 V122 V123 V124 V125 V126 V127 V128 V129 V130 V131 V132 V133 V134 V135 V136 V137 V138 V139 V140 V141 V142 V143 V144 V145 V146 V147 V148 V149 V150 V151 V152 V153 V1">
      <labl>
        Individual characteristics
      </labl>
    </varGrp>
    <varGrp ID="VG6F1" type="subject" var="V2 V3">
      <labl>
        Time measures
      </labl>
    </varGrp>
    <varGrp ID="VG1F1" type="subject" var="V4">
      <labl>
        Geographic identifiers
      </labl>
    </varGrp>
    <varGrp ID="VG5F1" type="subject" var="V154">
      <labl>
        Weight variable
      </labl>
    </varGrp>
    <varGrp ID="VG2F1" type="subject" var="V5 V6 V7 V8 V9 V10 V11">
      <labl>
        Household size and composition
      </labl>
    </varGrp>
    <varGrp ID="VG4F1" type="subject" var="V12 V13 V14 V15 V16 V17 V18 V19 V20">
      <labl>
        Household characteristics
      </labl>
    </varGrp>
    <var ID="V1" name="PUMFID" files="F1" dcml="0" intrvl="contin">
      <location StartPos="1" EndPos="5" width="5"/>
      <labl>
        PUMF ID
      </labl>
      <valrng>
        <range min="1" max="9375"/>
      </valrng>
      <sumStat type="vald">
        9375
      </sumStat>
      <sumStat type="invd">
        0
      </sumStat>
      <sumStat type="min">
        1
      </sumStat>
      <sumStat type="max">
        9375
      </sumStat>
      <sumStat type="mean">
        4688
      </sumStat>
      <sumStat type="stdev">
        2706.474
      </sumStat>
      <txt>
        Randomly generated sequence number for the public use microdata file
      </txt>
      <varFormat type="numeric" schema="other"/>
    </var>
    <var ID="V2" name="REFYEAR" files="F1" dcml="0" intrvl="discrete">
      <location StartPos="6" EndPos="9" width="4"/>
      <labl>
        Survey year
      </labl>
      <valrng>
        <range min="2005" max="2005"/>
      </valrng>
      <sumStat type="vald">
        9375
      </sumStat>
      <sumStat type="invd">
        0
      </sumStat>
      <catgry>
        <catValu>
          2005
        </catValu>
        <catStat type="freq">
          9375
        </catStat>
      </catgry>
      <varFormat type="numeric" schema="other"/>
    </var>
    <var ID="V3" name="REFMONTH" files="F1" dcml="0" intrvl="discrete">
      <location StartPos="10" EndPos="11" width="2"/>
      <labl>
        Survey month
      </labl>
      <valrng>
        <range min="2" max="6"/>
      </valrng>
      <sumStat type="vald">
        9375
      </sumStat>
      <sumStat type="invd">
        0
      </sumStat>
      <catgry>
        <catValu>
          2
        </catValu>
        <catStat type="freq">
          1762
        </catStat>
      </catgry>
      <catgry>
        <catValu>
          3
        </catValu>
        <catStat type="freq">
          1851
        </catStat>
      </catgry>
      <catgry>
        <catValu>
          4
        </catValu>
        <catStat type="freq">
          1843
        </catStat>
      </catgry>
      <catgry>
        <catValu>
          5
        </catValu>
        <catStat type="freq">
          1956
        </catStat>
      </catgry>
      <catgry>
        <catValu>
          6
        </catValu>
        <catStat type="freq">
          1963
        </catStat>
      </catgry>
      <varFormat type="numeric" schema="other"/>
    </var>
    <var ID="V4" name="PROV" files="F1" dcml="0" intrvl="discrete">
      <location StartPos="12" EndPos="13" width="2"/>
      <labl>
        Province of the respondent
      </labl>
      <valrng>
        <range min="10" max="59"/>
      </valrng>
      <sumStat type="vald">
        9375
      </sumStat>
      <sumStat type="invd">
        0
      </sumStat>
      <txt>
        Province of the selected respondent
      </txt>
      <catgry>
        <catValu>
          10
        </catValu>
        <labl>
          N.L.
        </labl>
        <catStat type="freq">
          883
        </catStat>
      </catgry>
      <catgry>
        <catValu>
          11
        </catValu>
        <labl>
          P.E.I.
        </labl>
        <catStat type="freq">
          798
        </catStat>
      </catgry>
      <catgry>
        <catValu>
          12
        </catValu>
        <labl>
          Nova Scotia
        </labl>
        <catStat type="freq">
          840
        </catStat>
      </catgry>
      <catgry>
        <catValu>
          13
        </catValu>
        <labl>
          N.B.
        </labl>
        <catStat type="freq">
          901
        </catStat>
      </catgry>
      <catgry>
        <catValu>
          24
        </catValu>
        <labl>
          Quebec
        </labl>
        <catStat type="freq">
          874
        </catStat>
      </catgry>
      <catgry>
        <catValu>
          35
        </catValu>
        <labl>
          Ontario
        </labl>
        <catStat type="freq">
          762
        </catStat>
      </catgry>
      <catgry>
        <catValu>
          46
        </catValu>
        <labl>
          Manitoba
        </labl>
        <catStat type="freq">
          867
        </catStat>
      </catgry>
      <catgry>
        <catValu>
          47
        </catValu>
        <labl>
          Saskatchewan
        </labl>
        <catStat type="freq">
          895
        </catStat>
      </catgry>
      <catgry>
        <catValu>
          48
        </catValu>
        <labl>
          Alberta
        </labl>
        <catStat type="freq">
          1751
        </catStat>
      </catgry>
      <catgry>
        <catValu>
          59
        </catValu>
        <labl>
          B.C.
        </labl>
        <catStat type="freq">
          804
        </catStat>
      </catgry>
      <varFormat type="numeric" schema="other"/>
    </var>

</dataDscr>
  <otherMat level="study"/>
</codeBook>


More information about the DDI-users mailing list