Objective: This study describes a new computer methodology for analyzing workers' free text work descriptions. Methods: Computerized lexical analysis was applied to work descriptions of participants in the Lung Health Study, a smoking-cessation study in persons with early chronic obstructive pulmonary disease. Text was parsed and analyzed as single term roots and pairs of roots commonly occurring together. Results: The frequencies of terms reflect the work of a population; our subjects' most frequently used terms included "sale, office, service, business, engine[er], secretary, construct, driv[e], comput[e], teach, truck." Standard classification schemes (NAICS and SOC) and textbooks use terms inconsistent with those of actual workers. Many common empirical terms imply both industry and job information content, although traditional coding schemes separate industry and job title. Conclusions: Formal analyses of language may facilitate communication, identify translation priorities, and allow automated work coding.
|Original language||English (US)|
|Number of pages||6|
|Journal||Journal of occupational and environmental medicine / American College of Occupational and Environmental Medicine|
|Publication status||Published - Aug 2005|
ASJC Scopus subject areas
- Public Health, Environmental and Occupational Health
- Health, Toxicology and Mutagenesis