This article addresses three common problems in preparing bibliometric data for discipline evaluation: data collection, classification, and cleaning. It analyzes three modes of data source selection, the criteria for choosing a database, and the characteristics of three types of data retrieval methods. It then summarizes the discipline classification schemes currently applied to scientific literature data and suggests methods for classifying literature data at the first-level and second-level discipline levels. Finally, taking the evaluation of scientific entities in international mathematics in the early 21st century as an example, it identifies the causes of inconsistent labeling of country and institution names in scientific literature data, and constructs a country name thesaurus and several institution name thesauri.
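As an illustration of how a country name thesaurus can be applied during data cleaning, the following is a minimal sketch, not the authors' implementation. The thesaurus entries, the function names `normalize_key` and `normalize_country`, and the choice to merge England and Scotland under "United Kingdom" are all assumptions made for the example.

```python
import re

# Hypothetical thesaurus: cleaned variant spelling -> preferred country name.
# The entries are illustrative only and are not taken from the article.
COUNTRY_THESAURUS = {
    "usa": "United States",
    "u s a": "United States",
    "united states": "United States",
    "peoples r china": "China",
    "pr china": "China",
    "p r china": "China",
    "china": "China",
    "england": "United Kingdom",   # assumption: merge UK constituent countries
    "scotland": "United Kingdom",
    "uk": "United Kingdom",
}


def normalize_key(raw):
    """Lowercase the name, replace punctuation with spaces, collapse whitespace."""
    key = re.sub(r"[^a-z0-9 ]+", " ", raw.lower())
    return " ".join(key.split())


def normalize_country(raw):
    """Return the preferred name, or None if the variant is not in the thesaurus."""
    return COUNTRY_THESAURUS.get(normalize_key(raw))


if __name__ == "__main__":
    for variant in ["U.S.A.", "Peoples R China", "P.R. China", "Scotland"]:
        print(variant, "->", normalize_country(variant))
```

Returning `None` for unknown variants, rather than guessing, lets unmatched names be collected and reviewed by hand before they are added to the thesaurus. The same lookup pattern could be used for institution names, with one thesaurus per institution.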