[ 2010 September,10, Friday ]
中国中文信息学会
Chinese Information Processing Society of China
首页
学会简介
学会领导
学会办公室
工作委员会
专业委员会
学术活动
发展会员
钱伟长中文信息处理奖
科技工作者之家
中文信息学报
新书介绍
按年代和期次浏览(最新数据: 2000年第3期)
“调素”论与普通话合成自然度的提高
[作者]郑新春; 柴佩琪;

[摘要]文献 [1 ]中的“边缘调素脱落论”提出了音步内各音节合成时边缘调素脱落的一般规律 ,其规律较好地揭示了两字词和三字词的轻重音分布原则和连续变调的现象 ;本文在此基础上 ,进一步提出了音步间各音步合成时的“边缘调素脱落论” ,该规律揭示了四字以上词的“调素”脱落规则以及轻重音分布原则 ,通过该规律的应用 ,提高了句子合成的自然度。

[Abstract]The rule of peeling off edge tonal element in syllable concatenation within a sound foot,presented in“edge tonal element peeling theory" [1] ,can be used to explain the stress principle on two syllable and three syllable words and the phenomenon of tone sandhi.In this paper a new rule of peeling off tonal element in sound foot concatenation is further developed.It is effective to explain the stress principle and the edge tonal element peeling principle on four seven syllable words successfully as wel...
[关键字]调素; 音步; 轻重音原则; 自然度;



基于支持向量机的手写体相似字识别
[作者]田盛丰; 黄厚宽; 李洪波;

[摘要]本文提出对手写相似汉字进行识别的支持向量机方法。该方法与人工神经网络一样适用于小规模分类 ,但由于支持向量机依据结构风险最小化原则 ,因此泛化能力更强。并且 ,由于支持向量机算法是一个凸二次优化问题 ,能够保证找到的极值解就是全局最优解。本文用支持向量机算法对三组手写相似汉字进行了识别 ,取得了较好的结果。

[Abstract]This paper presents a recognition method of similar Chinese handwriting by support vector machine. This method can be used to small scale recognition, like artificial neural networks, but it has stronger generalization ability because the support vector machine theory is based on the minimization principle to structure risk. Because the algorithm of support vector machine is a convex quadratic optimization problem, the local optimal solution is certainly the global optimal one. This paper presents a sample ...
[关键字]汉字识别; 相似字识别; 支持向量机;



文本自动分类中的词权重与分类算法
[作者]刁倩; 王永成; 张惠惠; 何骥;

[摘要]本文详细阐述了自动分类中的词与文献的相关权重的经典计算方法IDF(InverseDocumentFrequency) ,进一步总结了两种典型的分类算法———Bayes判别准则与向量空间模型 (VSM) ,并提出结合词权重和分类算法进行分类的具体公式以及相关实验结果。

[Abstract]In this paper,a classical term weighting method——IDF(Inverse Document Frequency) is discussed detailedly.The two important classification algorithms——Bayes Judge Rule and VSM (Vector Space Model) are summarized.Furthermore,the way of how to combine term weighting methods with two classification algorithm is also provided in the paper.
[关键字]自动分类; IDF; Bayes判别准则; 向量空间模型(VSM);



Zipf定律与汉字字频分布
[作者]游荣彦;

[摘要]本文证明了在以Zipf定律描述整个汉字字频分布时 ,不管如何精心挑选参数a和c,一些累计拟合频率都有明显的误差。针对这一现象 ,本文提出了一个解决办法 ,那就是以Zipf定律仅描述汉字字频分布的尾部的方法。

[Abstract]This paper first proves that when the Zipf's law is used to describe the total distribution of Chinese Character frequency,some fitting frequence accumulations take obvious errors no matter how elaborately the parameters a and care selected,and then presents a method to overcome the shortcoming by using the Zipf's law to describe only the tail of the distribution.
[关键字]计量语言模型; 汉字字频分布; Zipf定律; 拟合频率;



CJK汉字字形文件处理器的实现
[作者]程波; 孙玉芳;

[摘要]XWindow在汉字字形方面一直只支持点阵字形 ,另外字符集也只是针对GB2 31 2- 80 ,无法满足对大字符集汉字应用的需要。本文侧重于在XWindow下“透明地”提供对CJK统一汉字字符集TrueType字形的支持。文章介绍了XServer的结构 ,详细描述了汉字字形文件处理器实现算法。

[Abstract]X Window provided Chinese character only with bitmap font,and supported only GB2312-80 Chinese character set,not meet the need of people for CJK unified character set. The paper lays emphasis on providing with CJK unified character set Chinese TrueType font supporting on X Window.This paper outlines the X Server architecture,presents in detail the implementation algorithm of Chinese font file renderer.
[关键字]汉字; 字形文件处理器; CJK统一汉字字符集; TrueType; 字形管理库; GBK;



一种基于句法语义特征的汉语句法分析器
[作者]杨开城;

[摘要]句法分析不是简单地符号推理 ,而应该是一种实体推理。增加语义信息是实现句法分析实体推理的有效手段。本文所介绍的句法分析器有两个特色 :一是利用基于词的兼类处理规则大大提高了句法分析的效率 ;二是利用词静态和动态的句法语义特征来限制句法规则过强的生成能力 ,取得了较好的效果。

[Abstract]Syntax parsing is not a kind of symbol inference,but a kind of entity inference.Adding semantic information during syntax parsing is an effective method to implement entity inference.The syntax parser,introduced by this paper,have two features.One feature is that to use clear up rule of multi parts of speech based words to improve the efficiency of parser,the other featureis that to use static and dynamic syntax semantic properties of words to limit the too strong power of syntax rules.The parser gain p...
[关键字]句法语义特征; 句法分析; 兼类消解;



手写文稿识别的一种后处理方法和系统集成
[作者]蔡樱; 盛立东;

[摘要]本文提出了一种词间匹配的后处理方法 ,利用汉语上下文中词和词之间有一定的联系来对识别结果进行纠错 ,并综合词匹配、词间匹配和Markov语言模型使各环节之间形成反馈 ,相互补偿 ,形成一个较好的组合 ,以此来提高后处理部分的纠错能力以及稳定性。

[Abstract]This paper puts forward a post processing approach based on words intermatching.It takes full advantage of the relationship between words,with words innermatching、words intermatching and Markov Model.Each reinforces another.As a result,the combination produces the ability of error correcting and the stability of post processing.
[关键字]汉字识别; 后处理; 词间匹配; 词匹配; Markov语言模型;



基于小波网络和多模块网络的数字识别
[作者]宋红萍; 刘宏超; 全安寿; 崔晋川; 汤映杰;

[摘要]本文研究一种新的数字识别方法 ,这种方法用小波神经网络抽取特征、用多模块结构神经网络作模式分类器。小波分解的函数近似能力和人工神经网络的学习能力结合起来形成的小波神经网络 ,有着良好的特征描述性能 ,可用作特征抽取工具。多模块结构的神经网络将一个k类的模式分类问题转换为k个互相独立的 2类分类问题。这种结构将一个复杂的分类问题化解为多个简单的分类问题 ,各个模块互相并联 ,各自负责一种模式的识别。用这种修改过的多模块结构网络的BP训练方法 ,可加速训练和提高训练精度 ,并且各模块可互相独立地进行训练。用美国NIST数字样本进行训练及测试 ,结果良好。这种方法可用于更广泛的平面图形识别。

[Abstract]In this paper,We present a new handwritten digit recognition method based on wavelet networks and modular neural networks.Approximation ability of wavelet decomposition for general continuous functions and learning ability of neural networks can be combined to form the wavelet networks which has a nice feature descriptive ability,and can be used as a feature extraction tools.The modular neural networks is used as a classifier,it reduce a k class classification problems into a set of two class classificati...
[关键字]数字识别; 小波神经网络; 多模块神经网络;



基于组件的中文版面分析
[作者]刘定强; 张炘中;

[摘要]本文提出基于组件的中文版面分析方法。本方法是以自底向上为主 ,同时结合了自顶向下和基于组件的思想。基于组件的思想使得算法结构清晰、图象扫描次数被尽可能地减少 ;以自底向上为主、同时结合自顶向下的方法具有效率高、准确性高、文档适应性广的特点。二维有序树型结构的文档及其组件的组织形式 ,提高了频繁的查找操作速度 ,同时为版面描述和分析结果的应用提供了方便。

[Abstract]This paper presents a methord for Chinese document layout analysis based on component. This methord mostly bases on a bottom up approach,it also benefits from a top down approach and a concept of component.The concept of component lets the methord have a clear structure and reduces the times of scanning picture.Union the bottom up approach and the top down approach lets the methord have a high efficiency,precision and adaptability.We use a two dimensional orderly tree structure to organize document and...
[关键字]版面分析; 自底向上; 组件;



汉字识别中以词为分类单位的分类器研究
[作者]方应谦; 王鲁;

[摘要]汉字识别中 ,以往的分类器设计都是以字为单位的“字分类器”。字分类器的输出总是与待识字结构相似的一个侯选字集合。这是使后级识别容易产生误识的主要原因。为克服字分类器的缺点 ,本文给出了以词为单位的词分类器设计的策略与方法 ,并实验验证了词分类器在分类率及分类速度方面均优于字分类器。

[Abstract]In Chinese Character Recognition,the classifier was designed as word classifier whose classification unit is a word in the past.The output of word classifier is always a set of candidate words that are similar with await recognised words in structure of word.It is the primary reason that make mistakes in post level recognition.To overcome disadvantage of word classifier,the strategy and method of phrase classifier designing whose classification unit is phrase are proposed.The experiments results prove tha...
[关键字]汉字识别; 分类; 词分类器;



共95页 当前第46页 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 
41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95   
©中国中文信息学会 1981-2007
京ICP备05039057号