Home Catalogue search

eng

Refine your search:

Search in the Catalogues and Directories






	Sort by
Simple Search

Page: 1 2 3 4

Hits 1 – 20 of 69

1	『現代日本語書き言葉均衡コーパス』出版書籍サンプルのNDC別語彙分布
	加藤祥; 浅原正幸; Sachi KATO...
	In: https://ccd.ninjal.ac.jp/lrw2021.html (2021)
	BASE
	Show details

2	Word Delimitation Issues in UD Japanese
	Mai Omura; Aya Wakasa; Masayuki Asahara. - : Association for Computational Linguistics, 2021
	BASE
	Show details

3	編集後記
	浅原正幸; Masayuki ASAHARA
	In: https://ccd.ninjal.ac.jp/lrw2021.html (2021)
	BASE
	Show details

4	Lower Perplexity is Not Always Human-Like
	Tatsuki Kuribayashi; Yohei Oseki; Takumi Ito. - : Association for Computational Linguistics, 2021
	BASE
	Show details

5	ALICE++ : Adversarial Training for Robust and Effective Temporal Reasoning
	Lis Kanashiro Pereira; Fei Cheng; Masayuki Asahara. - : Association for Computational Linguistics, 2021
	BASE
	Show details

6	『分類語彙表』に対する反対語情報付与
	加藤祥; 浅原正幸; 森山奈々美. - : 言語処理学会, 2021
	BASE
	Show details

7	The Annotation of Antonym Information in the 'Word List by Semantic Principles'
	Sachi Kato; Masayuki Asahara; Nanami Moriyama. - : Association for Computational Linguistics, 2021
	BASE
	Show details

8	『現代日本語書き言葉均衡コーパス』新聞記事情報を用いたジャンル別語彙分布
	加藤祥; 森山奈々美; 浅原正幸...
	In: https://ccd.ninjal.ac.jp/lrw2021.html (2021)
	BASE
	Show details

9	『現代日本語書き言葉均衡コーパス』書籍サンプルのNDC情報増補 : NDC情報を用いた随筆の抽出と文体調査
	加藤祥; 森山奈々美; 浅原正幸; Sachi KATO; Nanami MORIYAMA; Masayuki ASAHARA. - : 国立国語研究所, 2021
	Abstract: 目白大学 ; 国立国語研究所コーパス開発センター技術補佐員 ; 国立国語研究所コーパス開発センター ; Mejiro University ; Technical Staff, Center for Corpus Development, NINJAL ; Center for Corpus Development, NINJAL ; 本研究では『現代日本語書き言葉均衡コーパス』（BCCWJ）の書籍全サンプル22,058サンプル（PB（出版）10,117サンプル・LB（図書館）10,551サンプル・OB（ベストセラー）1,390サンプル）に付与された日本十進分類法（NDC）分類記号の補助分類を拡張した。作業は，国立国会図書館サーチのNDC情報を参照し，人手によって分類の確認と追加を行った。また，開発当時NDC分類記号が付与されていなかったサンプル（「分類なし」）などの見直しもあわせて行った。本作業結果により，たとえば形式区分を利用し，ジャンルの分散する「随筆（-049）」「理論（-01）」「教科書（-078）」などのカテゴリでBCCWJサンプルを分類することが可能となった。このほか，時代情報や小項目が追加されたサンプルもあり，今まで以上に詳細な分類が可能となった。本研究では，情報付与作業の方法と基礎情報を報告し，分類例を示す。本データを用いた研究事例として，NDC情報を用いた随筆の抽出と随筆の文体調査結果を報告する。本データは「中納言」の検索で利用できる。 ; This study presents the enlargement of Nippon Decimal Classification (NDC) metadata of book samples in the "Balanced Corpus of Contemporary Written Japanese (BCCWJ)." We revised and enhanced the NDC information about all of the book samples from the BCCWJ (22,058 samples) comprising PB (books in the publication subcorpus: 10,117 samples), LB (books in library subcorpus: 10,551 samples), and OB (books in the special-purpose subcorpus; namely, best sellers: 1,390 samples). We referred to the NDC information using the National Diet Library Search API and manually re-annotated the NDC information. In addition, we completed the empty entries of the original BCCWJ metadata. Based on these procedures, we were able to classify the BCCWJ book samples according to the genres of essay (-049), theory (-01), and textbook (-078) with the NDC supplemental tables. Furthermore, since finer-grained categories, including their chronological periods, were added to some samples, users can explore a more detailed classification of the book samples. We present the methodology of NDC information enlargement and its basic statistics. We also present experimental research on extraction essays from books and the investigation of their writing style. The compiled data can be used in the corpus query systems of "Chunagon."
	Keyword: log-likelihood ratio; Nippon Decimal Classification; writing style; “Balanced Corpus of Contemporary Written Japanese”; 『現代日本語書き言葉均衡コーパス』; 対数尤度比; 文体; 日本十進分類法
	URL: https://repository.ninjal.ac.jp/?action=repository_uri&item_id=3454 http://id.nii.ac.jp/1328/00003437/ https://repository.ninjal.ac.jp/?action=repository_action_common_download&item_id=3454&item_no=1&attribute_id=54&file_no=1
	BASE
	Hide details

10	自然言語処理 : 言語資源・意味解析（レクチャーシリーズ「人工知能の今」第6回）
	松林優一郎; 浅原正幸; Yuichiroh Matsubayashi...
	In: https://www.ai-gakkai.or.jp/vol35_no1/ (2020)
	BASE
	Show details

11	Design of BCCWJ-EEG : Balanced Corpus with Human Electroencephalography
	Yohei Oseki; Masayuki Asahara. - : European Language Resources Association, 2020
	BASE
	Show details

12	KOTONOHA : A Corpus Concordance System for Skewer-Searching NINJAL Corpora
	Teruaki Oka; Yuichi Ishimoto; Yutaka Yagi. - : European Language Resources Association, 2020
	BASE
	Show details

13	Adversarial Training for Commonsense Inference
	Lis Pereira; Xiaodong Liu; Fei Cheng. - : Association for Computational Linguistics, 2020
	BASE
	Show details

14	日本語における名詞句の情報構造と語順の相関についての統計的検討
	宮内拓也; 浅原正幸; Takuya Miyauchi. - : 言語処理学会, 2020
	BASE
	Show details

15	Composing Word Vectors for Japanese Compound Words Using Bilingual Word Embeddings
	Teruo Hirabayashi; Kanako Komiya; Masayuki Asahara. - : Association for Computational Linguistics, 2020
	BASE
	Show details

16	Generation and Evaluation of Concept Embeddings Via Fine-Tuning Using Automatically Tagged Corpus
	Kanako Komiya; Daiki Yaginuma; Masayuki Asahara. - : Association for Computational Linguistics, 2020
	BASE
	Show details

17	編集後記
	浅原正幸; Masayuki Asahara
	In: https://pj.ninjal.ac.jp/corpus_center/lrw2020.html (2020)
	BASE
	Show details

18	Automatic Creation of Correspondence Table of Meaning Tags from Two Dictionaries in One Language Using Bilingual Word Embedding
	Teruo Hirabayashi; Kanako Komiya; Masayuki Asahara. - : European Language Resources Association, 2020
	BASE
	Show details

19	Bayesian Linear Mixed Model による単語親密度推定と位相情報付与
	浅原正幸; Masayuki Asahara. - : 言語処理学会, 2020
	BASE
	Show details

20	Dynamically Updating Event Representations for Temporal Relation Classification with Multi-category Learning
	Fei Cheng; Masayuki Asahara; Ichiro Kobayashi. - : Association for Computational Linguistics, 2020
	BASE
	Show details

Page: 1 2 3 4

© 2013 - 2024 Lin|gu|is|tik | Imprint | Privacy Policy | Datenschutzeinstellungen ändern