Chinese Traditional Poetry Generating System Based on Deep Learning
- URL: http://arxiv.org/abs/2110.12335v1
- Date: Sun, 24 Oct 2021 02:43:03 GMT
- Title: Chinese Traditional Poetry Generating System Based on Deep Learning
- Authors: Chenlei Bao and Lican Huang
- Abstract summary: This paper proposes an automatic generation method of Chinese traditional poetry based on deep learning technology.
It extracts keywords from each poem and matches them with the previous text to make the poem conform to the theme.
When a user inputs a paragraph of text, the machine obtains the theme and generates a poem sentence by sentence.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Chinese traditional poetry is an important intangible cultural heritage of
China and an artistic carrier of thought, culture, spirit and emotion. However,
because ancient poetry follows strict rules, it is very difficult for a machine
to write it. This paper proposes a deep-learning method for automatically
generating Chinese traditional poetry. The method extracts keywords from each
poem and matches them with the preceding text so that the poem conforms to the
theme; when a user inputs a paragraph of text, the machine obtains the theme
and generates the poem sentence by sentence. The classic word2vec model is used
for preprocessing: Chinese characters, which the computer cannot process
directly, are transformed into a matrix for processing. A bidirectional Long
Short-Term Memory (Bi-LSTM) network is used as the neural network model to
generate Chinese characters one by one while keeping their meaning as accurate
as possible. At the same time, TF-IDF and TextRank are used to extract
keywords. An attention-based encoder-decoder model, adapted to the task,
strengthens important long-distance information so that the key points are
captured without losing important information. For emotion judgment, a Long
Short-Term Memory network is used. The final results show that the system
produces good poetry outputs from the user's input text.
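As an illustration of the keyword-extraction step mentioned in the abstract, the following is a minimal TF-IDF sketch in plain Python. It is not the authors' implementation: the abstract does not specify the tokenizer, the corpus, or the TF-IDF variant. The function name `tfidf_keywords`, the toy English tokens, and the weighting (term frequency as count divided by document length, inverse document frequency as log(N / df)) are all assumptions made for the example.

```python
import math
from collections import Counter


def tfidf_keywords(docs, doc_index, top_k=2):
    """Return the top_k highest-scoring TF-IDF tokens of docs[doc_index].

    Hypothetical helper for illustration; docs is a list of token lists.
    """
    n_docs = len(docs)
    # Document frequency: number of documents that contain each token.
    df = Counter()
    for doc in docs:
        df.update(set(doc))
    # Term frequency within the selected document.
    tf = Counter(docs[doc_index])
    total = len(docs[doc_index])
    scores = {
        tok: (count / total) * math.log(n_docs / df[tok])
        for tok, count in tf.items()
    }
    # Sort by descending score; break ties alphabetically for determinism.
    ranked = sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))
    return [tok for tok, _ in ranked[:top_k]]


# Toy corpus (assumed data): each "poem" is already split into tokens.
poems = [
    ["moon", "river", "night", "moon"],
    ["river", "spring", "flower"],
    ["night", "wind", "river"],
]

keywords = tfidf_keywords(poems, 0)
print(keywords)
```

In this toy corpus, "river" occurs in every poem, so its inverse document frequency is zero and it is never ranked above a token that is specific to one poem. A real system would tokenize Chinese text first and would likely combine these scores with a graph-based ranker such as TextRank, as the abstract indicates.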
Related papers
- Semi-supervised Chinese Poem-to-Painting Generation via Cycle-consistent Adversarial Networks [2.250406890348191]
We propose a semi-supervised approach using cycle-consistent adversarial networks to leverage the limited paired data.
We introduce novel evaluation metrics to assess the quality, diversity, and consistency of the generated poems and paintings.
The proposed model outperforms previous methods, showing promise in capturing the symbolic essence of artistic expression.
arXiv Detail & Related papers (2024-10-25T04:57:44Z)
- Chinese Text Recognition with A Pre-Trained CLIP-Like Model Through Image-IDS Aligning [61.34060587461462]
We propose a two-stage framework for Chinese Text Recognition (CTR).
We pre-train a CLIP-like model by aligning printed character images with Ideographic Description Sequences (IDS).
This pre-training stage simulates humans recognizing Chinese characters and obtains the canonical representation of each character.
The learned representations are employed to supervise the CTR model, such that traditional single-character recognition can be improved to text-line recognition.
arXiv Detail & Related papers (2023-09-03T05:33:16Z)
- BACON: Deep-Learning Powered AI for Poetry Generation with Author Linguistic Style Transfer [91.3755431537592]
This paper describes BACON, a prototype of an automatic poetry generator with author linguistic style transfer.
It combines concepts and techniques from finite state machinery, probabilistic models, artificial neural networks and deep learning to write original poetry with rich aesthetic qualities in the style of any given author.
arXiv Detail & Related papers (2021-12-14T00:08:36Z)
- TransCouplet: Transformer based Chinese Couplet Generation [1.084959821967413]
The Chinese couplet is a form of poetry with complex syntax, written in ancient Chinese.
This paper presents a transformer-based sequence-to-sequence couplet generation model.
We also evaluate the effect of Glyph, PinYin and Part-of-Speech tagging on the couplet grammatical rules.
arXiv Detail & Related papers (2021-12-03T04:34:48Z)
- ChineseBERT: Chinese Pretraining Enhanced by Glyph and Pinyin Information [32.70080326854314]
We propose ChineseBERT, which incorporates the glyph and pinyin information of Chinese characters into language model pretraining.
The proposed ChineseBERT model yields a significant performance boost over baseline models with fewer training steps.
arXiv Detail & Related papers (2021-06-30T13:06:00Z)
- CCPM: A Chinese Classical Poetry Matching Dataset [50.90794811956129]
We propose a novel task to assess a model's semantic understanding of poetry by poem matching.
This task requires the model to select one line of Chinese classical poetry among four candidates according to the modern Chinese translation of a line of poetry.
To construct this dataset, we first obtain a set of parallel data of Chinese classical poetry and modern Chinese translation.
arXiv Detail & Related papers (2021-06-03T16:49:03Z)
- SHUOWEN-JIEZI: Linguistically Informed Tokenizers For Chinese Language Model Pretraining [48.880840711568425]
We study the influences of three main factors on the Chinese tokenization for pretrained language models.
We propose three kinds of tokenizers, including 1) SHUOWEN (meaning Talk Word), the pronunciation-based tokenizers; 2) JIEZI (meaning Solve Character), the glyph-based tokenizers.
We find that SHUOWEN and JIEZI tokenizers can generally outperform conventional single-character tokenizers.
arXiv Detail & Related papers (2021-06-01T11:20:02Z)
- Lattice-BERT: Leveraging Multi-Granularity Representations in Chinese Pre-trained Language Models [62.41139712595334]
We propose a novel pre-training paradigm for Chinese -- Lattice-BERT.
We construct a lattice graph from the characters and words in a sentence and feed all these text units into transformers.
We show that our model can bring an average increase of 1.5% under the 12-layer setting.
arXiv Detail & Related papers (2021-04-15T02:36:49Z)
- Generating Major Types of Chinese Classical Poetry in a Uniformed Framework [88.57587722069239]
We propose a GPT-2 based framework for generating major types of Chinese classical poems.
Preliminary results show this enhanced model can generate Chinese classical poems of major types with high quality in both form and content.
arXiv Detail & Related papers (2020-03-13T14:16:25Z)
This list is automatically generated from the titles and abstracts of the papers on this site.