Style Classification of Rabbinic Literature for Detection of Lost
Midrash Tanhuma Material
- URL: http://arxiv.org/abs/2211.09710v3
- Date: Mon, 24 Jul 2023 05:39:27 GMT
- Title: Style Classification of Rabbinic Literature for Detection of Lost
Midrash Tanhuma Material
- Authors: Shlomo Tannor, Nachum Dershowitz, Moshe Lavee
- Abstract summary: We propose a system for classification of rabbinic literature based on its style.
We show how this method can be applied to uncover lost material from a specific midrash genre.
- Score: 1.933681537640272
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Midrash collections are complex rabbinic works that consist of text in
multiple languages, which evolved through long processes of unstable oral and
written transmission. Determining the origin of a given passage in such a
compilation is not always straightforward and is often a matter of dispute
among scholars, yet it is essential for scholars' understanding of the passage
and its relationship to other texts in the rabbinic corpus. To help solve this
problem, we propose a system for classification of rabbinic literature based on
its style, leveraging recent advances in natural language processing for Hebrew
texts. Additionally, we demonstrate how this method can be applied to uncover
lost material from a specific midrash genre, Tan\d{h}uma-Yelammedenu, that has
been preserved in later anthologies.
Related papers
- A computational system to handle the orthographic layer of tajwid in contemporary Quranic Orthography [0.0]
We explore the systematicity of the rules of tajwid, as they are encountered in the Cairo Quran.<n>We develop a python module that can remove or add the orthographic layer of tajwid from a Quranic text in CQO.
arXiv Detail & Related papers (2025-05-16T15:41:51Z) - Story Grammar Semantic Matching for Literary Study [3.686808512438363]
We propose a more transparent approach that makes use of story structure and related elements.
We label prose and epic poetry with story element labels and perform semantic matching by only considering these labels as features.
This new method, Story Grammar Semantic Matching, guides literary scholars to allusions and other semantic similarities across texts in a way that allows for characterizing patterns and literary technique.
arXiv Detail & Related papers (2025-02-17T19:20:39Z) - Critical biblical studies via word frequency analysis: unveiling text authorship [7.2762881851201255]
We aim to differentiate between three distinct authors across numerous chapters spanning the first nine books of the Bible.
Our analysis indicates that the first two authors (D and DtrH) are much more closely related compared to P, a fact that aligns with expert assessments.
arXiv Detail & Related papers (2024-10-24T22:08:38Z) - Puzzle Pieces Picker: Deciphering Ancient Chinese Characters with Radical Reconstruction [73.26364649572237]
Oracle Bone Inscriptions is one of the oldest existing forms of writing in the world.
A large number of Oracle Bone Inscriptions (OBI) remain undeciphered, making it one of the global challenges in paleography today.
This paper introduces a novel approach, namely Puzzle Pieces Picker (P$3$), to decipher these enigmatic characters through radical reconstruction.
arXiv Detail & Related papers (2024-06-05T07:34:39Z) - A Dataset for Metaphor Detection in Early Medieval Hebrew Poetry [35.963626057019034]
Poetry is prominent in late antique and medieval Hebrew texts.
Distinguishing figurative and literal language use is a major task for scholars of the Humanities.
This paper presents a new, challenging dataset of late antique and medieval Hebrew poetry with expert annotations of metaphor.
arXiv Detail & Related papers (2024-02-27T10:09:40Z) - Don't lose the message while paraphrasing: A study on content preserving
style transfer [61.38460184163704]
Content preservation is critical for real-world applications of style transfer studies.
We compare various style transfer models on the example of the formality transfer domain.
We conduct a precise comparative study of several state-of-the-art techniques for style transfer.
arXiv Detail & Related papers (2023-08-17T15:41:08Z) - StoryTrans: Non-Parallel Story Author-Style Transfer with Discourse
Representations and Content Enhancing [73.81778485157234]
Long texts usually involve more complicated author linguistic preferences such as discourse structures than sentences.
We formulate the task of non-parallel story author-style transfer, which requires transferring an input story into a specified author style.
We use an additional training objective to disentangle stylistic features from the learned discourse representation to prevent the model from degenerating to an auto-encoder.
arXiv Detail & Related papers (2022-08-29T08:47:49Z) - Tortured phrases: A dubious writing style emerging in science. Evidence
of critical issues affecting established journals [69.76097138157816]
Probabilistic text generators have been used to produce fake scientific papers for more than a decade.
Complex AI-powered generation techniques produce texts indistinguishable from that of humans.
Some websites offer to rewrite texts for free, generating gobbledegook full of tortured phrases.
arXiv Detail & Related papers (2021-07-12T20:47:08Z) - Disambiguatory Signals are Stronger in Word-initial Positions [48.18148856974974]
We point out the confounds in existing methods for comparing the informativeness of segments early in the word versus later in the word.
We find evidence across hundreds of languages that indeed there is a cross-linguistic tendency to front-load information in words.
arXiv Detail & Related papers (2021-02-03T18:19:16Z) - Deep Learning for Text Style Transfer: A Survey [71.8870854396927]
Text style transfer is an important task in natural language generation, which aims to control certain attributes in the generated text.
We present a systematic survey of the research on neural text style transfer, spanning over 100 representative articles since the first neural text style transfer work in 2017.
We discuss the task formulation, existing datasets and subtasks, evaluation, as well as the rich methodologies in the presence of parallel and non-parallel data.
arXiv Detail & Related papers (2020-11-01T04:04:43Z) - Artificial intelligence based writer identification generates new
evidence for the unknown scribes of the Dead Sea Scrolls exemplified by the
Great Isaiah Scroll (1QIsaa) [5.285396202883411]
We use pattern recognition and artificial intelligence techniques to innovate the palaeography of the scrolls regarding writer identification.
Although many scholars believe that 1QIsaa was written by one scribe, we report new evidence for a breaking point in the series of columns in this scroll.
This study sheds new light on the Bible's ancient scribal culture by providing new, tangible evidence that ancient biblical texts were not copied by a single scribe only but that multiple scribes could closely collaborate on one particular manuscript.
arXiv Detail & Related papers (2020-10-27T17:36:18Z) - Restoration of Fragmentary Babylonian Texts Using Recurrent Neural
Networks [14.024892678242379]
The main source of information regarding ancient Mesopotamian history and culture are clay cuneiform tablets.
Despite being an invaluable resource, many tablets are fragmented leading to missing information.
In this work we investigate the possibility of assisting scholars and even automatically completing the breaks in ancient Akkadian texts from Achaemenid period Babylonia by modelling the language using recurrent neural networks.
arXiv Detail & Related papers (2020-03-04T06:36:50Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.