ARTH: Algorithm For Reading Text Handily -- An AI Aid for People having
Word Processing Issues
- URL: http://arxiv.org/abs/2101.09464v1
- Date: Sat, 23 Jan 2021 09:39:45 GMT
- Title: ARTH: Algorithm For Reading Text Handily -- An AI Aid for People having
Word Processing Issues
- Authors: Akanksha Malhotra and Sudhir Kamle
- Abstract summary: "ARTH" is a self-learning set of algorithms that is an intelligent way of fulfilling the need for "reading and understanding the text effortlessly"
The technology "ARTH" focuses on the revival of the joy of reading among those people, who have a poor vocabulary or any word processing issues.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The objective of this project is to solve one of the major problems faced by
the people having word processing issues like trauma, or mild mental
disability. "ARTH" is the short form of Algorithm for Reading Handily. ARTH is
a self-learning set of algorithms that is an intelligent way of fulfilling the
need for "reading and understanding the text effortlessly" which adjusts
according to the needs of every user. The research project propagates in two
steps. In the first step, the algorithm tries to identify the difficult words
present in the text based on two features -- the number of syllables and usage
frequency -- using a clustering algorithm. After the analysis of the clusters,
the algorithm labels these clusters, according to their difficulty level. In
the second step, the algorithm interacts with the user. It aims to test the
user's comprehensibility of the text and his/her vocabulary level by taking an
automatically generated quiz. The algorithm identifies the clusters which are
difficult for the user, based on the result of the analysis. The meaning of
perceived difficult words is displayed next to them. The technology "ARTH"
focuses on the revival of the joy of reading among those people, who have a
poor vocabulary or any word processing issues.
Related papers
- Learning Robust Named Entity Recognizers From Noisy Data With Retrieval Augmentation [67.89838237013078]
Named entity recognition (NER) models often struggle with noisy inputs.
We propose a more realistic setting in which only noisy text and its NER labels are available.
We employ a multi-view training framework that improves robust NER without retrieving text during inference.
arXiv Detail & Related papers (2024-07-26T07:30:41Z) - Performance Prediction of Data-Driven Knowledge summarization of High
Entropy Alloys (HEAs) literature implementing Natural Language Processing
algorithms [0.0]
The goal of natural language processing (NLP) is to get a machine intelligence to process words the same way a human brain does.
Five NLP algorithms, namely, Geneism, Sumy, Luhn, Latent Semantic Analysis (LSA), and Kull-back-Liebler (KL) al-gorithm, are implemented.
Luhn algorithm has the highest accuracy score for the knowledge summarization tasks compared to the other used algorithms.
arXiv Detail & Related papers (2023-11-06T16:22:32Z) - SeqXGPT: Sentence-Level AI-Generated Text Detection [62.3792779440284]
We introduce a sentence-level detection challenge by synthesizing documents polished with large language models (LLMs)
We then propose textbfSequence textbfX (Check) textbfGPT, a novel method that utilizes log probability lists from white-box LLMs as features for sentence-level AIGT detection.
arXiv Detail & Related papers (2023-10-13T07:18:53Z) - A Gold Standard Dataset for the Reviewer Assignment Problem [117.59690218507565]
"Similarity score" is a numerical estimate of the expertise of a reviewer in reviewing a paper.
Our dataset consists of 477 self-reported expertise scores provided by 58 researchers.
For the task of ordering two papers in terms of their relevance for a reviewer, the error rates range from 12%-30% in easy cases to 36%-43% in hard cases.
arXiv Detail & Related papers (2023-03-23T16:15:03Z) - A comparison of several AI techniques for authorship attribution on
Romanian texts [0.0]
We compare AI techniques for classifying literary texts written by multiple authors.
We also introduce a new dataset composed of texts written in the Romanian language on which we have run the algorithms.
arXiv Detail & Related papers (2022-11-09T20:24:48Z) - How to transfer algorithmic reasoning knowledge to learn new algorithms? [23.335939830754747]
We investigate how we can use algorithms for which we have access to the execution trace to learn to solve similar tasks for which we do not.
We create a dataset including 9 algorithms and 3 different graph types.
We validate this empirically and show how instead multi-task learning can be used to achieve the transfer of algorithmic reasoning knowledge.
arXiv Detail & Related papers (2021-10-26T22:14:47Z) - Machine Learning for Online Algorithm Selection under Censored Feedback [71.6879432974126]
In online algorithm selection (OAS), instances of an algorithmic problem class are presented to an agent one after another, and the agent has to quickly select a presumably best algorithm from a fixed set of candidate algorithms.
For decision problems such as satisfiability (SAT), quality typically refers to the algorithm's runtime.
In this work, we revisit multi-armed bandit algorithms for OAS and discuss their capability of dealing with the problem.
We adapt them towards runtime-oriented losses, allowing for partially censored data while keeping a space- and time-complexity independent of the time horizon.
arXiv Detail & Related papers (2021-09-13T18:10:52Z) - Evaluating Various Tokenizers for Arabic Text Classification [4.110108749051656]
We introduce three new tokenization algorithms for Arabic and compare them to three other baselines using unsupervised evaluations.
Our experiments show that the performance of such tokenization algorithms depends on the size of the dataset, type of the task, and the amount of morphology that exists in the dataset.
arXiv Detail & Related papers (2021-06-14T16:05:58Z) - Research on Annotation Rules and Recognition Algorithm Based on Phrase
Window [4.334276223622026]
We propose labeling rules based on phrase windows, and designed corresponding phrase recognition algorithms.
The labeling rule uses phrases as the minimum unit, di-vides sentences into 7 types of nestable phrase types, and marks the grammatical dependencies between phrases.
The corresponding algorithm, drawing on the idea of identifying the target area in the image field, can find the start and end positions of various phrases in the sentence.
arXiv Detail & Related papers (2020-07-07T00:19:47Z) - TextScanner: Reading Characters in Order for Robust Scene Text
Recognition [60.04267660533966]
TextScanner is an alternative approach for scene text recognition.
It generates pixel-wise, multi-channel segmentation maps for character class, position and order.
It also adopts RNN for context modeling and performs paralleled prediction for character position and class.
arXiv Detail & Related papers (2019-12-28T07:52:00Z) - Optimal Clustering from Noisy Binary Feedback [75.17453757892152]
We study the problem of clustering a set of items from binary user feedback.
We devise an algorithm with a minimal cluster recovery error rate.
For adaptive selection, we develop an algorithm inspired by the derivation of the information-theoretical error lower bounds.
arXiv Detail & Related papers (2019-10-14T09:18:26Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.