Research on Cross-media Science and Technology Information Data
Retrieval
- URL: http://arxiv.org/abs/2204.04887v1
- Date: Mon, 11 Apr 2022 06:10:21 GMT
- Title: Research on Cross-media Science and Technology Information Data
Retrieval
- Authors: Yang Jiang and Zhe Xue and Ang Li
- Abstract summary: Cross-media technology information data has different characteristics.
Traditional science and technology information retrieval system can no longer meet the daily retrieval needs of science and technology scholars.
Cross-media science and technology information data retrieval system based on deep semantic features.
- Score: 15.265191824669555
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Since the era of big data, the Internet has been flooded with all kinds of
information. Browsing information through the Internet has become an integral
part of people's daily life. Unlike the news data and social data in the
Internet, the cross-media technology information data has different
characteristics. This data has become an important basis for researchers and
scholars to track the current hot spots and explore the future direction of
technology development. As the volume of science and technology information
data becomes richer, the traditional science and technology information
retrieval system, which only supports unimodal data retrieval and uses outdated
data keyword matching model, can no longer meet the daily retrieval needs of
science and technology scholars. Therefore, in view of the above research
background, it is of profound practical significance to study the cross-media
science and technology information data retrieval system based on deep semantic
features, which is in line with the development trend of domestic and
international technologies.
Related papers
- Web Scraping for Research: Legal, Ethical, Institutional, and Scientific Considerations [11.851771490297693]
This paper proposes a comprehensive framework for web scraping in social science research for U.S.-based researchers.
We present an overview of the current regulatory environment impacting when and how researchers can access, collect, store, and share data via scraping.
We then provide researchers with recommendations to conduct scraping in a scientifically legitimate and ethical manner.
arXiv Detail & Related papers (2024-10-30T20:20:44Z) - The Age of Synthetic Realities: Challenges and Opportunities [85.058932103181]
We highlight the crucial need for the development of forensic techniques capable of identifying harmful synthetic creations and distinguishing them from reality.
Our focus extends to various forms of media, such as images, videos, audio, and text, as we examine how synthetic realities are crafted and explore approaches to detecting these malicious creations.
This study is of paramount importance due to the rapid progress of AI generative techniques and their impact on the fundamental principles of Forensic Science.
arXiv Detail & Related papers (2023-06-09T15:55:10Z) - Advanced Data Augmentation Approaches: A Comprehensive Survey and Future
directions [57.30984060215482]
We provide a background of data augmentation, a novel and comprehensive taxonomy of reviewed data augmentation techniques, and the strengths and weaknesses (wherever possible) of each technique.
We also provide comprehensive results of the data augmentation effect on three popular computer vision tasks, such as image classification, object detection and semantic segmentation.
arXiv Detail & Related papers (2023-01-07T11:37:32Z) - Modeling Information Change in Science Communication with Semantically
Matched Paraphrases [50.67030449927206]
SPICED is the first paraphrase dataset of scientific findings annotated for degree of information change.
SPICED contains 6,000 scientific finding pairs extracted from news stories, social media discussions, and full texts of original papers.
Models trained on SPICED improve downstream performance on evidence retrieval for fact checking of real-world scientific claims.
arXiv Detail & Related papers (2022-10-24T07:44:38Z) - Research Trends and Applications of Data Augmentation Algorithms [77.34726150561087]
We identify the main areas of application of data augmentation algorithms, the types of algorithms used, significant research trends, their progression over time and research gaps in data augmentation literature.
We expect readers to understand the potential of data augmentation, as well as identify future research directions and open questions within data augmentation research.
arXiv Detail & Related papers (2022-07-18T11:38:32Z) - Profiling and Evolution of Intellectual Property [23.793136650433024]
In recent years, with the rapid growth of Internet data, the number and types of scientific and technological resources are rapidly expanding.
For technology-based enterprises or users, policies related to technology or the development of their industries should also belong to a type of scientific and technological resources.
This article focuses on the difficulties and problems in the field of science and technology policy, and introduces related technologies and developments.
arXiv Detail & Related papers (2022-04-20T09:09:39Z) - Knowledge Graph and Accurate Portrait Construction of Scientific and
Technological Academic Conferences [14.130765322587264]
In recent years, with the continuous progress of science and technology, the number of scientific research achievements is increasing day by day.
The convening of scientific and technological academic conferences will bring large number of academic papers, researchers, research institutions and other data.
It is of great significance to use deep learning technology to mine the core information in the data of scientific and technological academic conferences.
arXiv Detail & Related papers (2022-04-11T06:15:45Z) - Accurate Portraits of Scientific Resources and Knowledge Service
Components [18.014902048632912]
The main body of scientific and technological resources is composed of academic-style resources or entities such as papers, patents, authors, and research institutions.
There is a rich relationship network between resources, from which a large amount of cutting-edge scientific and technological information can be mined.
How to construct a complete and accurate representation of scientific and technological resources is an urgent problem.
arXiv Detail & Related papers (2022-04-11T06:03:29Z) - Synthetic Data: Opening the data floodgates to enable faster, more
directed development of machine learning methods [96.92041573661407]
Many ground-breaking advancements in machine learning can be attributed to the availability of a large volume of rich data.
Many large-scale datasets are highly sensitive, such as healthcare data, and are not widely available to the machine learning community.
Generating synthetic data with privacy guarantees provides one such solution.
arXiv Detail & Related papers (2020-12-08T17:26:10Z) - Data Science: A Comprehensive Overview [42.98602883069444]
The twenty-first century has ushered in the age of big data and data economy, in which data DNA has become an intrinsic constituent of all data-based organisms.
An appropriate understanding of data DNA and its organisms relies on the new field of data science and its keystone, analytics.
This article is the first in the field to draw a comprehensive big picture, in addition to offering rich observations, lessons and thinking about data science and analytics.
arXiv Detail & Related papers (2020-07-01T02:33:58Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.