Exceeding the Limits of Visual-Linguistic Multi-Task Learning
        - URL: http://arxiv.org/abs/2107.13054v1
- Date: Tue, 27 Jul 2021 19:42:14 GMT
- Title: Exceeding the Limits of Visual-Linguistic Multi-Task Learning
- Authors: Cameron R. Wolfe and Keld T. Lundgaard
- Abstract summary: We construct 1000 unique classification tasks that share similarly-structured input data.
These classification tasks focus on learning the product hierarchy of different e-commerce websites.
We solve these tasks in unison using multi-task learning (MTL)
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract:   By leveraging large amounts of product data collected across hundreds of live
e-commerce websites, we construct 1000 unique classification tasks that share
similarly-structured input data, comprised of both text and images. These
classification tasks focus on learning the product hierarchy of different
e-commerce websites, causing many of them to be correlated. Adopting a
multi-modal transformer model, we solve these tasks in unison using multi-task
learning (MTL). Extensive experiments are presented over an initial 100-task
dataset to reveal best practices for "large-scale MTL" (i.e., MTL with more
than 100 tasks). From these experiments, a final, unified methodology is
derived, which is composed of both best practices and new proposals such as
DyPa, a simple heuristic for automatically allocating task-specific parameters
to tasks that could benefit from extra capacity. Using our large-scale MTL
methodology, we successfully train a single model across all 1000 tasks in our
dataset while using minimal task specific parameters, thereby showing that it
is possible to extend several orders of magnitude beyond current efforts in
MTL.
 
      
        Related papers
        - Pilot: Building the Federated Multimodal Instruction Tuning Framework [79.56362403673354]
 Our framework integrates two stages of "adapter on adapter" into the connector of the vision encoder and the LLM.
In stage 1, we extract task-specific features and client-specific features from visual information.
In stage 2, we build the cross-task Mixture-of-Adapters(CT-MoA) module to perform cross-task interaction.
 arXiv  Detail & Related papers  (2025-01-23T07:49:24Z)
- MetaGPT: Merging Large Language Models Using Model Exclusive Task   Arithmetic [6.46176287368784]
 We propose textbfModel textbfExclusive textbfTask textbfArithmetic for merging textbfGPT-scale models.
Our proposed MetaGPT is data-agnostic and bypasses the heavy search process, making it cost-effective and easy to implement for LLMs.
 arXiv  Detail & Related papers  (2024-06-17T10:12:45Z)
- Distribution Matching for Multi-Task Learning of Classification Tasks: a
  Large-Scale Study on Faces & Beyond [62.406687088097605]
 Multi-Task Learning (MTL) is a framework, where multiple related tasks are learned jointly and benefit from a shared representation space.
We show that MTL can be successful with classification tasks with little, or non-overlapping annotations.
We propose a novel approach, where knowledge exchange is enabled between the tasks via distribution matching.
 arXiv  Detail & Related papers  (2024-01-02T14:18:11Z)
- MmAP : Multi-modal Alignment Prompt for Cross-domain Multi-task Learning [29.88567810099265]
 Multi-task learning is designed to train multiple correlated tasks simultaneously.
To tackle this challenge, we integrate the decoder-free vision-language model CLIP.
We propose Multi-modal Alignment Prompt (MmAP) for CLIP, which aligns text and visual modalities during fine-tuning process.
 arXiv  Detail & Related papers  (2023-12-14T03:33:02Z)
- STG-MTL: Scalable Task Grouping for Multi-Task Learning Using Data Map [4.263847576433289]
 Multi-Task Learning (MTL) is a powerful technique that has gained popularity due to its performance improvement over traditional Single-Task Learning (STL)
However, MTL is often challenging because there is an exponential number of possible task groupings.
We propose a new data-driven method that addresses these challenges and provides a scalable and modular solution for classification task grouping.
 arXiv  Detail & Related papers  (2023-07-07T03:54:26Z)
- An Efficient General-Purpose Modular Vision Model via Multi-Task
  Heterogeneous Training [79.78201886156513]
 We present a model that can perform multiple vision tasks and can be adapted to other downstream tasks efficiently.
Our approach achieves comparable results to single-task state-of-the-art models and demonstrates strong generalization on downstream tasks.
 arXiv  Detail & Related papers  (2023-06-29T17:59:57Z)
- Knowledge Assembly: Semi-Supervised Multi-Task Learning from Multiple
  Datasets with Disjoint Labels [8.816979799419107]
 Multi-Task Learning (MTL) is an adequate method to do so, but usually requires datasets labeled for all tasks.
We propose a method that can leverage datasets labeled for only some of the tasks in the MTL framework.
Our work, Knowledge Assembly (KA), learns multiple tasks from disjoint datasets by leveraging the unlabeled data in a semi-supervised manner.
 arXiv  Detail & Related papers  (2023-06-15T04:05:03Z)
- Diffusion Model is an Effective Planner and Data Synthesizer for
  Multi-Task Reinforcement Learning [101.66860222415512]
 Multi-Task Diffusion Model (textscMTDiff) is a diffusion-based method that incorporates Transformer backbones and prompt learning for generative planning and data synthesis.
For generative planning, we find textscMTDiff outperforms state-of-the-art algorithms across 50 tasks on Meta-World and 8 maps on Maze2D.
 arXiv  Detail & Related papers  (2023-05-29T05:20:38Z)
- Task Aware Feature Extraction Framework for Sequential Dependence
  Multi-Task Learning [1.0765359420035392]
 We analyze sequential dependence MTL from rigorous mathematical perspective.
We propose a Task Aware Feature Extraction (TAFE) framework for sequential dependence MTL.
 arXiv  Detail & Related papers  (2023-01-06T13:12:59Z)
- Mod-Squad: Designing Mixture of Experts As Modular Multi-Task Learners [74.92558307689265]
 We propose Mod-Squad, a new model that is Modularized into groups of experts (a 'Squad')
We optimize this matching process during the training of a single model.
Experiments on the Taskonomy dataset with 13 vision tasks and the PASCAL-Context dataset with 5 vision tasks show the superiority of our approach.
 arXiv  Detail & Related papers  (2022-12-15T18:59:52Z)
- When to Use Multi-Task Learning vs Intermediate Fine-Tuning for
  Pre-Trained Encoder Transfer Learning [15.39115079099451]
 Transfer learning (TL) in natural language processing has seen a surge of interest in recent years.
Three main strategies have emerged for making use of multiple supervised datasets during fine-tuning.
We compare all three TL methods in a comprehensive analysis on the GLUE dataset suite.
 arXiv  Detail & Related papers  (2022-05-17T06:48:45Z)
- Semi-supervised Multi-task Learning for Semantics and Depth [88.77716991603252]
 Multi-Task Learning (MTL) aims to enhance the model generalization by sharing representations between related tasks for better performance.
We propose the Semi-supervised Multi-Task Learning (MTL) method to leverage the available supervisory signals from different datasets.
We present a domain-aware discriminator structure with various alignment formulations to mitigate the domain discrepancy issue among datasets.
 arXiv  Detail & Related papers  (2021-10-14T07:43:39Z)
- Using a thousand optimization tasks to learn hyperparameter search
  strategies [53.318615663332274]
 We present TaskSet, a dataset of neural tasks for use in training and evaluating neurals.
 TaskSet is unique in its size and diversity, containing over a thousand tasks ranging from image classification with fully connected or convolutional networks, to variational autoencoders, to non-volume preserving flows on a variety of datasets.
 arXiv  Detail & Related papers  (2020-02-27T02:49:10Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
       
     
           This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.