Phoenix: Democratizing ChatGPT across Languages
- URL: http://arxiv.org/abs/2304.10453v1
- Date: Thu, 20 Apr 2023 16:50:04 GMT
- Title: Phoenix: Democratizing ChatGPT across Languages
- Authors: Zhihong Chen, Feng Jiang, Junying Chen, Tiannan Wang, Fei Yu, Guiming
Chen, Hongbo Zhang, Juhao Liang, Chen Zhang, Zhiyi Zhang, Jianquan Li, Xiang
Wan, Benyou Wang, Haizhou Li
- Abstract summary: We release a large language model "Phoenix", achieving competitive performance among open-source English and Chinese models.
We believe this work will be beneficial to make ChatGPT more accessible, especially in countries where people cannot use ChatGPT due to restrictions from OpenAI or local goverments.
- Score: 68.75163236421352
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: This paper presents our efforts to democratize ChatGPT across language. We
release a large language model "Phoenix", achieving competitive performance
among open-source English and Chinese models while excelling in languages with
limited resources (covering both Latin and non-Latin languages). We believe
this work will be beneficial to make ChatGPT more accessible, especially in
countries where people cannot use ChatGPT due to restrictions from OpenAI or
local goverments. Our data, code, and models are available at
https://github.com/FreedomIntelligence/LLMZoo.
Related papers
- ChatGPT, Let us Chat Sign Language: Experiments, Architectural Elements,
Challenges and Research Directions [0.0]
ChatGPT is a language model based on Generative AI.
It can translate from English to American (ASL), Australian (AUSLAN), and British (BSL) sign languages and from Arabic Sign Language (ArSL) to English with only one prompt iteration.
However, the model failed to translate from Arabic to ArSL and ASL, AUSLAN, and BSL to Arabic.
arXiv Detail & Related papers (2024-01-10T13:39:49Z) - Fumbling in Babel: An Investigation into ChatGPT's Language Identification Ability [15.274404016420737]
We study ChatGPT's (both GPT-3.5 and GPT-4) ability to identify language names and language codes.
When compared to smaller finetuned LID tools, we find that ChatGPT lags behind.
We conclude that current large language models would benefit from further development before they can sufficiently serve diverse communities.
arXiv Detail & Related papers (2023-11-16T09:12:20Z) - Primacy Effect of ChatGPT [69.49920102917598]
We study the primacy effect of ChatGPT: the tendency of selecting the labels at earlier positions as the answer.
We hope that our experiments and analyses provide additional insights into building more reliable ChatGPT-based solutions.
arXiv Detail & Related papers (2023-10-20T00:37:28Z) - Are Large Language Models a Threat to Digital Public Goods? Evidence
from Activity on Stack Overflow [1.5039745292757671]
We investigate how the release of ChatGPT changed human-generated open data on the web.
We find that relative to its Russian and Chinese counterparts, where access to ChatGPT is limited, activity on Stack Overflow significantly decreased.
arXiv Detail & Related papers (2023-07-14T14:22:12Z) - ChatLog: Carefully Evaluating the Evolution of ChatGPT Across Time [54.18651663847874]
ChatGPT has achieved great success and can be considered to have acquired an infrastructural status.
Existing benchmarks encounter two challenges: (1) Disregard for periodical evaluation and (2) Lack of fine-grained features.
We construct ChatLog, an ever-updating dataset with large-scale records of diverse long-form ChatGPT responses for 21 NLP benchmarks from March, 2023 to now.
arXiv Detail & Related papers (2023-04-27T11:33:48Z) - ChatGPT Beyond English: Towards a Comprehensive Evaluation of Large
Language Models in Multilingual Learning [70.57126720079971]
Large language models (LLMs) have emerged as the most important breakthroughs in natural language processing (NLP)
This paper evaluates ChatGPT on 7 different tasks, covering 37 diverse languages with high, medium, low, and extremely low resources.
Compared to the performance of previous models, our extensive experimental results demonstrate a worse performance of ChatGPT for different NLP tasks and languages.
arXiv Detail & Related papers (2023-04-12T05:08:52Z) - Prompting Multilingual Large Language Models to Generate Code-Mixed
Texts: The Case of South East Asian Languages [47.78634360870564]
We explore prompting multilingual models to generate code-mixed data for seven languages in South East Asia (SEA)
We find that publicly available multilingual instruction-tuned models such as BLOOMZ are incapable of producing texts with phrases or clauses from different languages.
ChatGPT exhibits inconsistent capabilities in generating code-mixed texts, wherein its performance varies depending on the prompt template and language pairing.
arXiv Detail & Related papers (2023-03-23T18:16:30Z) - ChatGPT: Beginning of an End of Manual Linguistic Data Annotation? Use
Case of Automatic Genre Identification [0.0]
ChatGPT has shown strong capabilities in natural language generation tasks, which naturally leads researchers to explore where its abilities end.
We compare ChatGPT with a multilingual XLM-RoBERTa language model that was fine-tuned on datasets, manually annotated with genres.
Results show that ChatGPT outperforms the fine-tuned model when applied to the dataset which was not seen before by either of the models.
arXiv Detail & Related papers (2023-03-07T14:59:33Z) - Is ChatGPT A Good Translator? Yes With GPT-4 As The Engine [97.8609714773255]
We evaluate ChatGPT for machine translation, including translation prompt, multilingual translation, and translation robustness.
ChatGPT performs competitively with commercial translation products but lags behind significantly on low-resource or distant languages.
With the launch of the GPT-4 engine, the translation performance of ChatGPT is significantly boosted.
arXiv Detail & Related papers (2023-01-20T08:51:36Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.