      Can ChatGPT Boost Artistic Creation: The Need of Imaginative Intelligence for Parallel Art

Most cited references (33)

          Language Models are Few-Shot Learners

Recent work has demonstrated substantial gains on many NLP tasks and benchmarks by pre-training on a large corpus of text followed by fine-tuning on a specific task. While typically task-agnostic in architecture, this method still requires task-specific fine-tuning datasets of thousands or tens of thousands of examples. By contrast, humans can generally perform a new language task from only a few examples or from simple instructions - something which current NLP systems still largely struggle to do. Here we show that scaling up language models greatly improves task-agnostic, few-shot performance, sometimes even reaching competitiveness with prior state-of-the-art fine-tuning approaches. Specifically, we train GPT-3, an autoregressive language model with 175 billion parameters, 10x more than any previous non-sparse language model, and test its performance in the few-shot setting. For all tasks, GPT-3 is applied without any gradient updates or fine-tuning, with tasks and few-shot demonstrations specified purely via text interaction with the model. GPT-3 achieves strong performance on many NLP datasets, including translation, question-answering, and cloze tasks, as well as several tasks that require on-the-fly reasoning or domain adaptation, such as unscrambling words, using a novel word in a sentence, or performing 3-digit arithmetic. At the same time, we also identify some datasets where GPT-3's few-shot learning still struggles, as well as some datasets where GPT-3 faces methodological issues related to training on large web corpora. Finally, we find that GPT-3 can generate samples of news articles which human evaluators have difficulty distinguishing from articles written by humans. We discuss broader societal impacts of this finding and of GPT-3 in general.
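The few-shot setting described in this abstract can be illustrated with a short prompt: the task and its demonstrations are given purely as text, and the model weights are never updated. The sketch below is illustrative only; the Hugging Face Transformers library and the small "gpt2" model are assumptions standing in for GPT-3, not part of the cited paper.

from transformers import pipeline  # assumes Hugging Face Transformers is installed

# Few-shot prompting sketch: the task is specified entirely through text
# demonstrations in the prompt, with no gradient updates or fine-tuning.
generator = pipeline("text-generation", model="gpt2")  # small stand-in for GPT-3

prompt = (
    "Translate English to French.\n"
    "sea otter => loutre de mer\n"
    "peppermint => menthe poivrée\n"
    "cheese =>"
)

# The text the model appends after "cheese =>" is its few-shot answer.
print(generator(prompt, max_new_tokens=5)[0]["generated_text"])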

            Training language models to follow instructions with human feedback

            Making language models bigger does not inherently make them better at following a user's intent. For example, large language models can generate outputs that are untruthful, toxic, or simply not helpful to the user. In other words, these models are not aligned with their users. In this paper, we show an avenue for aligning language models with user intent on a wide range of tasks by fine-tuning with human feedback. Starting with a set of labeler-written prompts and prompts submitted through the OpenAI API, we collect a dataset of labeler demonstrations of the desired model behavior, which we use to fine-tune GPT-3 using supervised learning. We then collect a dataset of rankings of model outputs, which we use to further fine-tune this supervised model using reinforcement learning from human feedback. We call the resulting models InstructGPT. In human evaluations on our prompt distribution, outputs from the 1.3B parameter InstructGPT model are preferred to outputs from the 175B GPT-3, despite having 100x fewer parameters. Moreover, InstructGPT models show improvements in truthfulness and reductions in toxic output generation while having minimal performance regressions on public NLP datasets. Even though InstructGPT still makes simple mistakes, our results show that fine-tuning with human feedback is a promising direction for aligning language models with human intent.
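The three training stages described in this abstract can be summarized in a short schematic. This is a hedged outline, not the paper's implementation: the function names are placeholders and the bodies are intentionally left empty.

# Schematic of the InstructGPT-style pipeline described above
# (placeholder functions only; not the paper's code).

def supervised_finetune(base_model, demonstrations):
    """Stage 1: fine-tune the base model on labeler-written demonstrations."""
    ...

def train_reward_model(sft_model, ranked_outputs):
    """Stage 2: fit a reward model to human rankings of sampled model outputs."""
    ...

def rlhf_finetune(sft_model, reward_model, prompts):
    """Stage 3: further fine-tune the supervised model with reinforcement
    learning from human feedback, maximizing the learned reward."""
    ...

# Pipeline order: stage 1 -> stage 2 -> stage 3 yields the aligned model.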

              High-Resolution Image Synthesis with Latent Diffusion Models


                Author and article information

Journal: IEEE/CAA Journal of Automatica Sinica (IEEE/CAA J. Autom. Sinica)
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
ISSN: 2329-9266 (print), 2329-9274 (electronic)
Publication date: April 2023
Volume 10, Issue 4, Pages 835-838
Affiliations
[1] The State Key Laboratory for Management and Control of Complex Systems, Chinese Academy of Sciences, Beijing 100190, China
[2] Macao Institute of Systems Engineering, Macau University of Science and Technology, Macao 999078, China
Article
DOI: 10.1109/JAS.2023.123555
© 2023