It was in February of last year that OpenAI published the results of training its unsupervised language model GPT-2. Trained on 40 GB of text (8 million websites), it was able to predict the next word from context. GPT-2, a transformer-based language model built on self-attention, allowed us to generate very convincing and coherent texts. The quality was so good that the main model with 1.5 billion parameters was initially not publicly accessible, to prevent an uncontrolled spread of fake news. Luckily, the complete model was published later and can even be used with Colab Notebooks.
This year OpenAI strikes back with its new language model GPT-3. With 175 billion parameters.
Unnecessary spoiler: it’s incredibly good.
There are already some profound articles on TDS examining the features and the paper of GPT-3:
But what does it look like in action?
OpenAI is building an API, currently accessible via a waiting list:
OpenAI API
An API for accessing new AI models developed by OpenAI
beta.openai.com
Fortunately, I got access and could experiment with GPT-3 directly. Here are some of my initial outcomes.
Interface, Settings, Presets.
The AI Playground interface looks simple, but it bears the power within. First of all, there is a settings dialog that lets you configure the text length, the temperature (from low/boring to standard to chaotic/creative), and other features.
You can also define where the generated text has to start and stop; these control functions have a direct impact on the textual results.
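For API users, these Playground settings map directly onto request parameters. Here is a minimal sketch using the Python client of the beta API (the prompt and parameter values are purely illustrative):

```python
import openai

openai.api_key = "YOUR_API_KEY"  # granted with beta access

response = openai.Completion.create(
    engine="davinci",          # the largest GPT-3 engine in the beta
    prompt="Once upon a time",
    max_tokens=64,             # "response length" in the Playground
    temperature=0.7,           # low = predictable, high = chaotic/creative
    stop=["\n\n"],             # stop sequence: where the generation ends
)
print(response.choices[0].text)
```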
The simple interface also provides some GPT-3 presets. The amazing thing about transformer-driven GPT models is, among other things, their ability to recognize a specific style, text character, or structure. If you begin with lists, GPT-3 continues generating lists. If your prompt has a Q&A structure, it will be kept up coherently. If you ask for a poem, it writes a poem.
You can create your own presets or use the existing ones, which are:
Chat.
A typical setting for a chatbot: you ask, the AI answers. It's also possible to change the "characters" or the setting. As you can see, the chat situation was accomplished perfectly (even if my, the Human's, third question was kind of unfair).
To demonstrate the impact of the context, let's change the AI character from "helpful" and "very friendly" to "brutal, stupid and very unfriendly". You will see how the whole dialogue is influenced:
I think we re-invented Marvin.
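By the way, the character description lives in the prompt itself, so a couple of adjectives are enough to flip the bot's personality. A sketch of the prompt structure, with the wording approximated from the Playground's chat preset:

```python
# Approximate structure of the chat preset's prompt.
friendly_prompt = (
    "The following is a conversation with an AI assistant. "
    "The assistant is helpful, creative, clever, and very friendly.\n\n"
    "Human: Hello, who are you?\n"
    "AI:"
)

# Swap the character description, and the whole dialogue changes its tone.
marvin_prompt = friendly_prompt.replace(
    "helpful, creative, clever, and very friendly",
    "brutal, stupid and very unfriendly",
)
```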
Q&A
This preset consists of a clear dual structure: question and answer. It needs some priming before it starts to answer questions (and picks up the rules), but then it works perfectly. I asked some random questions from various areas, and here you go:
I’d say, perfect!
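The "priming" is nothing more than a few example pairs at the top of the prompt; GPT-3 picks up the rhythm and continues it. A sketch (the example questions are mine, not from the preset):

```python
qa_prompt = (
    "Q: What is the capital of France?\n"
    "A: Paris.\n\n"
    "Q: Who wrote 'Hamlet'?\n"
    "A: William Shakespeare.\n\n"
    "Q: How many moons does Mars have?\n"
    "A:"
)
# GPT-3 keeps the Q/A structure and answers the open question
# (expected: "Two, Phobos and Deimos.").
```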
Parsing unstructured data
This one is fascinating and shows a good comprehension of unstructured text: extracting structured data out of plain full text.
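The whole trick, again, lies in the prompt: the raw text is followed by the beginning of the target structure, and the model completes it. A hypothetical sketch:

```python
extraction_prompt = (
    "Extract the companies and their founding years from the text.\n\n"
    "Text: Apple was founded in 1976 by Steve Jobs and Steve Wozniak. "
    "Microsoft had been started a year earlier by Bill Gates and Paul Allen.\n\n"
    "Company | Founded\n"
    "Apple | 1976\n"
)
# Expected continuation: "Microsoft | 1975". The model infers the table
# structure from a single example row and the year from "a year earlier".
```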
Summarizing for a 2nd grader
This preset shows another level of comprehension: rephrasing difficult concepts and sentences in plain words.
I tried Wittgenstein:
The simple proverb can be paraphrased convincingly:
Or look at this clear and accurate rendering of Sigmund Freud's concept of time distancing:
As you can see, compressing a text and "translating" it coherently is one of the strengths of GPT-3.
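As with the other presets, this is just a framing prompt around your text. A sketch of its structure (wording approximated from the Playground):

```python
passage = "The limits of my language mean the limits of my world."

summarization_prompt = (
    "My second grader asked me what this passage means:\n\n"
    f'"""{passage}"""\n\n'
    "I rephrased it for him, in plain language a second grader "
    'can understand:\n\n"""'
)
```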
What about languages?
GPT-2 was already a great language model when it came to English. You could generate amazing texts, especially with the 1.5-billion-parameter version. I used GPT-2 for the screenplay of this short movie, and its absurdity could rather be understood as standing in the good tradition of David Lynch and Beckett:
The dialogues were logical, even if spontaneous. But that was English. If you tried inputs in other languages, you faced a barrier of understanding. GPT-2 tried to imitate other languages, but you needed to fine-tune it on a text corpus in the specific language to get good results.
GPT-3 is different.
Its processing of other languages is phenomenal.
I tried German, Russian, and Japanese.
German.
It was actually my daughter who tried to let GPT-3 write a fairy tale. She began with "Eine Katze mit Flügeln ging im Park spazieren" ("A cat with wings took a walk in a park").
The story that emerged was astonishingly well written, with irony, vivid characters, and some leitmotifs. This is not just a collection of topoi or loosely connected sentences. This is… a story!
Russian.
I once trained GPT-2 on Pushkin's poetry and got some interesting neologisms, but it was a grammatical mess. Here I entered some lines of a Pushkin poem, and the result I got was… interesting. It had no rhymes, but a stylistically intense power. It was not Pushkin's style, though. But there were almost no mistakes or weird grammar. And… it works as poetry (especially if you are ready to interpret it).
Japanese.
This was something special. I entered just a random sentence:
今日は楽しい一日になりますように！と言いました。// "May today be a fun day!" I said.
And the result was a small story about prayer, happiness, wisdom, and financial investment, in well-written Japanese (using the neutral polite form, like the input).
This means: GPT-3 is ready for multilingual text processing.
Various experiments (and alarming signals).
ShakespAIre and writing poems
My first try was, of course, to have it write a Shakespearean sonnet. So the prompt was just:
here is a poem by Shakespeare
The result was this:
Perfect iambic verse, great style, nice rhymes… if not for one thing:
The first two lines are actually by Alexander Pope, from The Rape of the Lock. And here we have a reason to be cautious: GPT-3 produces unique and unrepeatable texts, but it can also reuse whole quotes from the existing texts it was trained on.
Re-examining the results is inevitable if you want to guarantee the originality of a text.
I wonder if there could be something like the "projection" feature of StyleGAN2, just the other way around: where StyleGAN2 compares an image with its latent space, GPT-3 would compare a generated text with the dataset it was trained on, to prevent accidental plagiarism.
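Until something like that exists, a naive re-examination can be done by hand, for example by checking longer n-grams of the generated text against a reference corpus (or simply against a search engine). A minimal sketch of the idea:

```python
def ngrams(text, n=8):
    """Yield word n-grams of a text as strings."""
    words = text.lower().split()
    for i in range(len(words) - n + 1):
        yield " ".join(words[i:i + n])

def find_reused_ngrams(generated, corpus, n=8):
    """Return n-grams of `generated` that appear verbatim in `corpus`."""
    corpus_lower = corpus.lower()
    return [g for g in ngrams(generated, n) if g in corpus_lower]

# Any hit on a long n-gram is a candidate quote worth checking manually.
```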
But the thing is: GPT-3 can write poems on demand, in particular styles.
Here is another example:
Essays
As I didn't have access myself at that point, I asked a friend to let GPT-3 write an essay on Kurt Schwitters, a German artist and Dadaist:
The outcome: GPT-3 already has a rich store of knowledge that can be recalled. It is not always reliable (you have to fine-tune it for a perfect factual match), but it's still very close to the discourse.
Coding with GPT-3
Another mind-blowing possibility is using GPT-3 in cases quite different from mere text generation:
For example, you can get support with CSS:
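The pattern is the same as with the text presets: describe what you want and give the beginning of the answer. A hypothetical prompt:

```python
css_prompt = (
    "Description: a centered button with rounded corners "
    "and a blue background.\n"
    "CSS:\n"
    ".button {\n"
)
# GPT-3 continues with plausible CSS declarations, e.g.
# "background-color: blue; border-radius: 8px; margin: 0 auto;" and "}".
```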
And calling it General Intelligence is already a thing:
Summary.
We are still at the beginning, but the experiments with GPT-3 by the AI community show its power, potential, and impact. We just have to use it with reason and good intentions. But that's the human factor. Which is not always the best one.
For more wonderful text experiments, I highly recommend reading Gwern:
Let the journey continue!