What is a GAN?

栏目: IT技术 · 发布时间: 4年前

内容简介:It’s around five o’clock and you’ve just finished your homework.‘I’m done!’

How a weird idea became the foundation of cutting-edge AI

What is a GAN? T ake any course on machine learning and you’ll invariably encounter Generative Adversarial Networks, or GANs. Understanding them means mastering the surprising power of playing a computer out against itself.

It’s around five o’clock and you’ve just finished your homework.

‘I’m done!’

‘Great! Would you like to play a game?’

‘Yeah! Today we’re playing Monopoly.’

‘Ugh, Monopoly! To be honest, I hate it when one of the players gets all the properties and it becomes impossible for the other one to win.’

‘Come on! Please!’

‘All right then.’

Half an hour later, your and my properties are still quite evenly distributed.

‘See? Monopoly isn’t that bad.’

‘You’re right. I said earlier that because I’m having a problem with an algorithm that I’m building. Somehow one player is notoriously better than the other. I’m having a hard time to fix that.’

What is a GAN?

Image made with a neural network on dreamgenerator.com — the original picture being the middle one right at the top.

‘Huh? How can there be a player in an algorithm?’

‘When you’re dealing with GANs, you have players.’

‘What? What are GANs?’

‘Let me explain. Do you know what a neural network is?’

‘Ermm, no.’

‘A neural network is basically an algorithm that was made to recognize patterns. Facial recognition, for example, relies heavily on neural networks.

‘You give it a heap of unordered data — let’s say pictures of faces — and with time it learns what a human face looks like. This we call training.

‘Once we’ve trained the neural network, we can show it a random image, and it will decide whether it’s a human face or not — with impressive accuracy.’

‘Okay, so a neural network can tell you whether a face is a face. Now what’s a GAN?’

‘A GAN is when we pit two neural networks against one another. Like in a game.’

‘You let the computer play against itself? How stupid is that?’

‘It’s not stupid. It’s genius.’

‘You’ll have to explain that to me…’

‘The first neural network is called the Generator. It generates fake data points and passes them to its opponent. That’s the second network, the Discriminator. Its job is to tell which data point is real and which is fake.

‘First, we train the Discriminator on a set of real and fake data points. The discriminator guesses what is real and what is fake. After that, we unveil the real solution to it. Based on the feedback, the discriminator learns what is fake and what is not.’

‘Kind of like in Police Academy when the students have to tell apart real from fake cash?’

‘Exactly. But now the Generator comes into play. He’s the money forger in our game. We give him real and fake data points to train with. Based on these points, its job is to make new data points which fool the Discriminator.’

‘But wouldn’t it be much better to give the Generator only real data points? Because if you train it on fakes, then it will repeat the mistakes from those. If you let me forge a 300-dollar-bill, the police will catch it rather quickly. ’

What is a GAN?

Image made with a GAN on dreamgenerator.com — the original picture being the above in the middle.

‘You’re right! But we don’t want the Generator to get too good too fast. Otherwise the Discriminator can’t keep up.’

‘Oh, that’s the Monopoly problem? Where one player gets too much property in Monopoly? And then it gets boring because the other player can’t win anyway?’

‘Precisely. We want to mislead the Generator from time to time. So that the Discriminator has a chance to keep up.’

‘Okay, so let’s say the Generator has made fake data. Now it will fool the Discriminator, right?’

‘Well, it’s gonna try. The Discriminator will hopefully do a reasonable job at differentiating the real data points from the fake ones. It then informs the Generator about its decisions.’

‘Like when the police will tell a criminal whether he’s convicted or not?’

‘Yep. And based on this decision, the Generator will set to work again. It’ll make new fakes, but based on the feedback it will try to make them better so that they fool the Discriminator.

‘Then it’s the Discriminator’s turn again, and so on and so forth. And at the end of the day, we’ve used a GAN to make a heap of new data points!’

‘Okay cool. But why is that useful?’

‘You can use it for all kinds of things. For example, you can make beautiful new pictures, based on others.

‘You’ve probably heard of deep-fakes. These are pictures that are made with GANs. And have you heard about the apps with which you can age people’s faces?’

‘Pah. Gimmicks!’

‘Oh, they might be useful! For example to catch criminals many years later.

‘GANs are also used to make good descriptions of pictures. This is extremely useful for search engines, but also for visually impaired people. You’re basically teaching a computer how to see, and how to explain what it’s seeing!

‘And we haven’t reached the end. There are so many applications that are yet to be explored.’

‘If you say so… But you said earlier that you’re having a problem with your GAN.’

‘True. It’s the Monopoly problem — even though I give my Generator evough fakes to train on, it gets too good too fast. So the discriminator keeps giving up…’

‘Oh man!’

‘It’s quite a common issue with GANs. Research is still ongoing on how to balance the system out better.’

‘Okay, then I’ll let you get on with your problem. I’ve won this round of Monopoly anyway.’

‘Wait, what? I must have gotten rather distracted by thinking about my GANs… Damn, this round goes to you!’

‘Yippie! I’ll let you work on your GAN Monopoly now.’

‘Thank you. But wait for it — tomorrow’s my round of revenge!’


以上就是本文的全部内容,希望本文的内容对大家的学习或者工作能带来一定的帮助,也希望大家多多支持 码农网

查看所有标签

猜你喜欢:

本站部分资源来源于网络,本站转载出于传递更多信息之目的,版权归原作者或者来源机构所有,如转载稿涉及版权问题,请联系我们

格蠹汇编

格蠹汇编

张银奎 / 电子工业出版社 / 2013-3-1 / 66.00元

《格蠹汇编——软件调试案例集锦》以案例形式讨论了使用调试技术解决复杂软件问题的工具和方法。全书共36章,分为四篇。前两篇每章讲述一个有代表性的真实案例,包括从堆里抢救丢失的博客,修复因误杀而瘫痪的系统,徒手战木马,拯救“发疯”的windows7,经典阅读器的经典死锁,拯救挂死的powerpoint,转储分析之双误谜团,是谁动了我的句柄,寻找系统中的“耗电大王”,解救即将被断网的系统,转储分析之系统......一起来看看 《格蠹汇编》 这本书的介绍吧!

XML 在线格式化
XML 在线格式化

在线 XML 格式化压缩工具

RGB CMYK 转换工具
RGB CMYK 转换工具

RGB CMYK 互转工具

HEX HSV 转换工具
HEX HSV 转换工具

HEX HSV 互换工具