Disappointing AI: Using AWS for OCR & Celebrity Recognition

Category: IT Technology · Published: 5 years ago

Or when AWS mistook The Witcher for a Serbian footballer.

Photo by Rock'n Roll Monkey on Unsplash

According to Forbes, 83% of enterprise workloads will be in the cloud by 2020, including 41% on public platforms such as AWS, Google Cloud Platform or Microsoft Azure. During one of my recent projects as a data scientist, I had to start getting used to cloud computing, storage and deployment. Another good step in this direction might be to start experimenting with cloud services such as image recognition, character recognition and speech recognition, if only to know the performance they offer.

Testing OCR services during a Hackathon

Testing cloud services offering Optical Character Recognition (OCR) was the topic of a hackathon I attended a few weeks back at the company I currently work for. It gave me the chance to test the Artificial Intelligence Services of AWS, focusing primarily on OCR (Textract, Rekognition), but also on fun services such as celebrity recognition (Rekognition). Indeed, AWS Rekognition is supposed to be quite a complete service that can detect objects, recognize faces and detect text. The reality is slightly different.
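For readers who have not used these services, here is a minimal sketch of what calling Rekognition from Python could look like. It is not the code from the hackathon. It assumes the boto3 SDK is installed and AWS credentials are configured; the region, the 80% confidence threshold and the helper names (extract_lines, extract_celebrities, analyze_image) are illustrative choices of mine. The two parsing helpers work on plain response dictionaries, so they can be tried without an AWS account.

```python
from typing import Any, Dict, List, Tuple


def extract_lines(response: Dict[str, Any],
                  min_confidence: float = 80.0) -> List[Tuple[str, float]]:
    """Keep LINE-level detections from a Rekognition DetectText response.

    The 80.0 threshold is an arbitrary assumption, not an AWS default.
    """
    return [
        (d["DetectedText"], d["Confidence"])
        for d in response.get("TextDetections", [])
        if d["Type"] == "LINE" and d["Confidence"] >= min_confidence
    ]


def extract_celebrities(response: Dict[str, Any]) -> List[Tuple[str, float]]:
    """Return (name, match confidence) pairs from a RecognizeCelebrities response."""
    return [
        (c["Name"], c["MatchConfidence"])
        for c in response.get("CelebrityFaces", [])
    ]


def analyze_image(path: str, region: str = "eu-west-1"):
    """Send one local image to Rekognition for text and celebrity detection.

    Requires boto3 and valid AWS credentials; the region is an assumption.
    """
    import boto3  # imported lazily so the helpers above work without the SDK

    client = boto3.client("rekognition", region_name=region)
    with open(path, "rb") as f:
        image = {"Bytes": f.read()}

    text_response = client.detect_text(Image=image)
    celebrity_response = client.recognize_celebrities(Image=image)
    return extract_lines(text_response), extract_celebrities(celebrity_response)
```

Calling analyze_image("poster.jpg") (a hypothetical file name) would return the detected lines of text and the recognized celebrities, each paired with the confidence score reported by the service. Looking at those scores is a simple way to see where the service is unsure of itself.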

A tiny reminder about OCR

OCR is the ability of a system to convert typed, handwritten or printed text into machine-encoded text, whether it comes from a scanned document, a photo of a document, a sign or any other text displayed in a photo. It is currently a hot topic because many “old” institutions such as government agencies, banks and insurance companies aim to digitize all their documents, such as printed contracts that might include both printed and handwritten characters. In general, OCR can be split into two groups:

  • HCR: Handwritten Character Recognition
  • PCR: Printed Character Recognition

HCR tends to be a lot more challenging than PCR for an obvious reason: most people have horribly confusing handwriting. The difference also showed up in the results we obtained.

