Disappointing AI: Using AWS for OCR & Celebrity Recognition

栏目: IT技术 · 发布时间: 5年前

内容简介:According toTesting cloud services offering Optical Character Recognition was the topic of a hackathon I attended a few weeks back, at the company I currently work in. It gave me the chance to test theOCR represents the ability of a system to detect typed,

Or when AWS mistook The Witcher for a Serbian footballer.

Disappointing AI: Using AWS for OCR & Celebrity Recognition

Photo by Rock'n Roll Monkey on Unsplash

According to Forbes , 83% of enterprise workloads will be in The Cloud by 2020, including 41% on public platforms such as AWS, Google Cloud Platform or Microsoft Azure. During one of my recent project as a data scientist, I had to start getting used to cloud computing, storage and deployment. Another good step in this direction might be to start experimenting with cloud services such as image recognition, character recognition and speech recognition, if not only to know the performance they offer.

Testing OCR services during a Hackathon

Testing cloud services offering Optical Character Recognition was the topic of a hackathon I attended a few weeks back, at the company I currently work in. It gave me the chance to test the Artificial Intelligence Services of AWS , focusing primarily on OCR ( Textract, Rekognition ), but also fun services such as celebrity recognition ( Rekognition ). Indeed, AWS Rekognition is supposed to be quite a complete service that can detect objects, recognize faces and detect text. The reality is slightly different.

A tiny reminder about OCR

OCR represents the ability of a system to detect typed, handwritten or printed text into machine-encoded text, whether it is from a scanned document, a photo of a document, a sign or other type of text displayed in a photo. It is currently a hot topic as many “old” institutions such as governmental institutions, banks or insurances aim at digitizing all documents, such as printed contracts that might include both printed and hand-written characters. In general, OCR can be separated in two groups:

  • HCR : Hand Written Character Recognition
  • PCR : Printed Character Recognition

HCR tends to be a lot more challenging than PCR for an obvious reason: most people have a horribly confusing handwriting. The difference is also noticeable in the results we experienced.


以上就是本文的全部内容,希望本文的内容对大家的学习或者工作能带来一定的帮助,也希望大家多多支持 码农网

查看所有标签

猜你喜欢:

本站部分资源来源于网络,本站转载出于传递更多信息之目的,版权归原作者或者来源机构所有,如转载稿涉及版权问题,请联系我们

Tomcat架构解析

Tomcat架构解析

刘光瑞 / 人民邮电出版社 / 2017-5 / 79.00元

本书全面介绍了Tomcat的架构、各组件的实现方案以及使用方式。包括Tomcat的基础组件架构以及工作原理,Tomcat各组件的实现方案、使用方式以及详细配置说明,Tomcat与Web服务器集成以及性能优化,Tomcat部分扩展特性介绍等。读者可以了解应用服务器的架构以及工作原理,学习Tomcat的使用、优化以及详细配置。一起来看看 《Tomcat架构解析》 这本书的介绍吧!

图片转BASE64编码
图片转BASE64编码

在线图片转Base64编码工具

Base64 编码/解码
Base64 编码/解码

Base64 编码/解码

UNIX 时间戳转换
UNIX 时间戳转换

UNIX 时间戳转换