Why you need to treat AI models like data

Category: IT Technology · Published: 4 years ago



Why treating models like data is a very strategic approach

Photo by Alexander Sinn on Unsplash

Here is a very abstract question: what does an AI or data science model look like? We all use data science models in our day-to-day lives. Most people who aren't data scientists have experienced a data science model but have never seen one. So, let me reveal the secret. It may look scary. Here is what a data science model looks like:

It is a mathematical formula encoded as alphanumeric characters. But make no mistake, this strange-looking thing is the secret sauce for making your enterprise successful and blowing away the competition. It can help you run your business operations with cutting-edge advanced analytics. Diverse business cases such as product recommendation to increase revenue, fraud detection to prevent revenue loss, and asset failure prediction to safeguard asset value all have predictive models behind them.

Because models are so crucial in creating business value, we need to handle them with care. Let us look at the different ways these models can be handled.

The worst care possible: the model left on a laptop

The worst type of care is leaving these models on a laptop, usually the one where they were originally created. Imagine treating your enterprise's secret sauce like a person abandoned on an island. The situation is somewhat comparable.

Photo by John Soo on Unsplash

Unfortunately, this happens a lot. Models created by data scientists using analytic tools on a laptop or PC remain there. A large amount of effort and brain power went into creating them, and they contain elements critical to your enterprise's success. However, they stay on a local machine and are never operationalised, which is the worst thing that can happen to such beautiful pieces of data science work.

Getting Better — Putting models in containers

A better approach is to put models in Docker containers. This takes you one step closer to treating the model in a better and more justified way. Models in containers are secured and isolated, and they are also easier to operationalise.

Photo by William Felker on Unsplash

Though the model is in a safe container, it is still isolated. This means that if you want to use the model, you need to send data to the Docker container and use an API to get the results back. Data movement increases, which may not be desirable for all business operations.

Strategic Approach — Treating Models like data

In recent times, data has become a valuable asset for any company. Many advances in technology have been about managing data as a valuable asset. For example, data warehousing and big data storage platforms all revolve around keeping data safe, managed, and easily available to benefit the business.

So if we start thinking of models as data, we can take all the benefits of data management and apply them to models. By treating models like data, we ensure that models become as strategic to business operations as the data itself.

Here are some reasons why treating models as data is an interesting proposition.

Models are made from data

Models are not created from thin air or with a magic wand. They are created by applying an algorithm to data. You can think of a model as a mathematical projection of the data. So it makes sense to consider models as part of the data.

Model results need data to make sense of them

Say your model alerts you to a critical asset failure in the coming days. Before any action can be taken, you need to know more about this asset, such as its location and its value. You will also need an assessment of whether it makes sense to carry out an urgent repair or to take the risk of waiting until the next scheduled maintenance is due.

As you will have realised by now, the output of the model was just an alert trigger. Real action still needs to be taken, and converting model output into something tangible requires data about the asset in question. So if your model is part of the data, stored in the same system as your data tables, you can easily integrate the output of models with other data. This makes sense of the model output and makes it more actionable.
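To make this concrete, here is a minimal Python sketch using the standard-library sqlite3 module. The table names (assets, failure_alerts), the columns, the sample rows, and the 0.8 threshold are all made up for illustration. The only assumption is that model output lands in a table next to the asset data, so a single join can turn an alert into something actionable.

```python
import sqlite3

# In-memory database standing in for the data platform (illustrative only).
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE assets (
        asset_id TEXT PRIMARY KEY,
        location TEXT,
        value_eur REAL,
        next_maintenance TEXT
    );
    CREATE TABLE failure_alerts (
        asset_id TEXT,
        failure_probability REAL
    );
""")

# Hypothetical master data about the assets.
conn.executemany("INSERT INTO assets VALUES (?, ?, ?, ?)", [
    ("pump-17", "Plant A", 120000.0, "2030-06-01"),
    ("fan-03", "Plant B", 8000.0, "2030-02-15"),
])

# Hypothetical model output, written to a table like any other data.
conn.executemany("INSERT INTO failure_alerts VALUES (?, ?)", [
    ("pump-17", 0.91),
    ("fan-03", 0.35),
])

def actionable_alerts(conn, threshold=0.8):
    """Join model output with asset data for alerts at or above the threshold."""
    return conn.execute("""
        SELECT a.asset_id, a.location, a.value_eur,
               a.next_maintenance, f.failure_probability
        FROM failure_alerts AS f
        JOIN assets AS a ON a.asset_id = f.asset_id
        WHERE f.failure_probability >= ?
        ORDER BY f.failure_probability DESC
    """, (threshold,)).fetchall()

alerts = actionable_alerts(conn)
for row in alerts:
    print(row)
```

Each returned row carries the location, value, and next maintenance date alongside the prediction, which is the information needed to decide between an urgent repair and waiting.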

Managing millions of models

In the book “Prediction Machines” (predictionmachines.ai), the authors write that AI predictions are becoming cheaper, which means we will use more of them. This also means that there will be more and more models.


Use cases where millions of models are required are not science fiction. Accurate retail stock forecasting requires a model for each product in each store. Fraud detection requires modelling normal customer behaviour in order to detect any deviation from it. Because normal behaviour for customer X may differ from normal behaviour for customer Y, you need as many models as customers.

With enterprises managing millions of products and millions of customers, suddenly the need to have millions of models becomes inevitable.

In such a scenario, it is better to treat models like data and apply all big data management principles to models as well.
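As a rough illustration of one model per customer, the Python sketch below stores each customer's model as a single row in a table. The "model" here is deliberately trivial, a mean and standard deviation of past transaction amounts with a three-sigma rule, and it stands in for whatever real fraud model you would use. The table name customer_models, the sample history, and the function is_unusual are assumptions made for this example.

```python
import sqlite3
import statistics

# Hypothetical transaction history per customer.
history = {
    "customer_x": [20.0, 25.0, 22.0, 18.0, 24.0],
    "customer_y": [900.0, 1100.0, 1000.0, 950.0, 1050.0],
}

conn = sqlite3.connect(":memory:")
conn.execute("""
    CREATE TABLE customer_models (
        customer_id TEXT PRIMARY KEY,
        mean_amount REAL,
        std_amount REAL
    )
""")

# "Train" one tiny model per customer and store its parameters as a row.
conn.executemany(
    "INSERT INTO customer_models VALUES (?, ?, ?)",
    [(cid, statistics.mean(amounts), statistics.stdev(amounts))
     for cid, amounts in history.items()],
)

def is_unusual(conn, customer_id, amount, max_sigma=3.0):
    """Flag an amount that deviates from this customer's own normal behaviour."""
    row = conn.execute(
        "SELECT mean_amount, std_amount FROM customer_models WHERE customer_id = ?",
        (customer_id,),
    ).fetchone()
    if row is None:
        raise KeyError(f"no model stored for {customer_id}")
    mean_amount, std_amount = row
    return abs(amount - mean_amount) > max_sigma * std_amount

# The same amount is unusual for one customer and normal for another.
print(is_unusual(conn, "customer_x", 1000.0))
print(is_unusual(conn, "customer_y", 1000.0))
```

Because each model is just a keyed row, going from two customers to millions is a matter of table size and indexing, which is exactly the kind of problem data platforms are already built to handle.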

Models are the Intellectual Property of your enterprise — keep them safe

Models are made from data, and they encode how your enterprise works. For example, a fraud detection model encodes how you intend to detect fraud. It is the intellectual property of your company and should therefore be managed and kept safe.

Imagine the fraud detection model is stolen and decoded or, even worse, the decoded model is put on the internet so that everyone knows how you detect fraud. Suddenly you are left vulnerable to fraud attacks.

However, managing models like data, and applying all the security principles of data to models as well, will help keep your intellectual property safer.

Managing the economics of your model

There is a cost to develop a model, and there is a cost to manage your models and keep them operational. If you invest in specialised systems to manage the models, you increase the cost of each model. So you need to think carefully about the total cost of creating and managing a model.

Good models come from good, integrated data, so if you have some good models, you already have a data management platform. If you use that platform to manage your models as well, you keep the overall cost of the models low. In the long run, this helps keep your models economical and profitable.

Now that you have seen why it makes sense to manage models like data, let me briefly describe what goes into it. These are some of the building blocks you need if you would like to treat models as data:

Model repository: This is the place where your models are stored as data. Generally, it is a table with specialised fields that hold the encoded model definition.

Model metadata: Models look strange and are hard for humans to read, so you need metadata that describes what each model is about. It holds information such as the purpose of the model, the kind of algorithm it uses, and its accuracy.

Model lineage: As with data, you also need to know how the model was built and how it is used. You need to capture information about the data that went into building the model. This is very useful in traceability or audit situations.

Design patterns for bringing external models inside the database: Models are like data, but most of them originate outside a data management platform. To manage models like data, you need to bring them inside the database. This requires design patterns that describe the different ways an external model can be brought inside the database.
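The building blocks above can be sketched in a few lines of Python with the standard-library sqlite3 and pickle modules. This is only one possible layout, not a reference design: the three table names, their columns, the helper functions register_model and load_model, and the example model (a plain dictionary of coefficients) are all hypothetical. Pickle is used simply as one way to turn an externally built model into bytes that fit in a table field, and it should only ever be loaded from sources you trust.

```python
import pickle
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    -- Repository: the serialised model definition itself.
    CREATE TABLE model_repository (
        model_id INTEGER PRIMARY KEY,
        name TEXT NOT NULL,
        version INTEGER NOT NULL,
        definition BLOB NOT NULL,
        UNIQUE (name, version)
    );
    -- Metadata: a human-readable description of the model.
    CREATE TABLE model_metadata (
        model_id INTEGER PRIMARY KEY REFERENCES model_repository(model_id),
        purpose TEXT,
        algorithm TEXT,
        accuracy REAL
    );
    -- Lineage: which data went into building the model.
    CREATE TABLE model_lineage (
        model_id INTEGER REFERENCES model_repository(model_id),
        source_table TEXT,
        row_count INTEGER
    );
""")

def register_model(conn, name, version, model, purpose, algorithm, accuracy, sources):
    """Bring an externally built model into the database in one transaction."""
    with conn:  # commits on success, rolls back on error
        cur = conn.execute(
            "INSERT INTO model_repository (name, version, definition) VALUES (?, ?, ?)",
            (name, version, pickle.dumps(model)),
        )
        model_id = cur.lastrowid
        conn.execute(
            "INSERT INTO model_metadata VALUES (?, ?, ?, ?)",
            (model_id, purpose, algorithm, accuracy),
        )
        conn.executemany(
            "INSERT INTO model_lineage VALUES (?, ?, ?)",
            [(model_id, table, rows) for table, rows in sources],
        )
    return model_id

def load_model(conn, name, version):
    """Read a stored model back; only unpickle data from a trusted repository."""
    row = conn.execute(
        "SELECT definition FROM model_repository WHERE name = ? AND version = ?",
        (name, version),
    ).fetchone()
    if row is None:
        raise KeyError(f"{name} v{version} is not in the repository")
    return pickle.loads(row[0])

# Hypothetical model built elsewhere, for example on a data scientist's laptop.
model = {"intercept": -2.0, "coefficients": {"vibration": 0.8, "age_years": 0.3}}
model_id = register_model(
    conn, "asset_failure", 1, model,
    purpose="Predict asset failure in the coming days",
    algorithm="logistic regression",
    accuracy=0.87,
    sources=[("sensor_readings", 250000), ("maintenance_log", 4200)],
)
restored = load_model(conn, "asset_failure", 1)
```

Keeping the definition, metadata, and lineage in ordinary tables means the same backup, access control, and audit tooling that protects your data also covers your models.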

In conclusion, if you treat models like data, they will be managed like the valuable assets that they are.

