7 Reasons To Not Hire a Data Scientist

栏目: IT技术 · 发布时间: 4年前

内容简介:I think the title is pretty clear, so let’s get straight to it.Before even thinking about hiring a data scientist, you should step back and consider your data.A data scientist’s job is to create value from data. If you are unsure whether you even have data

Because who needs data?

7 Reasons To Not Hire a Data Scientist

Photo by Nick Coleman on Unsplash

I think the title is pretty clear, so let’s get straight to it.

#1: You don’t have any data

Before even thinking about hiring a data scientist, you should step back and consider your data.

A data scientist’s job is to create value from data. If you are unsure whether you even have data, that is a very good sign that you’re not ready for a data scientist.

If you know you have data, but really have no idea how to access it, it’s reliability or any of the specifics, then you should first answer those questions.

You will get significantly more value from a data scientist hire if your company has a strong grasp on its data assets. Your understanding doesn’t have to be perfect, but you should be able to point a data scientist to some data with documentation.

The worst feeling for a new data scientist is to realize he or she just joined a company that actually has no grasp of their data.

#2: You don’t have the right data

I know what you’re thinking. You read #1 and laughed — who would hire a data scientist with no data.

Well, the next sign you are not ready is you have data, but not the right data.

The right data are data that address the problem you want to solve and hopefully are labeled. By labeled, I mean do you have data that also have the truth associated with the data?

For example, if you want a data scientist to come in and build a system to detect fraudulent activity on your site, you would want activity data from your site and know for some set of your data which activity was fraudulent and which wasn’t. Knowing which data points are fraudulent would be considered the label.

Now — you can solve problems without labels, but if this is your first foray into data science, I would strongly suggest starting with data that have labels.

If your data don’t have labels yet, invest some time and money to have people label your data or build a system that can do it automatically.

#3 You don’t have a clear problem to solve

Going back a bit to having the right data, to know if you have the right data, you also have to have a clear problem to solve.

I can’t stress enough how important this is.

Sure — it is possible that you hire a great data scientist and he or she comes in with an ambiguous problem and makes some magic happen. Don’t plan on this.

Maximize your chances of success by knowing exactly what problem you want to solve and how you will evaluate success. It is also helpful if the problem is generally solvable by humans without much issue. If that is the case, it’s a good sign you could also solve it using data science.

A good example might be that you would like to detect whether a comment on your site is inappropriate with at least 70 percent accuracy.

#4: What you actually need is an analyst

If you’ve gotten to this point, then hopefully you have the right data and a clear problem.

The next biggest issue I see is that a company thinks it wants a data scientist, but it actually wants an analyst. A data scientist can usually do the work of an analyst, but if what you need is analytics you are much better off hiring an analyst.

Generally, the difference lies in whether you are trying to predict new events or better understand historical events.

As an example, if you want someone to come in and aggregate your historical sales data into a pretty dashboard with some summary statistics, then you’re looking for an analyst.

Analysts can be incredibly valuable to a company. In fact, they can often be more valuable than data scientists because usually, the problems you want them to solve are more clear and have a lower risk.

#5: You’re not prepared for the true cost of a data scientist

A little known fact about data scientists is they are needy. Once you hire one they are going to want more data, more storage, and more compute. Before you know it, they will be convincing you that a $100,000 computer is absolutely vital to the success of your data science initiatives.

And they may not be wrong.

So — before making a data scientist hire, please consider the true cost of a data scientist. Your IT/infrastructure costs will surely go up.

Also, a single data scientist probably won’t be enough to generate significant value. Adding more engineers and data scientists to the team might be necessary to move at an acceptable speed.

#6: You’re expecting unicorns and rainbows

Data science projects are usually riskier than your average project. Often, it is unknown whether the problem is even solvable. You might end up hiring a data scientist and investing in a problem that isn’t easy to solve.

You need to be comfortable with iterations on failure and less strict timelines to make data science projects effective.

That isn’t to say you shouldn’t expect real value from your data science team, but you should expect that path to be less linear. If your company isn’t ready for that, then I would hold off.

#7: You don’t know how to hire a data scientist

Lastly, you shouldn’t be hiring a data scientist if you don’t know how to do so.

Data science has come to represent many different types of jobs, which makes it very hard to know what type of data scientist you’re getting without being knowledgable of the field.

For example, maybe your getting the true academic data scientist with multiple PhDs, but is pretty poor at coding. Or you could be getting a data scientist who is more like an engineer with a few online courses in data science.

Neither of these is bad, but depending on your needs, it could be the wrong hire for your company. So, before making your data science hire, make sure you feel comfortable being able to actually identify a good hire.


以上就是本文的全部内容,希望对大家的学习有所帮助,也希望大家多多支持 码农网

查看所有标签

猜你喜欢:

本站部分资源来源于网络,本站转载出于传递更多信息之目的,版权归原作者或者来源机构所有,如转载稿涉及版权问题,请联系我们

工程问题C++语言求解

工程问题C++语言求解

Delores M.Etter、Jeanine A.Ingber / 冯力、周凯 / 机械工业出版社 / 2014-8 / 79元

本书介绍了如何利用ANSIC++编程语言以基于对象的编程方式来解决工程问题。书中引用了大量来自于不同工程、科学和计算机科学领域的示例,是一本理论和实践结合紧密的教材。针对C++基本语法的各个部分,由浅入深地进行讲解。每讲解一部分基础知识,同时会结合多个相关实例,实例内容详实,紧贴所讲内容,使读者能够立刻对所学知识进行练习,实战性强。一起来看看 《工程问题C++语言求解》 这本书的介绍吧!

HTML 压缩/解压工具
HTML 压缩/解压工具

在线压缩/解压 HTML 代码

Base64 编码/解码
Base64 编码/解码

Base64 编码/解码

XML、JSON 在线转换
XML、JSON 在线转换

在线XML、JSON转换工具