�ݺ�ߣ

데이터 읽는 남자
6번째 숵ӄ�

김영웅
꿈꾸는 데이터 디자이너 매니저
M.I.S 전공 박사과정
MBA (통신미디어트랙)
컴퓨터공학 전공
Ryan Kim | Convergence Business Designer
Facebook. https://www.facebook.com/keyassist
E-mail. youngwung.kim@gmail.com
Web. http://keyassist.tistory.com
데이터.읽는.남자

[week6] 데이터읽어주는남자

<source=http://www.i-scoop.eu/can-survive-age-integration-collaboration>

Those strong in statistics: they sometimes develop new statistical theories for big data, that even traditional statisticians are
not aware of. They are expert in statistical modeling, experimental design, sampling, clustering, data reduction, confidence
intervals, testing, modeling, predictive modeling and other related techniques.
Those strong in mathematics: NSA (national security agency) or defense/military people working on big data, astronomers,
and operations research people doing analytic business optimization (inventory management and forecasting, pricing
optimization, supply chain, quality control, yield optimization) as they collect, analyse and extract value out of data.
Those strong in data engineering, Hadoop, database/memory/file systems optimization and architecture, API's, Analytics as
a Service, optimization of data flows, data plumbing.
Those strong in machine learning / computer science (algorithms, computational complexity)
Those strong in business, ROI optimization, decision sciences, involved in some of the tasks traditionally performed by
business analysts in bigger companies (dashboards design, metric mix selection and metric definitions, ROI optimization, high-
level database design)
Those strong in production code development, software engineering (they know a few programming languages)
Those strong in visualization
Those strong in GIS, spatial data, data modeled by graphs, graph databases

Those strong in a few of the above. After 20 years of experience across many industries, big and small companies (and lots of
training), I'm strong both in stats, machine learning, business, mathematics and more than just familiar with visualization and
data engineering. This could happen to you as well over time, as you build experience. I mention this because so many people
still think that it is not possible to develop a strong knowledge base across multiple domains that are traditionally perceived as
separated (the silo mentality). Indeed, that's the very reason why data science was created.
Most of them are familiar or expert in big data.
There are other ways to categorize data scientists, see for instance our article on Taxonomy of data scientists. A different
categorization would be creative versus mundane. The "creative" category has a better future, as mundane can be outsourced
(anything published in textbooks or on the web can be automated or outsourced - job security is based on how much you know
that no one else know or can easily learn). Along the same lines, we have science users (those using science, that is,
practitioners; often they do not have a PhD), innovators (those creating new science, called researchers), and hybrids. Most
data scientists, like geologists helping predict earthquakes, or chemists designing new molecules for big pharma, are scientists,
and they belong to the user category.
Implications for other IT professionals
You (engineer, business analyst) probably do already a bit of data science work, and know already some of the stuff that some
data scientists do. It might be easier than you think to become a data scientist. Check out our book (listed below in "related
articles"), to find out what you already know, what you need to learn, to broaden your career prospects.
Are data scientists a threat to your job/career? Again, check our book (listed below) to find out what data scientists do, if the
risk for you is serious (you = the business analyst, data engineer or statistician; risk = being replaced by
a data scientist who does everything) and find out how to mitigate the risk (learn some of the data scientist skills from our
book, if you perceive data scientists as competitors)

<source=http://a16z.com/2014/01/14/the-four-stages-of-disruption/>

이 책은 데이터 과학에 대해 입문하려는 사람에게 적절한 책입니다. 단순하게 분석사례를 서술
한 기술서가 아니라 분석의 필요성, 의미, 그것을 위한 기반지식까지 상당히 쉽게 안내하고 있습
니다. 빅데이터라는 단어와 함께 필요이상으로 남달되는 '데이터 분석' 분야에 대해 체계적으로
설명하고 있으니, 이 카테고리에 관심이 있으시다면 이 책을 읽고 나서 전체적인 감을 잡으실 수
있습니다. 책 자체가 그리 두꺼운 편이 아니고 내용도 쉽게 기술되어 있기 때문에 전문을 읽는데
그리 오랜 시간이 걸리지는 않습니다.
<source=http://keyassist.tistory.com/150>

데이터 분석의 가장 큰 효과는

'의사 결정 프로세스의 최적화'

-'데이터 분석을 시작하려는 사람을 위한 책', p.15

복잡한 수리 모델을 구축하는 것도 대규모 데이터를 빠르게 처리하는
것도 아니다. 단순하면서도 운용하기 쉽게, 한정된 경영 자원이나 정책
자원을 효율적으로 사용할 수 있도록 의사 결정을 지원하는 것. 바로 이
것이 데이터 분석의 진정한 가치라고 할 수 있다

데이터 분석이 성공하기 위한 다섯 가지 요소
1) 분석 대상의 데이터와 분석 기반이 있다

2) 목적과 경영 과제가 명확하다

3) 첨단 기술을 보유하고 있으며 인식 능력이 있는 실행팀

4) 경영층의 지원

5) 열정과 창의력이 있는 인재

요컨대 목적에 따라 통계나 분석 수법을 올바르게 실천해 결과를 이끌
어내는 것이 중요하며, 이를 위해 분석 담당자는 단순히 분석 자체에 재
미를 느끼는 것으로 끝내서는 안 된다. 협업 필터링 같은 일부를 제외하
면 통계나 기계 학습 자체에는 의사 결정을 완전 자동화하는 힘이 없다.
어디까지나 의사 결정의 최적화를 지원할 뿐이다. 그런데 이 본질을 이
해하지 못하는 사람이 많은 것이 현실이다.

위기의식을 항상 느끼는 데이터 과학자는 데이터의 사전 처리와 분석
수법 선정, 유의성 검정 등 화려하지 않은 작업의 중요성을 잘 알고 있
다. 그러나 경험이 일천한 분석자 중에는 분석 대상에 관심을 보이기보
다 최첨단 분석 수법이나 도구에 정신이 팔려 이런 과정을 생략하고 숫
자와 씨름하는 데만 열중하는 사람도 있다.

데이터 분석 프로젝트를 성공시킬 때 중요한 것은 분석 자체가 아니라
분석을 통해 얻은 결과를 현장의 프로세스에 반영하는 것이다. 그러나
분석 결과를 과신하다 현장에 예측 모델을 도입하는 단계에서 큰 실패
를 맛보는 경우가 있다. 올바른 프로세스를 통해 분석한 결과는 높은 정
확도로 현실을 말해 준다. 그러나 단순히 그 결과를 보여주기만 해서는
사람, 즉 조직은 움직이지 않는다.

"Efforts and courage are not enough
without purpose and direction."
노력이나 용기 있는 한 발을 내디뎠다고 해도

목적이나 착지점에 대한 방향성이 없다면

그 노력이나 행동의 결과는

충분하지 못한 것이 될 것이다.

John F. Kennedy

�ݺ�ߣ

[week6] 데이터읽어주는남자

More Related Content

What's hot (9)

Viewers also liked (12)

Similar to [week6] 데이터읽어주는남자 (20)

More from neuroassociates (20)

[week6] 데이터읽어주는남자