-
권재명 박사의 카이스트 세미나Statistics 2018. 9. 13. 10:34
Leading AI + Data Science Team
Sep 2018
Jamie Kwon(https://dataninja.me)AI > ML > DL
www.matroid.com/scaledml/2017/jeff/pdf
데이터 사이즈가 클수록 정확도는 어떻게 되는가?https://www.cbinsight.com
www.hanalyze.com/2018/AI + DS Project Examples
Ex1. Top n pageshttps://www.evanmiller.org/how-not-to-sort-by-average-ratin…
Wilson Confidence interval
Ex2. Running A/B Test
https://www.evanmiller.org/sequential-ab-testing.html
http://pages.optimizely.com/rs/op...Ad Click Prediction : a View from the Trenches <- paper
https://ai.google/reserach/pubs/pub41159
Algorithm1 : Per - Coordinate FTRL-Proximal with L1 and L2 regularizationLow-Rank Matrix Factorization
classification, regression, clustering, dimensionality reduction
ComplicationConfiguration, Data Collection, Data Verification, Machine Resource Management, ...
http://www.longtail.com/about.html
Common AI + DS tool and workflow
python 3.6 ecosystem
Scikit-learn +R ecosystem
R Studio + tidyverse(+Jupyter)
www.stroybench.orgJupyter Ecosystem
https://medium.com/netflix...Tensorflow for DL
AI + DS Team
Profile of an analyst
-Basic Stats
-SQL
-Excel(Tableau)
-Attention to details
-Good CommunicationBuiling and Running AI + DS Team
1. Version control AI + DS codes
2. blogd.codinghorror.com/when-understanding-means-rewriting
Strive for readable codes
3. Strive for reproducibility
How R helps Aribnb make the most of its idea
5. Iterate fast
6. Produce high-quality power point
7. Open by default'Statistics' 카테고리의 다른 글
RCT(Random Controlled Trail)의 원칙 (0) 2018.12.14 RCT(Random Controlled Trial)의 장점 (0) 2018.12.14 초밥 장인 (0) 2018.12.13 데이터 고속도로 (0) 2018.08.31 표본분포(sampling distribution) (0) 2017.08.22