poniedziałek, 24 października 2016

ReadingList: Data Bases in R

1. Data Table by Matt Dowle.
Cheatsheet: https://s3.amazonaws.com/assets.datacamp.com/img/blog/data+table+cheat+sheet.pdf
https://www.analyticsvidhya.com/blog/2016/05/data-table-data-frame-work-large-data-sets/

Basics: https://www.dezyre.com/data-science-in-r-programming-tutorial/r-data-table-tutorial
https://www.r-bloggers.com/a-data-table-r-tutorial-by-datacamp-intro-to-dti-j-by/
https://www.r-bloggers.com/elegant-fast-data-manipulation-with-data-table/

Adv: https://www.r-bloggers.com/advanced-tips-and-tricks-with-data-table/
https://www.r-bloggers.com/efficient-aggregation-and-more-using-data-table/
Quick union: https://www.r-bloggers.com/r-quick-tip-collapse-a-lists-of-data-frames-with-data-table/
Aggregation: https://www.r-bloggers.com/two-of-my-favorite-data-table-features/
Binning: https://www.r-bloggers.com/using-data-table-for-binning/

Joining: http://www.magesblog.com/2014/06/the-joy-of-joining-datatables.html#more
Manipulation the fast way: http://www.magesblog.com/2012/11/datatable-rocks-data-manipulation-fast.html#more

Solving common problems: https://www.r-bloggers.com/solve-common-r-problems-efficiently-with-data-table/

Matt Dowle presentations:
http://datascience.la/matt-dowles-data-table-talk-from-user-2014/

Roll:
https://www.r-bloggers.com/understanding-data-table-rolling-joins/

https://www.r-bloggers.com/advanced-tips-and-tricks-with-data-table/

All articles:
https://github.com/Rdatatable/data.table/wiki/Articles

H2O data table?
http://t.co/TYGG6KcMpx
https://www.analyticsvidhya.com/blog/2016/05/h2o-data-table-build-models-large-data-sets/


2. Dplyr by Hadley Wickham... plenty of stuff everywhere (to be elaborated on)

piątek, 21 października 2016

Free books from Gitbook... R stuff.

General: 
Przewodnik po pakiecie, Przemysław Biecek https://pbiecek.gitbooks.io/przewodnik/content/
R for Data Science, Garrett Grolemund Hadley Wickham http://r4ds.had.co.nz/

Programming:
Efficient R programming, Colin Gillespie, Robin Lovelace https://csgillespie.github.io/efficientR/
Advanced R by Hadley Wickham http://adv-r.had.co.nz/

Time series:Analiza i prognozowanie szeregów czasowych – case studies Część I , Adam Zagdański, Artur Suchwałko http://quantup.pl/szeregi-czasowe-case-studies/

Visual analysis:
Geostatystyka w R, Jakub Nowosad (see also presentations pazuR https://nowosad.github.io/presentations/)https://bookdown.org/nowosad/Geostatystyka/eksploracyjna-analiza-danych-przestrzennych.html
Not compiled files for Ggplot2 book by Hadley Wickham: https://github.com/hadley/ggplot2-book

R for Excel users
http://quantup.pl/operacje-w-R-znane-z-Excela/
http://quantup.pl/wykresy-w-R-znane-z-Excela/

Report writing:
Authoring Books with R Markdown, Yihui Xiehttps://bookdown.org/yihui/bookdown/

Data mining:
The caret Package, Max Kuhn  http://topepo.github.io/caret/index.html

Python
PythonPython Data Science Handbook
Jake VanderPlas Soon to be published: https://github.com/jakevdp/PythonDataScienceHandbook/blob/master/README.md