Including NoSQL, Map-Reduce, Spark, big data, and more. This resource includes technical articles, books, training and general reading. Enjoy the reading!
Source for picture: click here
Here’s the list:
- Great list of resources – NoSQL, Big Data, Machine Learning and more | GitHub
- Implementing a Distributed Deep Learning Network over Spark
- Correlation and R-Squared for Big Data
- [Book] Big Data – Principles and best practices of scalable realtime data systems
- 9 Lessons: Picking the Right NoSQL Tools
- Lesson 2: NoSQL Databases are Good for Everything – Except Maybe this One Thing
- 16 resources to learn and understand Hadoop
- 8 Hadoop articles that you should read
- Fast clustering algorithms for massive datasets
- Hadoop – Whose to Choose
- 11 Features any database, SQL or NoSQL, should have
- Big Data: The 4 Layers Everyone Must Know
- The Book: Big Data, NoSQL, Cloud A Paradigm Shift
- Lesson 8: Graph Databases
- How to get started with Hadoop?
- Optimizing care gaps and outreach programs in Healthcare
- Business Intelligence Architecture
- Lesson 4: Features Common to (Most) NoSQL/NewSQL Databases
- Get started with Hadoop and Spark in 10 minutes
- Lesson 5: Key Value Stores (AKA ‘Tuple’ Stores)
- How to score data in Hadoop/Hive in a flash
- Interesting database questions
- Lesson 3: Open Source, Distribution, or Suite
- Big Data Applications Scaling Using Java Architecture in the Cloud
- Lesson 7: Column Oriented Databases (aka Big Table or Wide Column)
- Big Data Analytics Infrastructure
- Hadoop Technology Stack
- Lesson 6: Document Oriented Databases
- A synthetic variance designed for Hadoop and big data
- Practical illustration of Map-Reduce (Hadoop-style), on real data
- Old SQL, New SQL, NoSQL – Making Sense of the Five Major Classes of Database Technology
- How NoSQL Fundamentally Changed Machine Learning
- eBook: Getting Started With Hadoop
- Salaries for Hadoop professionals
- Modern BI Architecture & Analytical Ecosystems
- Wiley’s Hadoop Book Bundle — A Free 113 Page Sampler
- Earthwatch to Look at Climate Change in Acadia National Park
- Polyglot Persistence?
- 50+ Open Source Tools for Big Data (See Anything Missing?)
- Implementing a Distributed Deep Learning Network over Spark
- Which one is best: R, SAS or Python, for data science?
- 15+ Great Books for Hadoop
- Clustering Similar Images Using MapReduce Style Feature Extraction with C# and R
- A Comparison of NoSQL Offerings
- How To Avoid The Big Data Quicksand
- Deploy Hadoop Cluster
- SQL to NoSQL translator
- Programming for Data Science the Polyglot approach: Python + R + SQL
- Seek the grail up the Knowledge Pyramid, not down
- Big Data Logistics: data transfer using Apache Sqoop from RDBMS
DSC Resources
- Career: Training | Books | Cheat Sheet | Apprenticeship | Certification | Salary Surveys | Jobs
- Knowledge: Research | Competitions | Webinars | Our Book | Members Only | Search DSC
- Buzz: Business News | Announcements | Events | RSS Feeds
- Misc: Top Links | Code Snippets | External Resources | Best Blogs | Subscribe | For Bloggers
Additional Reading
- Data Scientist Reveals his Growth Hacking Techniques
- 10 Modern Statistical Concepts Discovered by Data Scientists
- Top data science keywords on DSC
- 4 easy steps to becoming a data scientist
- 13 New Trends in Big Data and Data Science
- 22 tips for better data science
- Data Science Compared to 16 Analytic Disciplines
- How to detect spurious correlations, and how to find the real ones
- 17 short tutorials all data scientists should read (and practice)
- 10 types of data scientists
- 66 job interview questions for data scientists
- High versus low-level data science
Follow us on Twitter: @DataScienceCtrl | @AnalyticBridge