Resources

There are tons of great resources all over the internet. I’ve bookmarked hundreds of URLs, and this page is my categorized collection of the references and free tools I’ve found helpful. If you’re reading this and have something to add, or you find a dead link, please send me a note. I’m continuing to add to this over time.

Business Intelligence | Career Management | Computer Science | Data Science | Datasets | Data Storage | Data Visualization | MarTech | Programming | Software Development | Web Application Frameworks | Web Services | Website Tools


Business Intelligence


OLAP Cubes

A nice intro guide on what these are and why they are used.
olap.com/learn-bi-olap/olap-bi-definitions/olap-cube

Transactional vs. Analytical Processing

Good cross-comparison between OLTP and OLAP systems.
datawarehouse4u.info/OLTP-vs-OLAP


Career Management


3 Data Career Paths Decoded

Helpful article that compares and contrasts the role of a data analyst vs. data scientist vs. data engineer.
blog.udacity.com/data-analyst-vs-data-scientist-vs-data-engineer

Git Showcase

Free portfolio site that allows developers to easily feature projects from their GitHub repositories.
gitshowcase.com

Google Cloud Certification - Data Engineer

A Google Certified Professional - Data Engineer enables data-driven decision making by collecting, transforming, and visualizing data. To earn this certification you must pass the in-person exam. This webpage offers a collection of useful training resources and reference materials aimed at achieving this certification.
cloud.google.com/certification/data-engineer

Skills Index

Undoubtedly you’ve heard about the skills gap challenges in the U.S. economy. Using select data from LinkedIn and the publisher’s proprietary analysis, the Skills Index takes a look at supply vs. demand around specific skill sets across top industries and provides actionable recommendations for getting up to speed.
skillsindex.com

TechPizza

This site aggregates information around upcoming data-related Meetups and also gives users the ability to browse for slides, code, and video from Meetup tech groups around the world.
techpizza.org


Computer Science


Big-O Notation Cheat Sheet

This webpage covers the space and time Big-O complexities of common algorithms used in Computer Science.
bigocheatsheet.com
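
To make the idea concrete, here is a minimal sketch (my own illustration, not taken from the cheat sheet) that counts the steps a linear search and a binary search take on the same sorted list. The function names are hypothetical; the point is only to show O(n) versus O(log n) growth.

```python
def linear_search_steps(items, target):
    """Count how many elements a linear scan inspects: O(n)."""
    steps = 0
    for value in items:
        steps += 1
        if value == target:
            break
    return steps


def binary_search_steps(items, target):
    """Count how many midpoints a binary search inspects: O(log n).

    Assumes `items` is sorted in ascending order.
    """
    steps = 0
    lo, hi = 0, len(items) - 1
    while lo <= hi:
        steps += 1
        mid = (lo + hi) // 2
        if items[mid] == target:
            break
        if items[mid] < target:
            lo = mid + 1
        else:
            hi = mid - 1
    return steps


if __name__ == "__main__":
    data = list(range(1_000_000))
    print("linear:", linear_search_steps(data, 999_999))
    print("binary:", binary_search_steps(data, 999_999))
```

On a sorted list of one million numbers, the linear scan needs up to one million steps to find the last element, while the binary search never needs more than 20.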


Data Science


Beginner’s Guide to Big Data Terminology

Walkthrough on some of the common lingo of data science, such as DaaS and Neural Networking.
dataconomy.com

Best Practices for ML Engineering

This guide is intended to help those with a basic knowledge of machine learning get the benefit of best practices in machine learning. If you have taken a class in machine learning, built, or worked on a machine-learned model, then you have the necessary background to read this document.
martin.zinkevich.org/rules_of_ml.pdf

Data Mining in Python: A Guide

Data mining is the process of discovering predictive information from the analysis of large databases. This guide provides an example-filled introduction to data mining using Python, one of the most widely used data mining tools - from cleaning and data organization to applying machine learning algorithms.
springboard.com/blog/data-mining-python-tutorial

Handy Python Libraries for Formatting and Cleaning Data

Data scientists spend a lot of time cleaning messy data. This is a list of Python libraries that help make data more orderly and legible - from styling DataFrames to anonymizing datasets.
blog.modeanalytics.com/python-data-cleaning-libraries

Kaggle

Offers a means of learning data science through both public and private competitions.
kaggle.com

KDnuggets

This website looks like its design hasn’t changed since the 90s, but it is home to lots of great content on business analytics, big data, data mining, and data science.
kdnuggets.com

R or Python for Data Science?

This is a nice blog post on opendatascience.com that digs into the differences/advantages of using either R or Python for performing data science tasks.
opendatascience.com/blog/r-or-python-for-data-science

RegexOne

Regular expressions are extremely useful in extracting information from text such as code, log files, spreadsheets, or even documents. This site offers an interactive tutorial and practice exercises to help you learn them.
regexone.com
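
As a small taste of what that looks like in practice, here is a sketch using Python’s built-in re module to pull fields out of a web server log line. The log line, the pattern, and the parse_log_line helper are all made up for illustration and are not from the RegexOne tutorial.

```python
import re

# Hypothetical access-log line, invented for this example.
LOG_LINE = (
    '203.0.113.7 - - [12/Mar/2017:10:15:32 +0000] '
    '"GET /resources HTTP/1.1" 200 5123'
)

# Named groups make the extracted pieces easy to refer to later.
LOG_PATTERN = re.compile(
    r'(?P<ip>\d{1,3}(?:\.\d{1,3}){3})'            # IPv4-looking address
    r' .*?'                                       # skip ahead to the request
    r'"(?P<method>[A-Z]+) (?P<path>\S+) [^"]*"'   # "METHOD /path PROTOCOL"
    r' (?P<status>\d{3})'                         # three-digit status code
)


def parse_log_line(line):
    """Return a dict of extracted fields, or None if the line does not match."""
    match = LOG_PATTERN.search(line)
    if match is None:
        return None
    fields = match.groupdict()
    fields["status"] = int(fields["status"])
    return fields


if __name__ == "__main__":
    print(parse_log_line(LOG_LINE))
```

The pattern is deliberately loose (it does not validate that each IP octet is 0-255), which is usually fine for quick extraction but not for validation.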

rOpenSci

Open source R packages that allow access to data repositories and provide programmatic access to a variety of scientific data and other real-time metrics of scholarly impact.
ropensci.org


Datasets


Crowdflower: Data for Everyone

Collection of free, downloadable, and categorized datasets that have gone through the Crowdflower platform.
crowdflower.com/data-for-everyone

Data.World: The Social Network for Data People

Discover and share cool data, connect with interesting people, and work together to solve problems faster. Users can find and use a vast array of high-quality open data.
data.world

Google’s My Activity Page

This portal reveals everything Google knows about you - every search you’ve made, the apps you’ve used, the videos you’ve watched, and everything in between. Visit to see how your data is being collected, modify activity settings, and delete data that you prefer not retained.
myactivity.google.com


Data Storage


7 Steps to Understanding NoSQL Databases

The term NoSQL has come to be synonymous with schema-less, non-relational data storage schemes. NoSQL is an umbrella term, one which encompasses a number of different technologies. This article gives newcomers an overview of NoSQL technologies and the architectures they include.
kdnuggets.com/seven-steps-understanding-nosql-databases.html

What Is ETL?

ETL is shorthand for the extraction, transformation, and loading process used in most data movement operations. This article provides a nice overview for those wanting to understand the basics around these phases.
timmitchell.net/what-is-etl
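
The three phases can be sketched in a few lines of Python using only the standard library. This is a toy illustration under my own assumptions (the sample CSV, the customers table, and the function names are all hypothetical), not the approach described in the article.

```python
import csv
import io
import sqlite3

# Hypothetical raw export with messy whitespace, casing, and a blank name.
RAW_CSV = """name,signup_date,plan
 Ada ,2017-01-05,PRO
grace,2017-02-11,free
,2017-03-01,pro
"""


def extract(text):
    """Extract: read the raw CSV text into a list of dict rows."""
    return list(csv.DictReader(io.StringIO(text)))


def transform(rows):
    """Transform: trim and normalize values, dropping rows with no name."""
    cleaned = []
    for row in rows:
        name = row["name"].strip()
        if not name:
            continue
        cleaned.append((name.title(), row["signup_date"], row["plan"].lower()))
    return cleaned


def load(rows, conn):
    """Load: write the cleaned rows into the destination table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS customers "
        "(name TEXT, signup_date TEXT, plan TEXT)"
    )
    conn.executemany("INSERT INTO customers VALUES (?, ?, ?)", rows)
    conn.commit()


def run_etl(text, conn):
    """Run all three phases and return what ended up in the table."""
    load(transform(extract(text)), conn)
    return conn.execute(
        "SELECT name, plan FROM customers ORDER BY name"
    ).fetchall()


if __name__ == "__main__":
    print(run_etl(RAW_CSV, sqlite3.connect(":memory:")))
```

Keeping each phase in its own function makes it easy to swap the source (a file or an API instead of a string) or the destination (a real warehouse instead of in-memory SQLite) without touching the cleaning logic.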


Data Visualization


Google Charts

Google Charts is a JavaScript-based tool that lets people easily create a chart from some data and embed it in a web page. It’s free and has a solid library of interactive charts and data tools available for use.
developers.google.com/chart

Google Data Studio

Free product that lets you connect to all your marketing data and turn it into beautiful, informative reports that are easy to understand, easy to share, and fully customizable.
datastudio.google.com


MarTech


Google Analytics Demo Account

If you’re like me, you learn by doing. This fully functional Google Analytics account is a great way to look at real business data and experiment with Google Analytics features. The data is from the Google Merchandise Store, a real ecommerce store, and it’s typical of what you would see for an ecommerce website.
support.google.com/analytics

The Definitive Glossary of Programmatic Advertising

Programmatic advertising is about using automated systems (technology) and data to make media buying decisions without humans. This is a helpful glossary of terms associated with this concept.
blog.hubspot.com/agency/programmatic-advertising-glossary

Real Story Group

This firm specializes in evaluating vendors in the MarTech space to help you find the right provider among a glut of offerings.
realstorygroup.com


Programming


Bento

A curated collection of tutorials and free learning resources for learning to code in new languages.
bento.io

CodeEnv

Free site lets you share your code with others in CodeEnv online environments. Good for teaching, prototyping, and sharing fiddles.
codeenv.com

CodeProject: Diving in OOP

Comprehensive article that covers almost every OOP (object-oriented programming) concept in detail with C# examples.
codeproject.com/Diving-in-OOP

Fiddles.io

Offers sandbox environments for developers to play around with and modify live sample code for all kinds of languages. It’s also easy to share or demonstrate solutions to problems.
fiddles.io

Google’s Go Language

This article represents a nice primer on the differentiating features of Google’s Go language (Golang) and its tools, including its extremely lightweight concurrency.
infoworld.com/googles-go-language

Python Challenge

Python Challenge is a game in which each level can be solved by a bit of (Python) programming. It’s a good way to practice through solving riddles.
pythonchallenge.com

Topcoder

Topcoder is a company that administers contests in computer programming, through which prize money can be won. Competition aside, this site also offers regular challenges and matches through which you can learn new skills and hone skills you already have.
topcoder.com

Understanding Go Pointers

This post is for programmers coming to Go who are unfamiliar with the idea of pointers or a pointer type in Go. It also digs deeper into the concept of computer memory (RAM) and how memory location is accessed through your code.
dave.cheney.net/understand-go-pointers

OverAPI

A huge selection of cheat sheets for almost any current programming language and other technologies.
overapi.com


Software Development


Git Branching Model

This post outlines a development model for git branching strategy and release management.
nvie.com/git-branching-model

Queues

This page tries to collect the libraries for the queueing systems (job, messaging, etc.) that are widely popular and have a successful record of running on (big) production systems.
queues.io

StackShare

StackShare provides online software for displaying and sharing your technology stack, which is made up of the software that you use. It’s an online community that features comparisons, ratings, reviews, recommendations, and discussions of the best software tools and software infrastructure services.
stackshare.io

The Twelve-Factor App

In the modern era, software is commonly delivered as a service: called web apps, or software-as-a-service (SaaS). The twelve-factor app is a methodology for building SaaS apps that can be applied to apps written in any programming language, and which use any combination of backing services (database, queue, memory cache, etc).
12factor.net


Web Application Frameworks


The Django Book

Free online book that offers comprehensive Python Django tutorials, easy-to-understand Django documentation, coverage of the Model-View-Controller (MVC) design pattern, and more.
djangobook.com

Python Web Frameworks

This report surveys 30 Python web frameworks that have more than 1,000 monthly downloads and takes a deeper look at six of the most widely used. It also provides a general overview of web application frameworks and what they do.
oreilly.com/learning/python-web-frameworks


Web Services


Google Analytics Query Explorer

This tool lets you play with the Core Reporting API by building queries to get data from your Google Analytics views (profiles). You can use these queries with any of the client libraries to build your own tools.
ga-dev-tools.appspot.com/query-explorer

OpenWeatherMap

API for accessing current weather data for any location including over 200,000 cities.
openweathermap.org/api

Postman

This free Chrome extension allows developers to explore, test, and build APIs using a powerful collaborative testing and development suite.
getpostman.com

RESTful Architecture

Technical documentation for RESTful web services with references and language-specific examples.
smartsheet-platform.github.io/api-docs

What is a REST API?

Thorough overview on what REST APIs are and how to use them.
idratherbewriting.com/docapis_what-is-a-rest-api


Website Tools


BuiltWith

Enter the URL of a website and quickly find a list of the technologies used to support that site including email services, nameserver providers, JavaScript libraries, widgets, server information, and more.
builtwith.com

Google Design: Resizer

An interactive viewer to see and test how digital products respond to material design breakpoints across desktop, mobile, and tablet.
design.google.com/resizer

How To Use GitHub Pages To Make Websites

Step-by-step tutorial on getting started with building a website hosted on GitHub Pages.
readwrite.com/github-pages

How to Host Your Static Site with HTTPS on GitHub Pages and CloudFlare

When this tutorial was written, GitHub offered free static website hosting and custom domain support, but it was not possible to configure HTTPS for custom domains directly through GitHub Pages. This is where CloudFlare comes in.
developer.ubuntu.com/static-site-https-github-pages-and-cloudflare/

Mobile Website Speed Testing Tool

Another great Google product. Find out how well your site works across mobile and desktop devices by simply entering the URL.
testmysite.thinkwithgoogle.com

Static Site Generators

A leaderboard of the top open-source static site generators based on GitHub stars.
staticgen.com

Website Grader

Free online tool that grades any website against key metrics such as performance, mobile readiness, SEO, and security.
website.grader.com

Who Is Hosting This

Allows a user to simply enter the domain name of any site and instantly uncover the identity of the company that is hosting the site.
whoishostingthis.com