Copy environment variables between SSIS catalogues on different servers

Problem: When developing SSIS packages, you usually want the project to run on different servers, e.g. development, test and production servers. If you have many environment variables, recreating all of them by hand is tedious and unnecessary work. I’ll show you how you can copy environment variables between SSIS catalogues that are located on different servers. Solution: …

Presentation “R for Data Science”

Some weeks ago I gave a presentation at my workplace about “R for data science” that I’d like to share with you. I wrote the slides in R and rmarkdown and uploaded them to rpubs.com. I chose to use rmarkdown for my slides, although we have great company PowerPoint templates, because I wanted to …

Accessing MSSQL Server with R (RSQLServer with dplyr)

Recently I started using dplyr for handling my data in R. It makes everything a lot smoother! My previous workflow – running an SQL query, storing the results as CSV, loading them in RStudio – is now history. With dplyr you can directly query data from many different databases in a very …

Accessing your Fitbit data

Since I am a data junkie and bought my Fitbit Charge HR mainly because I wanted to collect and analyse data about myself, I was looking for ways to download the data to my computer. For most people the great stats overview in the app and in the online dashboard will be sufficient, but some …

Learning Club 05-07: Starting to love rmarkdown (Naive Bayes, Clustering, Linear Regression)

I remember that when I took an R course at university I was really not a fan of rmarkdown and knitr. But since I started participating in a Learning Club, where people are encouraged to document and present their code, data and results, I have come to love it. Prior to that I had always documented my university assignments either …

I was a guest on the Becoming a Data Scientist Podcast!

Almost 2 weeks ago the Becoming a Data Scientist podcast had 4 special interviews – each of them with members of the Learning Club, including me! I was super excited when Renee asked me some weeks ago if I wanted to participate and I was a little bit nervous during the interview. But I think …

Finding data sets PART 3: Weather, geographical and government data

This is the third post in my dataset series. The first part gave a more general overview of where to get data. In the second post I listed sources for sports, movies, music and books data. This part explains how to get weather and public/governmental data and how to find GIS data. …

Data Analysis with Microsoft Excel: Tables

My article is available in German on ambassadorbase.at. In my job, and in the studies I recently finished, I work with lots of different data sources, and you will also meet all of them throughout your career as a data scientist. Data can be given to you as an SQL dump, XML files and many other …

Finding data sets Part 2: TV, music, book ratings and sports data

The first part gave a more general overview of where to get data. This part lists specific data sources, e.g. for sports, movies, books and so on. Over the next couple of weeks you’ll find these posts on my blog: General data sources; TV, music, book ratings and sports data; …

Finding data sets Part 1: General data sources

I often encounter interesting algorithms or R packages which I want to test. The nice ones provide data for testing, but often it is only dummy data. To get a good understanding of a method and its limitations, real data might be required. Sometimes I would also like to explore data I have not used …