Monthly Archives: March 2015

Archlinux and Docker for my Raspberry PI2

What is the best Linux distribution for the Raspberry PI2? I started with Raspbian (Debian is my preferred Linux distribution for servers, desktops and laptops) but Docker didn’t work.

But with Archlinux it works fine.

How to create Docker images with Archlinux & RPI2? See http://linucc.github.io/docker-arch-rpi2/

matteoredaelli/docker-karaf-rpi is the first docker image I have created.

Below is my docker info output:

[root@raspi1 ~]# docker info
Containers: 4
Images: 9
Storage Driver: aufs
Root Dir: /var/lib/docker/aufs
Backing Filesystem: extfs
Dirs: 17
Execution Driver: native-0.2
Kernel Version: 3.18.10-1-ARCH
Operating System: Arch Linux ARM
CPUs: 1
Total Memory: 432.8 MiB
Name: raspi1
ID: UOFM:E7CP:2OTL:VTTM:QRP2:JNJ7:UFCI:2MDE:AOYQ:MGTM:Q25F:FL37
WARNING: No memory limit support
WARNING: No swap limit support
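
The two WARNING lines at the end are the part worth watching on a small board. As a minimal sketch, assuming the output format shown above, a tiny shell function can pull out just those warnings. The name docker_warnings is my own and not part of Docker.

```shell
# Hypothetical helper: read `docker info`-style text on stdin and
# print only the warning messages, without the "WARNING: " prefix.
docker_warnings() {
    sed -n 's/^WARNING: *//p'
}

# Usage on a real host (needs a running Docker daemon):
#   docker info 2>&1 | docker_warnings
```

If nothing is printed, the text on stdin contained no line starting with WARNING.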

Below are some Docker survival commands:

docker run -i -t --name karaf \
           -p 1099:1099 -p 8101:8101 \
           -p 44444:44444 -v /apps/karaf-deploy:/deploy \
           matteoredaelli/karaf-docker-rpi /bin/bash
docker start karaf
docker stop karaf
docker exec -it karaf bash
docker top karaf
docker ps
docker ps -a
docker images
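
The run and start commands above can be combined in a small wrapper. This is only a sketch under assumptions: start_or_hint is a hypothetical name, and it relies on `docker container inspect`, which needs a newer Docker CLI than the one shown in the docker info output above.

```shell
# Hypothetical helper: start the named container if it already exists,
# otherwise print a hint on stderr and return a non-zero status.
start_or_hint() {
    name="$1"
    # `docker container inspect` exits non-zero when no such container exists.
    if docker container inspect "$name" >/dev/null 2>&1; then
        docker start "$name"
    else
        echo "container '$name' not found; create it first with: docker run --name $name ..." >&2
        return 1
    fi
}

# Usage:
#   start_or_hint karaf
```

The function never creates a container itself, because the ports and volumes for docker run are specific to each image.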


A case study of adopting Bigdata technologies in your company

Bigdata projects can be very expensive and can easily fail: I suggest starting with a small, useful but non-critical project, ideally one about unstructured data collection and batch processing. That way you have time to get practice with the new technologies, and downtime of the Apache Hadoop system is not critical.

At home I have the following system running on a small Raspberry PI: for sure it is not fast 😉

At work I introduced Hadoop just a few months ago for collecting web data and generating daily reports.

Competitor_Analysis_BigData

First SQL, then NoSQL and Bigdata: now Bigdata and SQL again

drill

The trend in recent years has been to switch from SQL (RDBMS) databases to NoSQL systems like Hadoop, MongoDB, Cassandra, Riak, …

SQL is an old but easy and fast way to query data. And people STILL look to it for querying Hadoop and bigdata:

Read the details in “10 ways to query Hadoop with SQL”.