All Answers: [pakgrid] From Cloud, big data to Data-Intensive computing

Tuesday, 19 March 2013

AoA All,

Just sharing our roadmap. Feel free to comment.

We started working on cloud computing at PDC around 2009, actively engaging in workshops, EU-wide private cloud deployments, and research.

We recently changed the name from "HPCViz Cloud Group" to "HPCViz Data-Intensive Computing Group".

http://www.pdc.kth.se/research/projects/national/data-intensive-computing

Very soon we are launching a course in Data-intensive computing at KTH.

"Cloud" has become so common it doesn't make sense to keep it in the name of a research group.

Here is how we see the group's work divided into layers:

1) Application layer

Bioinformatics, Security, Visualization (we have started to look more at
this), ... - with a focus on machine learning techniques for big data
analytics, and possibly also on the advantages of functional programming
for big data. E.g. Spark mainly uses Scala for its execution.

2) Data processing layer

E.g. Hadoop, Hive, Pig, HBase, Storm, MPI, Spark, Spark Streaming, Shark,
Spark Graph, MLBase

3) Data management layer

E.g. HDFS, Tachyon, ...

4) Resource management layer

E.g. Hadoop YARN, Mesos, ...

5) Infrastructure layer
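
As a rough illustration of the functional programming point under the application layer, here is a minimal sketch in plain Python (not Scala, and not the Spark API). The function names and the sample input are made up for this example. It only shows the general idea: a map step that holds no shared state and a reduce step that is associative can be applied to data partitions independently, which is what makes this style attractive for big data.

```python
from collections import Counter
from functools import reduce


def map_partition(lines):
    """Map step: count the words in one partition, touching no shared state."""
    return Counter(word.lower() for line in lines for word in line.split())


def merge_counts(left, right):
    """Reduce step: combine two partial counts.

    Counter addition is associative, so partial results can be merged
    in any grouping.
    """
    return left + right


def word_count(partitions):
    """Apply the map step to each partition, then fold the partial results."""
    partials = map(map_partition, partitions)
    return reduce(merge_counts, partials, Counter())


# Hypothetical input: each inner list stands in for one data partition.
partitions = [
    ["big data needs big clusters"],
    ["data intensive computing", "big data"],
]

counts = word_count(partitions)
print(counts.most_common(2))
```

Because map_partition depends only on its own input, the partitions could in principle be processed on different machines and merged afterwards. Here everything runs sequentially in one process.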

If you want to see how well we fit the description, see e.g.
http://en.wikipedia.org/wiki/Data_Intensive_Computing

We deliberately avoided using 'BigData' in the name, since that is
also a hyped word that has already been used too much.

"Data Intensive Computing" is solid and clearly defined, it describes what
we do, it has been around for a long time, and it will stay for the
foreseeable future.

Regards

Zeeshan Ali Shah

System Administrator - PDC HPC

PhD researcher (IT security)

Kungliga Tekniska Högskolan

+46 8 790 9115

http://www.pdc.kth.se/members/zashah
