Tuesday, 19 March 2013

[pakgrid] From Cloud, big data to Data-Intensive computing

 

AoA All, 

Just sharing our roadmap. Feel free to comment.

We started working on cloud computing at PDC around 2009, actively engaging in workshops, EU-wide private cloud deployments, and research.

We recently changed the name from "HPCViz Cloud Group" to "HPCViz Data-Intensive Computing Group". 


Very soon we are launching a course in Data-intensive computing at KTH.

"Cloud" has become so common it doesn't make sense to keep it in the name of a research group.

Here's how we see the group's work dividing into layers:

1) Application layer

Bioinformatics, Security, Visualization (we have started to look more at
this), ... - with a focus on machine learning techniques for big data
analytics, and possibly also the advantages of functional programming
for big data. E.g. Spark is written mainly in Scala.

2) Data processing layer

E.g. Hadoop, Hive, Pig, HBase, Storm, MPI, Spark, Spark Streaming, Shark,
Spark Graph, MLbase

3) Data management layer

E.g. HDFS, Tachyon, ...

4) Resource management layer

E.g. Hadoop YARN, Mesos, ...

5) Infrastructure layer
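
As a rough sketch of the functional style mentioned under the application
layer: a word count written as a map step followed by a reduce step, in
plain Python. This is an illustration only, not code from our group or from
Spark, and the names lines, count_words and merge are made up for the example.

```python
from functools import reduce

# Hypothetical sample input; in practice the lines would come from a
# distributed file system rather than an in-memory list.
lines = [
    "cloud big data",
    "data intensive computing",
    "big data analytics",
]


def count_words(lines):
    """Count word occurrences using a map step and a reduce step."""
    # Map step: turn every line into (word, 1) pairs.
    pairs = [(word, 1) for line in lines for word in line.split()]

    # Reduce step: fold the pairs into a dictionary of totals,
    # building a new dictionary at each step instead of mutating one.
    def merge(totals, pair):
        word, n = pair
        return {**totals, word: totals.get(word, 0) + n}

    return reduce(merge, pairs, {})


counts = count_words(lines)
print(counts["data"])  # 3
```

Because neither step depends on shared mutable state, the same shape of
computation can in principle be split across many machines, which is the
kind of advantage we have in mind.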


If you want to see how well we fit the description, see e.g.
http://en.wikipedia.org/wiki/Data_Intensive_Computing

We deliberately avoided using 'BigData' in the name, since that is
also a hyped term that has already been used too much.

"Data Intensive Computing" is solid and clearly defined, describes what we
do, has been around for a long time, and will stay for the foreseeable future.


-- 

Regards

Zeeshan Ali Shah
System Administrator - PDC HPC
PhD researcher (IT security)
Kungliga Tekniska Hogskolan
+46 8 790 9115
