AoA All,
Just sharing our roadmap. Feel free to comment.
We started working on cloud computing at PDC around 2009, actively engaging in workshops, EU-wide private cloud deployments, and research.
We recently changed the name from "HPCViz Cloud Group" to "HPCViz Data-Intensive Computing Group".
Very soon we are launching a course in Data-intensive computing at KTH.
"Cloud" has become so common it doesn't make sense to keep it in the name of a research group.
Here's how we see the group's work divided:
1) Application layer
Bioinformatics, Security, Visualization (we have started to look more at
this), ... - focus on machine learning techniques for big data
analytics, possibly also looking at the advantages of functional
programming for big data. E.g. Spark mainly uses Scala for its execution.
2) Data processing layer
E.g. Hadoop, Hive, Pig, HBase, Storm, MPI, Spark, Spark Streaming, Shark,
Spark Graph, MLBase
3) Data management layer
E.g. HDFS, Tachyon, ...
4) Resource management layer
E.g. Hadoop YARN, Mesos, ...
5) Infrastructure layer
If you want to see how well we fit the description, see e.g.
http://en.wikipedia.org/wiki/Data_Intensive_Computing
We deliberately avoided using 'BigData' in the name, since that is
also a hyped word that has already been used too much.
"Data Intensive Computing" is solid, clearly defined, and what we do.
It has been around for a long time and will stay for the foreseeable future.
--
Regards
Zeeshan Ali Shah
System Administrator - PDC HPC
PhD researcher (IT security)
Kungliga Tekniska Hogskolan
+46 8 790 9115