A Big Data Day Workshop at Harvey Mudd College

We are pleased to announce a one-day Big Data workshop on Tuesday, April 7, 2015, from 8 am to 2 pm on the Harvey Mudd campus (Shanahan 2461). The workshop will be led by the Pittsburgh Supercomputing Center, and the Scientific Computing Specialist at Harvey Mudd will be the on-site TA for local participants. The workshop will focus on topics such as Hadoop and Spark. If you are unable to attend the whole workshop because of your class schedule, I recommend attending the first two sessions to learn the basics of Big Data and do some hands-on programming in Java.

Registration is required for the hands-on part. Please register at https://www.xsede.org/web/xup/course-calendar/-/training-user/class/378/session/629. Registration requires an XSEDE account, which you can obtain from https://www.xsede.org/web/xup/my-xsede?p_p_id=58&p_p_lifecycle=0&p_p_state=maximized&p_p_mode=view&p_p_col_id=column-1&p_p_col_pos=1&p_p_col_count=2&_58_struts_action=/login/create_account.

* Workshop Agenda *
8:00 am: Welcome
8:30 am: Intro to Big Data
9:15 am: Hadoop
10:00 am: Lunch break
11:00 am: Hadoop (cont)
11:30 am: Exercises
12:15 pm: Spark
1:15 pm: Exercise 2
2:00 pm: Adjourn
(All times given are PST)

Due to demand, this workshop will be telecast to several satellite sites. This workshop is NOT available via a webcast.

Please feel free to let me know if you have any questions. I hope to see you at the workshop. Thanks!

Jeho Park, Ph.D.
Scientific Computing Specialist, HMC
(909) 607-9023

SuperComputing (SC11) Conference for College Educators

The SuperComputing (SC) conference is the leading international conference on High Performance Computing (HPC), Networking, Storage and Analysis. This year the 24th annual SC conference (SC11) was held in Seattle, WA, in November 2011. More than 5000 participants gathered in one place to learn about, discuss, and show off cutting-edge technologies in HPC and related areas.

Although the conference is huge in all respects, the beauty of the SC conference is in its specialized sub-community conferences. One of these, the Education Program, is very well suited to college educators who teach HPC and Scientific Computing. The main focus of the Education Program is to learn and share better ways of teaching HPC and Scientific Computing (or Computational Sciences) tools to undergraduate faculty and students.

Jeho Park (Scientific Computing Specialist) of CIS attended SC11, learned many good practices in HPC education, and made relevant connections on behalf of our HMC community. A few of the takeaways worth mentioning are the Bootable Cluster CD (BCCD), the LittleFe Project, and the FutureGrid Project.

BCCD is a turnkey solution for building a Beowulf-style cluster on the fly. The BCCD boot image comes with a complete parallel computing environment, including the network setup, libraries, compilers, benchmarks and applications needed to teach HPC to undergraduate faculty and students. So to teach distributed and parallel computing, you just need BCCD and a couple of networked workstations, or a single computer with one or more multicore processors. BCCD even runs in virtual machine (VM) environments. This means that you can boot multiple BCCD VMs on different cores and emulate a cluster environment right in front of your audience. CIS will be testing BCCD on our High Performance Workstations during the winter break. For more information, please visit http://bccd.net/.

LittleFe is an interesting project, funded in part by Intel (until this year), to build a portable (< 50 lb) six-node cluster for a relatively small amount of money (< $3,000). The LittleFe portable cluster is a simple and easy way to build a hardware and software resource for teaching parallel processing speedup, efficiency, and load balancing. CIS will keep an eye on the project's call for applications for 2012 LittleFe grants. If you are interested in being involved in this project at HMC, please contact Jeho at CIS.
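For readers new to the terms, the speedup and efficiency mentioned above are usually defined as S = T1 / Tp and E = S / p, where T1 is the serial run time, Tp is the run time on p nodes, and p is the node count. The following is only a minimal sketch of those textbook definitions in Python; the function names and the timings are hypothetical and do not come from LittleFe or any real measurement.

```python
def speedup(t_serial, t_parallel):
    """Return the speedup S = T1 / Tp for two run times in seconds."""
    if t_serial <= 0 or t_parallel <= 0:
        raise ValueError("run times must be positive")
    return t_serial / t_parallel


def efficiency(t_serial, t_parallel, nodes):
    """Return the parallel efficiency E = S / p for a run on `nodes` nodes."""
    if nodes < 1:
        raise ValueError("nodes must be at least 1")
    return speedup(t_serial, t_parallel) / nodes


# Hypothetical run times (seconds) on 1, 2, 4 and 6 nodes; illustration only.
timings = {1: 120.0, 2: 64.0, 4: 36.0, 6: 28.0}
t1 = timings[1]

for nodes, t in timings.items():
    s = speedup(t1, t)
    e = efficiency(t1, t, nodes)
    print(f"{nodes} node(s): speedup {s:.2f}, efficiency {e:.2f}")
```

An efficiency well below 1.0 on the larger node counts is the kind of result that opens a classroom discussion of communication overhead and load balancing.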

If you are looking for a more serious type of HPC resource, take a good look at the FutureGrid Project. The FutureGrid Project focuses on offering new and dedicated test-bed environments for research challenges in grid-enabled and cloud-enabled computational schemes in science and engineering. FutureGrid also actively supports education and broader outreach activities:

“…. The project will advance education and training in distributed computing at academic institutions with less diverse computational resources. It will do this through the development of instructional resources that include preconfigured environments that provide students with sandboxed virtual clusters….”

So it sounds like FutureGrid is waiting for your innovative ideas to exploit its new experimental testbed for your research and teaching on HPC, scientific computing, parallel computing, distributed computing and cloud computing. Harvey Mudd College is an especially good fit for FutureGrid in terms of its scope. We encourage faculty members to look at the FutureGrid website and to contact CIS for any assistance in applying for FutureGrid instances.

The next conference, SC12, will be held in Salt Lake City, Utah, on November 10, 2012.

MATLAB Seminars for Mudders

In April, CIS offered a series of MATLAB seminars to the HMC community. There were five seminar meetings covering three different topics: basic MATLAB programming, advanced MATLAB programming and parallel processing with the Parallel Computing Toolbox.

The basic MATLAB programming seminars, taught by Jeho Park of CIS, covered fundamental yet essential MATLAB programming skills for beginners. The seminar participants enjoyed creating function m-files and supporting documents. The basic MATLAB seminar attracted many freshmen who wish to prepare themselves for courses that require MATLAB programming skills. CIS plans to offer additional basic MATLAB seminar classes early in the fall semester for those who missed the April seminar meetings. So please stay tuned.

CIS also invited the MathWorks Senior Application Engineer, Doug Eastman, to the HMC campus to discuss advanced MATLAB programming topics. The MathWorks on-site seminar discussed how to make use of different MATLAB functions and memory allocation methods for better computing performance. The presenter also introduced MATLAB parallel processing features that may lead to a significant performance improvement for some number-crunching applications. The seminar was very helpful for those seeking ways to improve the performance of their MATLAB code.

For future MATLAB seminars at HMC, we welcome your suggestions for topics: http://www.formstack.com/forms/hmc-matlab_seminar_topic_suggestion.

COMSOL workshop on HMC campus

A: “Dude, where’s my car? I need to drive to L.A. to attend the COMSOL workshop.”
B: “What car? And for what? Dude, did you miss the COMSOL workshop in the Learning Studio classroom last month?”
A: “Doh!” 

That’s right. CIS brought the COMSOL workshop to our campus on Friday, January 28th. It attracted a large turnout: 28 participants from Harvey Mudd (22), CMC (2), Pomona (1), CGU (1) and Keck (2). We were especially excited to see the majority of the participants were from HMC.

The workshop was led by Dr. Mina Sierou from COMSOL, Inc. During the first half of the workshop, she gave an overview of COMSOL Multiphysics Version 4.1 by creating a simple model to explain its capabilities, basic usage and new user interface. For the rest of the workshop, participants tried the new COMSOL 4.1 on their own laptops and asked the many questions they had been saving for a COMSOL expert.

This event was successful not only from CIS’ point of view but also from the faculty participants’ perspective. “I thought the workshop was quite effective.  …  I would recommend it to students as a time-effective way to get up to speed on the basics of COMSOL,” said Prof. David Harris of the Engineering Department. We are working to offer another on-campus COMSOL workshop in the fall. So, Dude A, please stay tuned!

Visit http://www.comsol.com/events/ for more information on free/non-free COMSOL seminars and training sessions.