| Labeling it a major advance for both large-scale computing and stellar physics, NCSA researchers have generated the highest resolution, 3D simulation of convection in an entire, rotating model star by running a new communication-latency-tolerant code for parallel computer architectures on the center's new SGI/CRAY Origin2000. This achievement opens the way for calculations that may illuminate the detailed dynamic processes in rotating stars. At the same time it also demonstrates that scalable computers built from clusters of multiprocessor parallel machines can operate as efficiently as single supercomputing systems. |
|
In a nine-day run on NCSA's
128-processor SGI/Cray Origin2000,
consuming more than 25,000
cpu-hours,David Porter, Steve
Anderson, Joe Habermann, Tom
Ruwart, and Paul Woodward from
the University of Minnesota's
Laboratory for Computational
Science and Engineering (LCSE)
calculated the more than 18,000
time steps -- advancing 57 million
active computational cells out of a
grid with 169 million cells overall --
to simulate the convection process in
a rotating star. This astrophysical gas
dynamics simulation was computa-
tionally intensive, consisting of over
3.5 million billion floating point
operations and generating more than
two terabytes of archived infor-
mation.(This data was captured on 8
Ampex DST tape cartridges for
transport to LCSE for later analysis
and visualization)
|
Vorticity structures in a rotating model star with convection in equilibrium in its unstable outer half, PPM simulation by Porter, Anderson, Habermann, and Woodward on two 64-processor SGI Origins at NCSA, 1997. The simulation used two 64-processor systems interconnected by 100 megabyte per second HiPPI channels. NCSA plans to upgrade this system within the year and to upgrade the interconnection network to the 800 megabyte per second CrayLink. For this simulation Porter, Anderson, and Habermann developed a new version of the LCSE's PPM gas dynamics code that employs a novel strategy for overcoming latency in parallel applications. Latency refers to the time it takes to retrieve data from memory. In large multiprocessor computing systems, such as the Origin2000, each group of processors that shares a common memory is simultaneously advancing a different portion of the problem. This means of attacking the problem in parallel is how the computer attains speed and efficiency. Latency becomes an issue when the processor groups (there were two groups of 64 processors used at NCSA) must retrieve information from other processor groups in order for the calculations to continue. This communication between groups of processors introduces latency. The LCSE team's new PPM code prevents latency from slowing a computation because it overlaps the computation with the exchange of information among the other processor groups. Rather than eliminating latency, it hides it. Hiding latency becomes easier as the size of the problem increases because the amount of time spent computing increases faster than does the time spent exchanging data between processor groups. In large problems, such as solar convection, the time spent computing overwhelmingly exceeds the time spent exchanging data. Velocity structures in a rotating model star with convection in equilibrium in its unstable outer half, PPM simulation by Porter, Anderson, Habermann, and Woodward on two 64-processor SGI Origins at NCSA, 1997. |
|
|
The LCSE team plans to apply this same strategy for
latency tolerance to enable very widely separated
computing resources connected by the new NSF vBNS
network to cooperate on the solution of single,
tightly coupled, grand challenge applications like the
stellar convection problem.
Convection is one of two ways (thermal conduction being the other) that energy produced in thermonuclear reactions at the core of a star is transported to the surface. In the outer layers of stars like the sun, heat is transported by convection. Deep within a star, regions of gas heated by conduction become buoyant and rise toward the surface where the heat can be radiated directly into space. Gas cooled near the surface becomes denser and sinks. These rising and sinking flows of gas create complicated patterns of turbulence that affect the evolution of the star. For instance, the turbulent convection flow can redistribute angular momentum as well as heat, which causes gases to rotate more slowly near a star's poles than near its equator. Previous models of stellar convection were limited to small areas on the surface of the star, areas too small to help scientists understand how such differential rotation becomes established in a star. This simulation was the first attempt at a high- resolution model of an entire star to better visualize its complex dynamics.
|