ClearSpeed Updates Advance Product Family, Providing Energy-Efficient Accelerators that Deliver 20 Times the Performance per Watt When Compared with Industry-Standard Servers.Updated Product Line Includes Performance and Functional Enhancements for CSXL Libraries, the Advance[TM] e620 PCIe Accelerator accelerator: see particle accelerator. (1) A key combination such as Alt-G or Ctrl-Shift H that is used to activate a task. (2) An incubator that expects to develop the company considerably faster than normal. See incubator. and the ClearSpeed Visual Profiler Toolset BRISTOL, United Kingdom -- ClearSpeed Technology (LSE LSE - Language Sensitive Editor :CSD CSD Commission on Sustainable Development CSD Serbian Dinar (ISO currency code) CSD Christopher Street Day CSD Circuit Switched Data (Sprint) CSD Computer Science Department CSD Community School District ), the world leader in acceleration technology for high performance computing computing - computer (HPC (Handheld PC) A palmtop computer that weighs less than one pound and runs specialized versions of popular applications. Microsoft coined the term for its Windows CE operating system, which is an abbreviated version of Windows. See Pocket PC. ), today announced new software and hardware enhancements to its Advance[TM] product family. The new offerings include performance and functionality enhancements to ClearSpeed CSXL software libraries, the Advance e620 PCI Express A high-speed peripheral interconnect from Intel introduced in 2002. Note that although sometimes abbreviated "PCX," PCI Express is not the same as "PCI-X" (see PCI-SIG and PCI-X for comparison). As a result of the confusion, "PCI-E" or "PCIe" is the accepted abbreviation. (PCIe) accelerator and the ClearSpeed Visual Profiler. Benchmarks using these enhanced CSXL libraries consolidate ClearSpeed's leadership in energy efficiency by delivering 20 times the performance per watt watt [for James Watt], abbr. W, unit of power, or work done per unit time, equal to 1 joule per second. It is used as a measure of electrical and mechanical power. compared with industry standard servers when running the high performance LINPACK Benchmark1. The new 2.50 release of ClearSpeed's CSXL acceleration libraries introduces native support for Microsoft Windows See Windows. (operating system) Microsoft Windows - Microsoft's proprietary window system and user interface software released in 1985 to run on top of MS-DOS. Widely criticised for being too slow (hence "Windoze", "Microsloth Windows") on the machines available then. and simplifies deployment with documentation updates and End User License Agreements. It provides a number of performance enhancements to the core linear algebra linear algebra Branch of algebra concerned with methods of solving systems of linear equations; more generally, the mathematics of linear transformations and vector spaces. routines for matrix multiplication Noun 1. matrix multiplication - the multiplication of matrices matrix operation - a mathematical operation involving matrices . Also included in the 2.50 release are the new ClearSpeed Vector Math Library and ClearSpeed Random Number Generators Computer random number generators are important in mathematics, cryptography and gambling. This list includes all common types, regardless of quality. Pseudorandom number generators (PRNGs) The following algorithms are pseudorandom number generators:
A problem solving technique used to approximate the probability of certain outcomes by running multiple trial runs, called simulations, using random variables. for option pricing in the financial services The examples and perspective in this article or section may not represent a worldwide view of the subject. Please [ improve this article] or discuss the issue on the talk page. industry. Performance comparisons based on benchmark code for European Option European Option An option that can only be exercised at the end of its life. Notes: In other words, you must ride the rollercoaster until the maturity date, and only then can you cash in. pricing provided by a major international bank showed up to 20 times performance speedup using a ClearSpeed Advance accelerator compared with an industry server2. The use of multiple Advance accelerators in the system delivered up to 100 times performance speedup. For scientific applications such as molecular modeling, recent results have demonstrated real-world application acceleration of between 3.4 to 9.4 times the speedup with AMBER amber, fossilized tree resin. Amber can vary in color from yellow to red to green and blue. The best commercial amber is transparent, but some varieties are cloudy. modules and 4.5 times the speedup with the Bristol University Docking Engine (BUDE) program3. On April 27 Cambridge Healthtech Institute's Bio-IT World announced that ClearSpeed Technology was one of three Best of Show finalists for the Information Technology Infrastructure category. Executive Editor of Bio-IT World John Russell John Russell may refer to:
"Large consumers of compute To perform mathematical operations or general computer processing. For an explanation of "The 3 C's," or how the computer processes data, see computer. power are looking for Looking for In the context of general equities, this describing a buy interest in which a dealer is asked to offer stock, often involving a capital commitment. Antithesis of in touch with. ways to improve both their system performance and performance per watt," said Steve Conway, research vice president of technical computing systems at IDC. "There is strong and increasing interest in acceleration technologies that could deliver improved performance without exceeding power, cooling and facilities constraints CONSTRAINTS - A language for solving constraints using value inference. ["CONSTRAINTS: A Language for Expressing Almost-Hierarchical Descriptions", G.J. Sussman et al, Artif Intell 14(1):1-39 (Aug 1980)]. . ClearSpeed's acceleration technology is making advances in this area." Building on the success of ClearSpeed's current PCI-X-based Advance X620 accelerator, the introduction of the complementary and smaller form factor PCIe-based Advance e620 accelerator brings all the benefits of ClearSpeed's acceleration technology to the latest generation of multi-core industry standard servers that incorporate the PCIe standard. Together the existing Advance X620 and the Advance e620 significantly increase the number of server platforms that can take advantage of ClearSpeed acceleration. For developers, the new ClearSpeed Visual Profiler toolset provides that insight at every level of the system, including the interactions between multiple host processors and one or more ClearSpeed Advance accelerator boards An add-in board that replaces the existing CPU with a higher performance CPU. See graphics accelerator. . By delivering a consistent visual representation across the entire system, it provides the best possible environment in which to develop code that will perform optimally in today's multi-core and heterogeneous accelerated systems. "The world's leading financial institutions and research organizations that depend upon the availability of compute power to maintain their competitive edge are struggling with the constraints of facilities space, power and cooling," said Stephen McKinnon, ClearSpeed's chief operating officer Chief Operating Officer (COO) The officer of a firm responsible for day-to-day management, usually the president or an executive vice-president. . "The enhancements to our product family are delivering three, five or even twenty times the application performance of unaccelerated systems, while adding less than five percent to the total energy bill. Acceleration technology is causing a radical rethink re·think tr. & intr.v. re·thought , re·think·ing, re·thinks To reconsider (something) or to involve oneself in reconsideration. re of datacenter design."
Notes to Editors on Performance Results
(1) LINPACK performance and performance per watt results
Comparative results
Accelerated cluster: 218.9% performance of standard system
Accelerated cluster: 53.6% less energy per job
Accelerated cluster: 5.3% more power (peak)
Accelerated cluster: 1.6% more power (average)
Standard node: 0.07 GFLOPS per watt
Accelerated node: 0.14 GFLOPS per watt, 2x energy efficiency of
standard node
ClearSpeed X620: 1.37 GFLOPS per watt, 20x energy efficiency of
standard node
ClearSpeed "Top Up" 4.95 GFLOPS per watt, 70x energy efficiency of
standard node
ClearSpeed "Top Up" is defined as the additional performance delivered
for the additional average power consumption when compared with an
unaccelerated system.
Measured benchmark results
Standard Cluster: 114.8 GFLOPS, 40.8 minutes runtime
Power: 1900w peak, 1722w average, Energy: 0.29kWhr, 0.07 GFLOPS/w
ClearSpeed Accelerated Cluster: 251.3 GFLOPS, 18.7 minutes runtime
Power: 2000w peak, 1750w average, Energy: 0.14kWhr, 0.14 GFLOPS/w
Standard Node
Node: 28.7 GFLOPS, 431w, 0.07 GFLOPS/w - base energy efficiency
ClearSpeed Accelerated Node
Node: 62.8 GFLOPS, 438w, 0.14 GFLOPS/w - 2x base energy efficiency
ClearSpeed Advance X620 accelerator
X620: 34.1 GFLOPS, 25w, 1.37 GFLOPS/w - 20x base energy efficiency
ClearSpeed "Top Up" additional performance for additional power
X620: 34.1 GFLOPS, 6.9w, 4.95 GFLOPS/w - 70x base energy efficiency
The LINPACK Benchmark was introduced by Jack Dongarra. It is used to
solve a dense system of linear equations. For the Top500, a version of
the benchmark is used that allows the user to scale the size of the
problem and to optimize the software in order to achieve the best
performance for a given machine. This performance does not reflect the
overall performance of a given system, as no single number ever can.
It does, however, reflect the performance of a dedicated system for
solving a dense system of linear equations. Since the problem is very
regular, the performance achieved is quite high, and the performance
numbers give a good correction of peak performance. A parallel
implementation of the LINPACK Benchmark and instructions on how to run
it can be found at http://www.netlib.org/benchmark/hpl/.
System specifications
Base system: HP DL380 G5, CPU: Intel Xeon 5160 (Woodcrest) x 2 @ 3GHz
Memory: 14GB, Operating System: RedHat EL4 64
ClearSpeed Acceleration: Advance X620, CSXL 2.24, BLAS: Intel MKL
8.1.1
LINPACK parameters: Host assist: 25%, HPL.dat: N: 75000, NB: 1152
Standard cluster: 4 nodes, 0 ClearSpeed Advance X620
ClearSpeed accelerated cluster 4 nodes, 4 ClearSpeed Advance X620
accelerator boards
(2) Monte Carlo Simulation
Statistical methods such as Monte Carlo simulation are used by
financial institutions to derive future prices of complex option
models that cannot be easily modeled by algorithmic approaches such as
the Black-Scholes model. ClearSpeed chose to demonstrate Monte Carlo
simulation for European options so that both the acceleration could be
demonstrated as well as the accuracy of the result when compared with
the Black-Scholes method. The benchmark code was supplied by a well
known global banking organization.
Monte Carlo simulation for European option pricing.
1 CPU, no acceleration: 400M samples, 60 seconds, Speedup 1x
1 Advance board: 400M samples, 2.9 seconds, Speedup 20x
2 Advance boards: 400M samples, 1.5 seconds, Speedup 40x
4 Advance boards: 400M samples, 0.8 seconds, Speedup 79x
System specifications
Base System: Dell 2880, CPU: 2 x 3.0GHz Xeon , Memory: 3 GB
ClearSpeed Acceleration: 1 to 4 ClearSpeed Advance X620
Host Compiler: gcc, libraries: Randc, random number generator: C
Gaussian
ClearSpeed Advance X620 Libraries: CS VML & CS RNG
(3) AMBER and Bristol University Docking Engine (BUDE) Performance
Results
AMBER
To demonstrate application level performance of accelerated systems we
have modified a set of Amber 9 methods to take advantage of
ClearSpeed's Advance(TM) accelerator board. This includes the
effective radius and force calculation of AMBER's Generalized Born
(GB) models, 1, 2, and 6. Supported options include constant pH7 and
analytical linearized Poission Boltzmann (ALPB) as well as options
that do not directly change the force calculation, including NMR
restraints.
While the genborn module of Amber is a small part of the sander
executable, it typically amounts for 95-97% of the CPU compute time
for GB simulations. The CPU compute time is mainly spent in three
loops: effective radii calculations, diagonal and off-diagonal force
calculations.
The overall structure of the code was maintained. A thin layer written
in C, using ClearSpeed's CSAPI library, was added to handle the
communication between the host and board.
Host Advance X620 Speedup
Generalized born 1 Minutes 83.5 24.6 3.39
Generalized born 2 Minutes 84.6 23.5 3.60
Generalized born 6 Minutes 37.9 4.0 9.35
Host: 2.8GHz Pentium 4 EMT64, OS: RHEL4-64, CSXL: version 2.50
Bristol University Docking Engine (BUDE)
1 host CPU, no acceleration: 48.2 seconds, Speedup 1.0x
1 Advance board: 10.6 seconds, Speedup 4.5x
2 Advance boards: 5.8 seconds, Speedup 8.3x
3 Advance boards: 4.4 seconds, Speedup 11.0x
Host: 2 x 2.8 GHz Xeon, OS RHEL4-64 ,CSXL version 2.24
About ClearSpeed ClearSpeed Technology is a semiconductor company that develops massively parallel See MPP. coprocessors, accelerator boards and software that deliver unmatched performance per watt for high performance computing applications in financial services, universities and national labs. ClearSpeed has offices in San Jose, California San Jose (IPA: /ˌsænhoʊˈzeɪ/) is the third-largest city in California, and the tenth-largest in the United States. It is the county seat of Santa Clara County. , and Bristol, UK and has 84 patents granted and pending. For more information, visit www.clearspeed.com. |
|
||||||||||||||

Printer friendly
Cite/link
Email
Feedback
Reader Opinion