Printer Friendly
The Free Library
14,495,914 articles and books
Member login
User name  
Password 
 
Join us Forgot password?

Stream Processors, Inc. Announces Breakthrough Digital Signal Processor Architecture at ISSCC 2007.


Building On More Than Eight Years of Research at Stanford and MIT MIT - Massachusetts Institute of Technology , Startup Reveals a New Class of DSPs That Makes Parallel Processing Simple

SAN FRANCISCO -- Emerging from two years of commercial development and more than eight years of university research, Stream Processors, Inc. (SPI (1) (Stateful Packet Inspection) See stateful inspection.

(2) (Service Provider Interface) The programming interface for developing Windows drivers under WOSA.
) today unveiled a breakthrough digital signal processor A digital signal processor (DSP) is a specialized microprocessor designed specifically for digital signal processing, generally in real-time computing. Characteristics of typical Digital Signal Processors
  • Designed for real-time processing
 (DSP (1) (Digital Signal Processor) A special-purpose CPU used for digital signal processing applications (see definition #2 below). It provides ultra-fast instruction sequences, such as shift and add, and multiply and add, which are commonly used in math-intensive ) architecture that removes the barriers to programming high-performance, massively parallel processors.

Detailed in a paper, "A 512 GOPS (Giga [billion] Operations Per Second) The measurement of instructional performance of a chip or system. It typically refers to DSP operations. See MOPS.  Stream Processor for Signal, Image and Video Processing," being presented at this year's International Solid State Circuits Conference, SPI's Stream Processor[TM] Architecture combines unmatched levels of DSP performance with a simple and efficient C programming model. The approach has resulted in the development of the industry's highest-performance family of DSPs, capable of delivering greater than an order of magnitude A change in quantity or volume as measured by the decimal point. For example, from tens to hundreds is one order of magnitude. Tens to thousands is two orders of magnitude; tens to millions is three orders of magnitude, etc.  (more than 10 times) higher performance than current commercially available DSP solutions.

To place this level of processing performance in context, a single fully software-programmable SPI Stream Processor is capable of encoding H.264 high-definition 1080p video in real-time with enough processing power to perform customer-specific video enhancements, image tuning, and video content analysis. Achieving that level of performance using traditional DSPs could require as many as 15 chips, significantly increasing engineering effort, development time and overall project risk.

Making Parallelism Work

Once confined to the realm of supercomputers, the concept of parallel or multi-core processing - using more than one central processing unit See CPU.

(architecture, processor) central processing unit - (CPU, processor) The part of a computer which controls all the other parts. Designs vary widely but the CPU generally consists of the control unit, the arithmetic and logic unit (ALU), registers, temporary buffers
 (CPU CPU
 in full central processing unit

Principal component of a digital computer, composed of a control unit, an instruction-decoding unit, and an arithmetic-logic unit.
) or processor core to increase computation speed - has long been seen as a way to achieve higher levels of performance. Recently, multi-core solutions such as the AMD (Advanced Micro Devices, Inc., Sunnyvale, CA, www.amd.com) A major manufacturer of semiconductor devices including x86-compatible CPUs, embedded processors, flash memories, programmable logic devices and networking chips. ([R])dual-core Opteron[TM] and the Intel([R])Core[TM]2 Duo have been shown to be effective when running large independent tasks at the operating system level. However, multi-core architectures have not been successful at accelerating individual embedded DSP applications.

"The key problem has been that writing software to take full advantage of the increased processing power offered by parallelism has always been time-consuming and difficult," said Will Strauss, president of the market research firm Forward Concepts. "While Intel and AMD have started to solve this problem in the personal computing and server markets with multi-core processors, the problem remains in embedded markets. These markets require an energy-efficient, programmable digital signal processor with the computational capacity of tens to hundreds of cores applied to individual tasks. By re-thinking the roles of the architecture, programming model and compiler tools, SPI has created a new class of DSPs that makes parallel processing practical."

Prof. Bill Dally, co-founder, chairman and chief science officer for SPI added, "When we began our research 12 years ago, we quickly realized that traditional architectures were running out of steam. A new approach was needed. Simply putting more cores on a chip doesn't address the real issues of bandwidth, data locality, and ease of programming. Today's demanding embedded applications like H.264 HD encoding and analytics, image processing, video surveillance, wireless communication, search, and encryption, all benefit from the performance gain and programming simplicity offered by SPI's Stream Processor Architecture."

The Stream Processor Architecture

At the heart of SPI's Stream Processor Architecture (Figure 1) is a high-performance data-parallel unit (DPU DPU Data Processing Unit
DPU DePauw University (Indiana, USA)
DPU Democratic Pacific Union (Taiwan)
DPU DePaul University
DPU Defects Per Unit
DPU Digital Processing Unit
), which is able to sustain hundreds of billions of operations per second (GOPS). Two industry-standard CPU cores are included to support the DPU: a system CPU runs Linux and handles I/O (Input/Output) The transfer of data between the CPU and a peripheral device. Every transfer is an output from one device and an input to another. See PC input/output.

I/O - Input/Output
; another core runs main DSP threads and offloads processing of compute-intensive kernel functions to the DPU.

A key feature of the architecture is its compiler-managed memory hierarchy that leverages the data-parallelism and locality characteristics of signal processing applications. A simple C programming model allows specification of compute-intensive kernel functions that process streams of data records, enabling the compiler and hardware to efficiently manage on-chip memory and synchronize runtime direct-memory access (DMA (1) (Digital Media Adapter) See digital media hub.

(2) (Document Management Alliance) A specification that provides a common interface for accessing and searching document databases.
). This approach eliminates the need for a cache and greatly increases predictability of throughput, simplifying the overall programming task.

The architecture exploits multiple levels of parallelism:

* task-level parallelism between the system processor, DSP processor and DPU

* data-level parallelism (DLP (Digital Light Processing) A data projection technology from TI that produces clear, readable images on screens in lit rooms. DLP is used in all types of projection devices, from data projectors that weigh only a few pounds to large rear-projection TVs to electronic ) with multiple lanes executing the same instructions on different data in parallel

* instruction-level parallelism (ILP) via very long instruction word (VLIW (Very Long Instruction Word) A CPU architecture that reads a group of instructions and executes them at the same time. For example, the group (word) might contain four instructions, and the compiler ensures that those four instructions are not dependent on each ) driving multiple arithmetic logic units (ALUs) per lane

* sub-word single instruction multiple data (SIMD (Single Instruction stream Multiple Data stream) A computer that performs one operation on multiple sets of data. It is typically used to add or multiply eight or more sets of numbers at the same time for multimedia encoding and rendering as well as scientific ) in which each ALU (Arithmetic Logic Unit) The high-speed CPU circuit that does calculating and comparing. Numbers are transferred from memory into the ALU for calculation, and the results are sent back into memory. Alphanumeric data are sent from memory into the ALU for comparing.  can operate on multiple operands

On the DPU, a kernel function runs identically on every lane processing different data. Built-in support for conditionals and high-speed inter-lane communications provides more versatility than conventional SIMD architectures. The single-threaded execution model provides inherent load-balancing, eliminating the need for code partitioning across multiple cores. Another advantage to SPI's architecture is the ability to easily scale to higher levels of performance by adding more lanes without the need to restructure software.

Development Tools

SPI's RapiDev[TM] tool suite supports a standard development and debug flow using C language tools running on a Windows or Linux platform. RapiDev leverages the predictability of SPI's Stream Processor Architecture to provide a linear path to performance-optimized code. The tools suite enables application source code compatibility across devices with different numbers of lanes and ALUs, providing greater scalability and portability.

About Stream Processors, Inc.

Stream Processors, Inc. (SPI) is a privately held fabless semiconductor company A fabless semiconductor company specializes in the design and sale of hardware devices implemented on semiconductor chips. It achieves an advantage by outsourcing the fabrication of the devices to a specialized semiconductor manufacturer called a semiconductor foundry or "fab.  delivering an innovative stream processing architecture that helps consumer and industrial companies accelerate product development cycles and dramatically reduce system development costs. SPI was founded in 2004 to address the new era of compute-intensive applications requiring radically increased levels of processor performance and power efficiency. The company's technology and products improve application productivity by making parallel processing easier to program and use. Additional information can be found at http://www.streamprocessors.com/.

[c] Copyright Stream Processors, Inc. 2007. All rights reserved.

Stream Processors, Inc., SPI Stream Processing Architecture, RapiDev, and SPI are trademarks of Stream Processors, Inc.

Other product and company names mentioned herein may be trademarks of their respective owners.
COPYRIGHT 2007 Business Wire
No portion of this article can be reproduced without the express written permission from the copyright holder.
Copyright 2007, Gale Group. All rights reserved. Gale Group is a Thomson Corporation Company.

 Reader Opinion

Title:

Comment:



 

Article Details
Printer friendly Cite/link Email Feedback
Publication:Business Wire
Date:Feb 12, 2007
Words:972
Previous Article:VERSAR Reports Strong Revenue and Financial Performance for the Second Quarter of Fiscal Year 2007.(Financial report)
Next Article:BitDefender Unveils Antivirus Solution for Smartphones and PDAs.
Topics:



Related Articles
Adaptive Solutions announces joint relationship with Motorola on VECOMP technology.
Analog devices announces availability of network-enabled Blackfin processors.
Tensilica Announces Diamond Standard 330HiFi, a Low-Power 24-bit Audio DSP; Twenty Audio Application Software Solutions Available for New Diamond...
Analog Devices Vice President to Deliver ISSCC Keynote Address.
Stream Processors, Inc. Announces Storm-1 Family of Data-Parallel Digital Signal Processors.
Stream Processors, Inc. Announces Strategy to Enable Burgeoning Video Surveillance Market.
Stream Processors, Inc. Sets Price-Performance Standard With Digital Signal Processor for Emerging Video Applications.
Stream Processors, Inc. Named a Finalist For Red Herring 100.
Stream Processors, Inc. to Detail New 112 GMACS C-Programmable DSP at In-Stat Microprocessor Forum 2007.
Stream Processors Inc. Delivers Industry's Highest-Performing Digital Signal Processor.

Terms of use | Copyright © 2009 Farlex, Inc. | Feedback | For webmasters | Submit articles