hh.sePublikationer
Ändra sökning
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf
Models and Methods for Development of DSP Applications on Manycore Processors
Högskolan i Halmstad, Sektionen för Informationsvetenskap, Data– och Elektroteknik (IDE), Halmstad Embedded and Intelligent Systems Research (EIS), Centrum för forskning om inbyggda system (CERES).
2009 (Engelska)Doktorsavhandling, sammanläggning (Övrigt vetenskapligt)
Abstract [en]

Advanced digital signal processing systems require specialized high-performance embedded computer architectures. The term high-performance translates to large amounts of data and computations per time unit. The term embedded further implies requirements on physical size and power efficiency. Thus the requirements are of both functional and non-functional nature. This thesis addresses the development of high-performance digital signal processing systems relying on manycore technology. We propose building two-level hierarchical computer architectures for this domain of applications. Further, we outline a tool flow based on methods and analysis techniques for automated, multi-objective mapping of such applications on distributed memory manycore processors. In particular, the focus is put on how to provide a means for tunable strategies for mapping of task graphs on array structured distributed memory manycores, with respect to given application constraints. We argue for code mapping strategies based on predicted execution performance, which can be used in an auto-tuning feedback loop or to guide manual tuning directed by the programmer. Automated parallelization, optimisation and mapping to a manycore processor benefits from the use of a concurrent programming model as the starting point. Such a model allows the programmer to express different types and granularities of parallelism as well as computation characteristics of importance in the addressed class of applications. The programming model should also abstract away machine dependent hardware details. The analytical study of WCDMA baseband processing in radio base stations, presented in this thesis, suggests dataflow models as a good match to the characteristics of the application and as execution model abstracting computations on a manycore. Construction of portable tools further requires a manycore machine model and an intermediate representation. The models are needed in order to decouple algorithms, used to transform and map application software, from hardware. We propose a manycore machine model that captures common hardware resources, as well as resource dependent performance metrics for parallel computation and communication. Further, we have developed a multifunctional intermediate representation, which can be used as source for code generation and for dynamic execution analysis. Finally, we demonstrate how we can dynamically analyse execution using abstract interpretation on the intermediate representation. It is shown that the performance predictions can be used to accurately rank different mappings by best throughput or shortest end-to-end computation latency.

Ort, förlag, år, upplaga, sidor
Göteborg: Chalmers University of Technology , 2009. , s. 173
Serie
Doktorsavhandlingar vid Chalmers tekniska högskola. Ny serie, ISSN 0346-718X ; 2969
Nyckelord [en]
parallel processing, manycore processors, high-performance digital signal processing, dataflow, concurrent models of computation, parallel code mapping, parallel machine model, dynamic performance analysis
Nationell ämneskategori
Datorteknik
Identifikatorer
URN: urn:nbn:se:hh:diva-14706ISBN: 978-91-7385-288-3 OAI: oai:DiVA.org:hh-14706DiVA, id: diva2:408234
Disputation
2009-06-10, Wigforssalen, house Visionen, Halmstad University, Kristian IV:s väg 3, Halmstad, 13:15 (Engelska)
Opponent
Handledare
Tillgänglig från: 2011-04-20 Skapad: 2011-04-04 Senast uppdaterad: 2018-03-23Bibliografiskt granskad
Delarbeten
1.
Posten kunde inte hittas. Det kan bero på att posten inte längre är tillgänglig eller att du har råkat ange ett felaktigt id i adressfältet.
2. Baseband Processing in 3G UMTS Radio Base Stations
Öppna denna publikation i ny flik eller fönster >>Baseband Processing in 3G UMTS Radio Base Stations
2006 (Engelska)Rapport (Övrigt vetenskapligt)
Abstract [en]

This report presents a study of functionality, service dataflows, computation characteristics and processing parameters for baseband processing in radio base stations. The study has been performed with the objective to develop a programming model that is natural and efficient to use for baseband programming and which can be efficiently compiled to parallel computing structures. In order to achieve this objective it is necessary to analyse and understand the logical architecture of the application in order to be able to define processing characteristics and thereby requirements on languages as well as on physical system architectures. Moreover, to be able to test and verify programming and mapping of functions it is necessary to have realistic but still manageable test cases. The study is focused on the third generation partnership project (3GPP) standard specifications for 3G radio base stations. The specifications cover the complete 3G network-architecture and are quite extensive and complex. To make experiments manageable, it is necessary to abstract system functionality that is not directly relevant for the RBS baseband processing. Moreover, the standard specifications only describe the required processing functionality on an abstract logical level. In this report, the functionality of the baseband functions is explained and also described using illustrations of dataflows and abstract mapping of two 3G service cases. The results of the study constitute a comprehensive description of the processing flow and the mapping of user data channels in 3G radio base stations – spanning data and control input from layer 2 to physical channel output from layer 1. Data dependencies between functions are illustrated with figures and it is concluded that these dependencies are of producer/consumer type. It is discussed how different functions can be mapped in MIMD and SIMD fashion with regard to the data dependencies, the data stream lengths and the control operations required to handle bit stream processing on word-length processor architectures.

Ort, förlag, år, upplaga, sidor
Halmstad: Halmstad University, 2006
Serie
Technical Report ; IDE 0629
Nyckelord
Baseband processing, Radio base stations
Nationell ämneskategori
Teknik och teknologier
Identifikatorer
urn:nbn:se:hh:diva-2721 (URN)2082/3123 (Lokalt ID)2082/3123 (Arkivnummer)2082/3123 (OAI)
Tillgänglig från: 2009-07-06 Skapad: 2009-07-06 Senast uppdaterad: 2018-03-23Bibliografiskt granskad
3. A configurable framework for stream programming exploration in baseband applications
Öppna denna publikation i ny flik eller fönster >>A configurable framework for stream programming exploration in baseband applications
2006 (Engelska)Ingår i: 2006 IEEE International Parallel & Distributed Processing Symposium: Rhodes Island, Greece : 25-29 April, 2006, Piscataway, N.J.: IEEE Press, 2006, s. 8-Konferensbidrag, Publicerat paper (Refereegranskat)
Abstract [en]

This paper presents a configurable framework to be used for rapid prototyping of stream based languages. The framework is based on a set of design patterns defining the elementary structure of a domain specific language for high-performance signal processing. A stream language prototype for baseband processing has been implemented using the framework. We introduce language constructs to efficiently handle dynamic reconfiguration of distributed processing parameters. It is also demonstrated how new language specific primitive data types and operators can be used to efficiently and machine independently express computations on bitfields and data-parallel vectors. These types and operators yield code that is readable, compact and amenable to a stricter type checking than is common practice. They make it possible for a programmer to explicitly express parallelism to be exploited by a compiler. In short, they provide a programming style that is less error prone and has the potential to lead to more efficient implementations.

Ort, förlag, år, upplaga, sidor
Piscataway, N.J.: IEEE Press, 2006
Nyckelord
distributed processing, program compilers, software prototyping, telecommunication, computing, telecommunication signalling
Nationell ämneskategori
Teknik och teknologier
Identifikatorer
urn:nbn:se:hh:diva-2104 (URN)10.1109/IPDPS.2006.1639502 (DOI)2-s2.0-33847132885 (Scopus ID)2082/2499 (Lokalt ID)1-4244-0054-6 (ISBN)2082/2499 (Arkivnummer)2082/2499 (OAI)
Konferens
20th International Parallel and Distributed Processing Symposium, IPDPS 2006, Rhodes Island, Greece : 25-29 April, 2006
Anmärkning

©2006 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.

Tillgänglig från: 2008-11-04 Skapad: 2008-11-04 Senast uppdaterad: 2018-03-23Bibliografiskt granskad
4. A Domain-specific Approach for Software Development on Manycore Platforms
Öppna denna publikation i ny flik eller fönster >>A Domain-specific Approach for Software Development on Manycore Platforms
2008 (Engelska)Ingår i: SIGARCH Computer Architecture News, ISSN 0163-5964, E-ISSN 1943-5851, Vol. 36, nr 5, s. 2-10Artikel i tidskrift (Refereegranskat) Published
Abstract [en]

The programming complexity of increasingly parallel processors calls for new tools that assist programmers in utilising the parallel hardware resources. In this paper we present a set of models that we have developed as part of a tool for mapping dataflow graphs onto manycores. One of the models captures the essentials of manycores identified as suitable for signal processing, and which we use as target for our algorithms. As an intermediate representation we introduce timed configuration graphs, which describe the mapping of a model of an application onto a machine model. Moreover, we show how a timed configuration graph by very simple means can be evaluated using an abstract interpretation to obtain performance feedback. This information can be used by our tool and by the programmer in order to discover improved mappings.

Ort, förlag, år, upplaga, sidor
New York: ACM Press, 2008
Nyckelord
Programming, Manycores
Nationell ämneskategori
Datorteknik
Identifikatorer
urn:nbn:se:hh:diva-5990 (URN)10.1145/1556444.1556446 (DOI)
Konferens
Association for Computing Machinery Special Interest Group on Computer Architecture
Tillgänglig från: 2010-09-23 Skapad: 2010-09-23 Senast uppdaterad: 2018-03-23Bibliografiskt granskad
5. Manycore performance analysis using timed configuration graphs
Öppna denna publikation i ny flik eller fönster >>Manycore performance analysis using timed configuration graphs
2009 (Engelska)Ingår i: International Symposium on Systems, Architectures, Modeling, and Simulation, 2009. SAMOS '09 / [ed] Michael Joseph Schulte and Walid Najjar, Piscataway, N.J.: IEEE Press, 2009, s. 108-117Konferensbidrag, Publicerat paper (Refereegranskat)
Abstract [en]

The programming complexity of increasingly parallel processors calls for new tools to assist programmers in utilising the parallel hardware resources. In this paper we present a set of models that we have developed to form part of a tool which is intended for iteratively tuning the mapping of dataflow graphs onto manycores. One of the models is used for capturing the essentials of manycores that are identified as suitable for signal processing and which we use as target architectures. Another model is the intermediate representation in the form of a timed configuration graph, describing the mapping of a dataflow graph onto a machine model. Moreover, this IR can be used for performance evaluation using abstract interpretation. We demonstrate how the models can be configured and applied in order to map applications on the Raw processor. Furthermore, we report promising results on the accuracy of performance predictions produced by our tool. It is also demonstrated that the tool can be used to rank different mappings with respect to optimisation on throughput and end-to-end latency.

Ort, förlag, år, upplaga, sidor
Piscataway, N.J.: IEEE Press, 2009
Nyckelord
graphs, microcomputers, parallel architectures, parallel programming, program compilers, software performance evaluation, task analysis
Nationell ämneskategori
Datorteknik
Identifikatorer
urn:nbn:se:hh:diva-5987 (URN)10.1109/ICSAMOS.2009.5289221 (DOI)000276377000014 ()2-s2.0-71949094275 (Scopus ID)978-1-4244-4502-8 (ISBN)
Konferens
2009 International Conference on Embedded Computer Systems: Architectures, Modeling and Simulation, IC-SAMOS 2009, Samos, 20 - 23 July, 2009
Anmärkning

©2009 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.

Tillgänglig från: 2010-09-23 Skapad: 2010-09-23 Senast uppdaterad: 2018-03-23Bibliografiskt granskad

Open Access i DiVA

fulltext(1983 kB)773 nedladdningar
Filinformation
Filnamn FULLTEXT01.pdfFilstorlek 1983 kBChecksumma SHA-512
19999f7cb7bdbfee9dcf0822f87977f42012707690251c80e6af261fded0ebadc58fe91fede155803870fa88f3ec265c2c63adaa032fe1644f8e9baee5ff51f0
Typ fulltextMimetyp application/pdf

Övriga länkar

Fulltext

Personposter BETA

Bengtsson, Jerker

Sök vidare i DiVA

Av författaren/redaktören
Bengtsson, Jerker
Av organisationen
Centrum för forskning om inbyggda system (CERES)
Datorteknik

Sök vidare utanför DiVA

GoogleGoogle Scholar
Totalt: 773 nedladdningar
Antalet nedladdningar är summan av nedladdningar för alla fulltexter. Det kan inkludera t.ex tidigare versioner som nu inte längre är tillgängliga.

isbn
urn-nbn

Altmetricpoäng

isbn
urn-nbn
Totalt: 483 träffar
RefereraExporteraLänk till posten
Permanent länk

Direktlänk
Referera
Referensformat
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Annat format
Fler format
Språk
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Annat språk
Fler språk
Utmatningsformat
  • html
  • text
  • asciidoc
  • rtf