Programming of Coarse-Grained Reconfigurable Architectures
Ul-Abdin, Zain
Halmstad University, School of Information Technology, Halmstad Embedded and Intelligent Systems Research (EIS), Centre for Research on Embedded Systems (CERES). ORCID iD: 0000-0002-4932-4036
2011 (English). Doctoral thesis, comprehensive summary (Other academic)
Abstract [en]

Coarse-grained reconfigurable architectures, which offer massive parallelism coupled with the capability of undergoing run-time reconfiguration, are gaining attention as a way not only to meet the increased computational demands of high-performance embedded systems but also to fulfill the need for adaptability to the functional requirements of the application. This thesis focuses on the programming aspects of such coarse-grained reconfigurable computing devices, including the relevant computation models that are capable of exposing different kinds of parallelism inherent in the application and the ability of these models to capture the adaptability requirements of the application. The thesis suggests the occam-pi language for programming of a broad class of coarse-grained reconfigurable architectures as an intermediate language; we call it intermediate, since we believe that the application programming is best done in a high-level domain-specific language. The salient properties of the occam-pi language are explicit concurrency with built-in mechanisms for interprocessor communication, provision for expressing dynamic parallelism, support for the expression of dynamic reconfigurations, and placement attributes. To evaluate the programming approach, a compiler framework was extended to support the language extensions in the occam-pi language, and backends were developed to target two different coarse-grained reconfigurable architectures, XPP and Ambric. The results on XPP reveal that the occam-pi based implementations produce comparable throughput to those of NML programs, while programming at a much higher level of abstraction than that of NML. Similarly, the two occam-pi implementations of autofocus criterion calculation targeted to the Ambric platform outperform the CPU implementation by factors of 11-23.
Thus, the results of the implemented case-studies suggest that the occam-pi language based approach simplifies the development of applications employing run-time reconfigurable devices without compromising the performance benefits.

Place, publisher, year, edition, pages
Örebro: Örebro universitet, 2011.
Series
Örebro Studies in Technology, ISSN 1650-8580 ; 48
National subject category
Computer Science
Identifiers
URN: urn:nbn:se:hh:diva-15050; OAI: oai:DiVA.org:hh-15050; DiVA, id: diva2:415735
Public defence
2011-05-26, Wigforssalen, Halmstad University, Halmstad, 10:15 (English)
Opponent
Supervisors
Available from: 2011-05-09. Created: 2011-04-28. Last updated: 2018-01-12. Bibliographically approved
List of papers
1. Evolution in architectures and programming methodologies of coarse-grained reconfigurable computing
2009 (English). In: Microprocessors and microsystems, ISSN 0141-9331, E-ISSN 1872-9436, Vol. 33, no. 3, p. 161-178. Article in journal (Refereed). Published
Abstract [en]

In order to meet the increased computational demands of, e.g., multimedia applications, such as video processing in HDTV, and communication applications, such as baseband processing in telecommunication systems, the architectures of reconfigurable devices have evolved to coarse-grained compositions of functional units or program-controlled processors, which are operated in a coordinated manner to improve performance and energy efficiency. In this survey we explore the field of coarse-grained reconfigurable computing on the basis of the hardware aspects of granularity, reconfigurability, and interconnection networks, and discuss the effects of these on energy-related properties and scalability. We also consider the computation models that are being adopted for programming of such machines, models that expose the parallelism inherent in the application in order to achieve better performance. We classify the coarse-grained reconfigurable architectures into four categories and present some of the existing examples of these categories. Finally, we identify the emerging trends of the introduction of asynchronous techniques at the architectural level and the use of nano-electronics from a technological perspective in the reconfigurable computing discipline.

Place, publisher, year, edition, pages
Elsevier, 2009
Keywords
Reconfigurable architectures, Coarse-grained arrays, Computation models, Globally-asynchronous locally-synchronous
National subject category
Computer Engineering
Identifiers
urn:nbn:se:hh:diva-81 (URN); 10.1016/j.micpro.2008.10.003 (DOI); 000266230500001 (ISI); 2-s2.0-66349084712 (Scopus ID)
Project
Embedded Parallel Computing
Available from: 2009-09-18. Created: 2009-09-16. Last updated: 2018-01-13. Bibliographically approved
2. Using a CSP based programming model for reconfigurable processor arrays
2008 (English). In: International Conference on Reconfigurable Computing and FPGAs, 2008. ReConFig '08, Los Alamitos, California: IEEE Computer Society, 2008, p. 343-348. Conference paper, Published paper (Refereed)
Abstract [en]

The growing trend towards adoption of flexible and heterogeneous, parallel computing architectures has increased the challenges faced by the programming community. We propose a method to program an emerging class of reconfigurable processor arrays by using the CSP based programming model of occam-pi. The paper describes the extension of an existing compiler platform to target such architectures. To evaluate the performance of the generated code, we present three implementations of the DCT algorithm. It is concluded that CSP appears to be a suitable computation model for programming a wide variety of reconfigurable architectures.

Place, publisher, year, edition, pages
Los Alamitos, California: IEEE Computer Society, 2008
Keywords
CSP, Programming Models, Coarse-grained Reconfigurable Architectures
National subject category
Computer Engineering
Identifiers
urn:nbn:se:hh:diva-83 (URN); 10.1109/ReConFig.2008.41 (DOI); 2-s2.0-62349104086 (Scopus ID); 978-0-7695-3474-9 (ISBN)
Conference
2008 International Conference on Reconfigurable Computing and FPGAs, ReConFig 2008, 3-5 December 2008, Cancun, Mexico
Note

©2008 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.

Available from: 2009-09-18. Created: 2009-09-16. Last updated: 2018-01-13. Bibliographically approved
3. Programming Real-time Autofocus on a Massively Parallel Reconfigurable Architecture using Occam-pi
2011 (English). In: Proceedings of the 19th Annual IEEE International Symposium on Field-Programmable Custom Computing Machines (FCCM'2011), Los Alamitos, Calif.: IEEE Computer Society, 2011, p. 194-201. Conference paper, Published paper (Refereed)
Abstract [en]

Recently we proposed occam-pi as a high-level language for programming massively parallel reconfigurable architectures. The design of occam-pi incorporates ideas from CSP and pi-calculus to facilitate expressing parallelism and reconfigurability. The feasibility of this approach was illustrated by building three occam-pi implementations of DCT executing on an Ambric. However, because DCT is a simple and well-studied algorithm, it remained uncertain whether occam-pi would also be effective for programming novel, more complex algorithms.

In this paper, we demonstrate the applicability of occam-pi for expressing various degrees of parallelism by implementing a significantly large case study of focus criterion calculation in an autofocus algorithm on the Ambric architecture. Autofocus is a key component of synthetic aperture radar systems. Two implementations of focus criterion calculation were developed and evaluated on the basis of performance. A comparison of the performance results with a single-threaded software implementation of the same algorithm shows that the throughput of the two implementations is 11x and 23x higher than that of the sequential implementation, despite a much lower (9x) clock frequency. The two designs are, respectively, 29x and 40x more energy efficient.

Place, publisher, year, edition, pages
Los Alamitos, Calif.: IEEE Computer Society, 2011
Series
Annual IEEE Symposium on Field-Programmable Custom Computing Machines
National subject category
Computer Engineering
Identifiers
urn:nbn:se:hh:diva-14616 (URN); 10.1109/FCCM.2011.20 (DOI); 000298664800034 (ISI); 2-s2.0-79958730167 (Scopus ID); 978-1-61284-277-6 (ISBN); 978-0-7695-4301-7 (ISBN)
Conference
IEEE 19th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM'2011), Campus Univ Utah, Salt Lake City, UT, May 01-03, 2011
Project
SMECY
Funder
EU, FP7, Seventh Framework Programme, 100230
Note

©2011 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.

Available from: 2011-03-22. Created: 2011-03-22. Last updated: 2018-03-23. Bibliographically approved
4. Specifying Run-time Reconfiguration in Processor Arrays using High-level language
2010 (English). In: WRC 2010: 4th HiPEAC Workshop on Reconfigurable Computing, Pisa, 2010, p. 1-10. Conference paper, Published paper (Refereed)
Abstract [en]

The adoption of run-time reconfigurable parallel architectures for high-performance embedded systems is constrained by the lack of a unified programming model which can express both parallelism and reconfigurability. We propose to program an emerging class of reconfigurable processor arrays by using the programming model of occam-pi and describe how the extensions of channel direction specifiers, mobile data, dynamic process invocation, and process placement attributes can be used to express run-time reconfiguration in occam-pi. We present implementations of the DCT algorithm to demonstrate the applicability of occam-pi to express reconfigurability. We conclude that occam-pi appears to be a suitable programming model for programming run-time reconfigurable processor arrays.

Place, publisher, year, edition, pages
Pisa, 2010
National subject category
Computer Science
Identifiers
urn:nbn:se:hh:diva-5442 (URN)
Conference
HiPEAC Workshop on Reconfigurable Computing
Available from: 2010-09-27. Created: 2010-08-26. Last updated: 2018-01-12. Bibliographically approved
5. A Study of Design Efficiency with a High-Level Language for FPGAs
2007 (English). In: Proceedings - 21st International Parallel and Distributed Processing Symposium, IPDPS 2007, Abstracts and CD-ROM, Piscataway, N.J.: IEEE Press, 2007, p. 1-7. Conference paper, Published paper (Refereed)
Abstract [en]

Over the years, reconfigurable computing devices such as FPGAs have evolved from gate-level glue logic to complex reprogrammable processing architectures. However, the tools used for mapping computations to such architectures still require knowledge of the architectural details of the target device to extract efficiency. A study of the Mobius language and tools is presented in this paper, with a focus on generated hardware performance. A number of streaming and memory-intensive applications have been developed and the results have been compared with the corresponding implementations in VHDL and a behavioral hardware description language. Based upon experimental evidence, it is concluded that Mobius, a minimal parallel processing language targeted for reconfigurable architectures, enhances productivity in terms of design time and code maintainability without considerably compromising performance and resources.

Place, publisher, year, edition, pages
Piscataway, N.J.: IEEE Press, 2007
Keywords
FPGA, Mobius language, VHDL, behavioral hardware description language, high-level language, minimal parallel processing language, reconfigurable computing device, reprogrammable processing architecture
National subject category
Embedded Systems
Identifiers
urn:nbn:se:hh:diva-1968 (URN); 10.1109/IPDPS.2007.370394 (DOI); 2-s2.0-34548787179 (Scopus ID); 2082/2363 (Local ID); 1-4244-0910-1 (ISBN); 2082/2363 (Archive number); 2082/2363 (OAI)
Conference
21st International Parallel and Distributed Processing Symposium, IPDPS 2007, Long Beach, CA, 26 - 30 March, 2007
Note

©2007 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.

Available from: 2008-09-26. Created: 2008-09-26. Last updated: 2020-05-15. Bibliographically approved
6. Occam-pi as a High-level Language for Coarse-Grained Reconfigurable Architectures
2011 (English). In: IEEE International Symposium on Parallel and Distributed Processing Workshops and Phd Forum, Washington, USA: IEEE Computer Society, 2011, p. 236-243. Conference paper, Published paper (Refereed)
Abstract [en]

Recently we proposed occam-pi as a high-level language for programming coarse-grained reconfigurable architectures. The constructs of occam-pi combine ideas from CSP and pi-calculus to facilitate expressing parallelism, communication, and reconfigurability. The feasibility of this approach was illustrated by developing a compiler framework to compile occam-pi implementations to the Ambric architecture. In this paper, we demonstrate the applicability of occam-pi for programming an array of functional units, the eXtreme Processing Platform (XPP). This is made possible by extending the compiler framework to target the XPP architecture, including automatic floating-point to fixed-point conversion. Different implementations of a FIR filter and a DCT algorithm were developed and evaluated on the basis of performance and resource consumption. The reported results reveal that the approach of using occam-pi to program the category of coarse-grained reconfigurable architectures appears to be promising. The resulting implementations are generally much superior to those programmed in C and comparable to those hand-coded in the low-level native language NML.

Place, publisher, year, edition, pages
Washington, USA: IEEE Computer Society, 2011
Series
IEEE International Parallel and Distributed Processing Symposium Proceedings, ISSN 1530-2075 ; 2011
Keywords
Coarse grained reconfigurable architecture, DCT algorithms, Fixed-point conversion, Functional units, Native language, Occam-pi, Pi calculus, Processing platform, Reconfigurability, Resource consumption
National subject category
Computer Engineering
Identifiers
urn:nbn:se:hh:diva-14615 (URN); 10.1109/IPDPS.2011.147 (DOI); 2-s2.0-83455266629 (Scopus ID); 978-1-61284-425-1 (ISBN); 978-076954385-7 (ISBN)
Conference
25th IEEE International Parallel and Distributed Processing Symposium, Workshops and Phd Forum, IPDPSW 2011, Anchorage, AK, 16 - 20 May, 2011
Funder
EU, FP7, Seventh Framework Programme, 100230
Note

©2011 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.

Available from: 2011-03-22. Created: 2011-03-22. Last updated: 2018-01-12. Bibliographically approved

Open Access in DiVA

fulltext (4831 kB), 419 downloads
File information
File name: FULLTEXT01.pdf; File size: 4831 kB; Checksum: SHA-512
2b3e376b6d48ee419de37dced82dbccfadf13dfbf4543ca982b2c22f64a4fd6e3bab2986c60d5730ab10c9698a92ed1ceb7acb4b184db3e81c595187b8472884
Type: fulltext; Mimetype: application/pdf
cover (1581 kB), 39 downloads
File information
File name: COVER01.pdf; File size: 1581 kB; Checksum: SHA-512
e68215be4cb918bc0b22ea49f92af3767ef34c7b96f5185acc28e40168d79ec3158ed0fcf9adf7b9606f0476cfdb322596455164353e8fda4141af199c8af616
Type: cover; Mimetype: application/pdf
spikblad (130 kB), 80 downloads
File information
File name: SPIKBLAD01.pdf; File size: 130 kB; Checksum: SHA-512
202ba776213415b2ae38274371916166c67b067abdb15de63d3a47bf9c23b16f5548e8e491c36dab3a973fb07ce687e9d088926a10579b84413d233b786ab821
Type: spikblad; Mimetype: application/pdf

Author

Ul-Abdin, Zain
