Loading...
15 results
Search Results
Now showing 1 - 10 of 15
- Scalable unified transform architecture for advanced video coding embedded systemsPublication . Dias, Tiago; Lopez, Sebastian; Roma, Nuno; Sousa, LeonelA novel high throughput and scalable unified architecture for the computation of the transform operations in video codecs for advanced standards is presented in this paper. This structure can be used as a hardware accelerator in modern embedded systems to efficiently compute all the two-dimensional 4 x 4 and 2 x 2 transforms of the H.264/AVC standard. Moreover, its highly flexible design and hardware efficiency allows it to be easily scaled in terms of performance and hardware cost to meet the specific requirements of any given video coding application. Experimental results obtained using a Xilinx Virtex-5 FPGA demonstrated the superior performance and hardware efficiency levels provided by the proposed structure, which presents a throughput per unit of area relatively higher than other similar recently published designs targeting the H.264/AVC standard. Such results also showed that, when integrated in a multi-core embedded system, this architecture provides speedup factors of about 120x concerning pure software implementations of the transform algorithms, therefore allowing the computation, in real-time, of all the above mentioned transforms for Ultra High Definition Video (UHDV) sequences (4,320 x 7,680 @ 30 fps).
- Figure follow: A step by step liberating devicePublication . Cavaleiro Rodrigues, José; Dias, Tiago; Silva, CatarinaFigure Follow (FF) is an award-winning project, envisioned to facilitate the learning of “Compulsory Figures”, an individual discipline of Artistic Roller Skating. This idea has now become a prototype under development, which uses sensors to detect the athletes position over figure lines, and provides, in real time, information concerning the general performance of the athlete and a corresponding score, both sent to a mobile app. FF aims to help class teaching and personal training, creating a more playful way to progress and maintain young athletes motivation. The technological development process of this product is a perfect context to put to the test some sociological ideas about the relation between technologies and society, particularly in a time of accelerated, permanent and transformative innovation. Even if we are just confronted with one more gamification of a learning device, it is an opportunity to discuss the liberating effects of digital communication techs on society and personal lives.
- High throughput and scalable architecture for unified transform coding in embedded H.264/AVC video coding systemsPublication . Dias, Tiago; Lopez, Sebastian; Roma, Nuno; Sousa, LeonelAn innovative high throughput and scalable multi-transform architecture for H.264/AVC is presented in this paper. This structure can be used as a hardware accelerator in modern embedded systems to efficiently compute the 4×4 forward/inverse integer DCT, as well as the 2-D 4×4 / 2×2 Hadamard transforms. Moreover, its highly flexible design and hardware efficiency allows it to be easily scaled in terms of performance and hardware cost to meet the specific requirements of any given video coding application. Experimental results obtained using a Xilinx Virtex-4 FPGA demonstrate the superior performance and hardware efficiency levels provided by the proposed structure, which presents a throughput per unit of area at least 1.8× higher than other similar recently published designs. Furthermore, such results also showed that this architecture can compute, in realtime, all the above mentioned H.264/AVC transforms for video sequences with resolutions up to UHDV.
- A portable lab for the practical study of modern computer engineeringPublication . Dias, Tiago; Sampaio, Pedro; Miguens Matutino, PedroThis demo paper presents the μLIC educational platform. μLIC makes available to the students of Digital Systems and Computer Architecture courses a quite simple, intuitive and portable hardware platform for the realization of their laboratory exercises. μLI C can also be used in other related courses, such as Embedded Systems or Hardware/Software Co-Design, which allows reducing the time needed to study the manuals of diverse development boards and tools and to concentrate the education towards the core contents of the courses. In addition, the low cost, diminished size, and portable nature of the μLIC boards enables students to carry a personal unit with them all the time to implement the class exercises also outside the classrooms, as well as hobby projects. In the demo, three typical lab assignments of Digital Systems, Computer Architecture, and Hardware/Software Co-Design courses are used to showcase the μLIC educational platform: a traffic light controller, a 4-bit Arithmetic and Logic Unit (ALU) and a mini Space Invaders inspired game, respectively.
- Unified transform architecture for AVC, AVS, VC-1 and HEVC high-performance codecsPublication . Dias, Tiago; Roma, Nuno; Sousa, LeonelA unified architecture for fast and efficient computation of the set of two-dimensional (2-D) transforms adopted by the most recent state-of-the-art digital video standards is presented in this paper. Contrasting to other designs with similar functionality, the presented architecture is supported on a scalable, modular and completely configurable processing structure. This flexible structure not only allows to easily reconfigure the architecture to support different transform kernels, but it also permits its resizing to efficiently support transforms of different orders (e. g. order-4, order-8, order-16 and order-32). Consequently, not only is it highly suitable to realize high-performance multi-standard transform cores, but it also offers highly efficient implementations of specialized processing structures addressing only a reduced subset of transforms that are used by a specific video standard. The experimental results that were obtained by prototyping several configurations of this processing structure in a Xilinx Virtex-7 FPGA show the superior performance and hardware efficiency levels provided by the proposed unified architecture for the implementation of transform cores for the Advanced Video Coding (AVC), Audio Video coding Standard (AVS), VC-1 and High Efficiency Video Coding (HEVC) standards. In addition, such results also demonstrate the ability of this processing structure to realize multi-standard transform cores supporting all the standards mentioned above and that are capable of processing the 8k Ultra High Definition Television (UHDTV) video format (7,680 x 4,320 at 30 fps) in real time.
- Extending the Synoptics of Things (SoT) Framework to Manage ISoS Technology LandscapesPublication . Serras, Bruno; Gonçalves, Carlos; Dias, Tiago; F. G. Osório, A. LuísManaging and monitoring the software and hardware artifacts of an industrial organization are fundamental efforts that can often be challenging endeavors to achieve, especially when such technological landscapes are composed of multiple heterogeneous systems. Usually, Internet of Things (IoT) devices are provided by different suppliers and may use different protocols and interfaces. Thus, the integration of these devices results in complex development and maintenance cycles. The Synoptics of Things (SoT) framework can address these problems, in conjunction with the Informatics System of Systems (ISoS) platform by promoting an open market competitive tech nology landscape for organizations. The purpose of the research presented in this paper was to extend the SoT framework in order to manage and monitor the different elements of an ISoS enabled organization, namely the ISystems (Informatics Systems), CES (Cooperation Enabled Services), and Services. We argue that the SoT framework can be essential in a supervisory control and data acquisition (SCADA) system in today’s modern web, by adopting the concept of Web Components as a standard to enable the development of custom and reusable components. We present and discuss such issues in the context of the HORUS system, an informatic system responsible for payment enforcement in fueling stations. In this system, several devices must work in coordination, such as video cameras and video recorders, which are fundamental for the retrieving of license plate images and, therefore, be continually monitored to ensure the correct functioning of the HORUS system.
- A Flexible Architecture for the Computation of Direct and Inverse Transforms in H.264/AVC Video CodecsPublication . Dias, Tiago; Lopez, S.; Roma, N.; Sousa, L.A new high throughput and scalable architecture for unified transform coding in H.264/AVC is proposed in this paper. Such flexible structure is capable of computing all the 4x4 and 2x2 transforms for Ultra High Definition Video (UHDV) applications (4320x7680@ 30fps) in real-time and with low hardware cost. These significantly high performance levels were proven with the implementation of several different configurations of the proposed structure using both FPGA and ASIC 90 nm technologies. In addition, such experimental evaluation also demonstrated the high area efficiency of theproposed architecture, which in terms of Data Throughput per Unit of Area (DTUA) is at least 1.5 times more efficient than its more prominent related designs(1).
- High Performance Multi-Standard Architecture for DCT Computation in H.264/AVC High Profile and HEVC CodecsPublication . Dias, Tiago; Roma, Nuno; Sousa, LeonelA new high performance architecture for the computation of all the DCT operations adopted in the H.264/AVC and HEVC standards is proposed in this paper. Contrasting to other dedicated transform cores, the presented multi-standard transform architecture is supported on a completely configurable, scalable and unified structure, that is able to compute not only the forward and the inverse 8×8 and 4×4 integer DCTs and the 4×4 and 2×2 Hadamard transforms defined in the H.264/AVC standard, but also the 4×4, 8×8, 16×16 and 32×32 integer transforms adopted in HEVC. Experimental results obtained using a Xilinx Virtex-7 FPGA demonstrated the superior performance and hardware efficiency levels provided by the proposed structure, which outperforms its more prominent related designs by at least 1.8 times. When integrated in a multi-core embedded system, this architecture allows the computation, in real-time, of all the transforms mentioned above for resolutions as high as the 8k Ultra High Definition Television (UHDTV) (7680×4320 @ 30fps).
- Fully parameterizable VLSI architecture for sub-pixel motion estimation with low memory bandwidth requirementsPublication . Dias, Tiago; Roma, Nuno; Sousa, LeonelThis paper proposes a new scalable and efficient VLSI type-II architecture for real-time motion estimation optimized for subpel refinement algorithms. Based on the proposed architecture, which provides minimum latency, maximum throughput, and full utilization of the hardware resources, the implementation of a dedicated motion estimation coprocessor is also presented in this paper. This circuit is characterized by low memory bandwidth requirements, a modular and highly flexible structure and is capable of estimating motion vectors with half-pixel accuracy using the bilinear interpolation algorithm. Experimental results for implementations on ASIC and FPGA devices show that by using the proposed architecture it is possible to estimate motion vectors up to the 16CIF image format in real-time, with any given sub-pixel accuracy.
- High performance IP core for HEVC quantizationPublication . Dias, Tiago; Roma, Nuno; Sousa, LeonelA new class of quantization architectures suitable for the realization of high performance and hardware efficient forward, inverse and unified quantizers for HEVC is presented. The proposed structures are based on a highly flexible and optimized integer datapath that can be configured to provide several pipelined and non-pipelined implementations, offering distinct trade-offs between performance and hardware cost, which makes them highly suitable for most video coding application domains. The experimental results obtained using a 90 nm CMOS process show that the proposed class of quantization architectures is able to process 4k UHDTV video sequences in real-time (3840 x 2160 @ 30fps), with a power consumption as low as 3.9 mW when the unified architecture is operated at 374 MHz.