Subject: GPU - OPAC Collections Catalog

Item 1.

Title:: Wykorzystanie CPU i GPU do obliczeń w Matlabie
The use of CPU and GPU for calculations in Matlab
Authors:: Woźniak, Jarosław
Links:: https://bibliotekanauki.pl/articles/98264.pdf
Publication date:: 2019
Publisher:: Politechnika Lubelska. Instytut Informatyki
Subjects:: CPU
GPU
Matlab
Description:: The article presents selected solutions that use CPUs and GPUs for calculations in the Matlab environment. Various methods of performing calculations on the CPU and on the GPU were compared. The differences, advantages, disadvantages and effects of using the selected calculation methods are indicated.
Source:: Journal of Computer Sciences Institute; 2019, 10; 32-35
2544-0764
Appears in:: Journal of Computer Sciences Institute
Content provider:: Biblioteka Nauki

Article

Item 2.

Title:: GPU enhanced simulation of angiogenesis
Authors:: Worecki, M.
Wcisło, R.
Links:: https://bibliotekanauki.pl/articles/305573.pdf
Publication date:: 2012
Publisher:: Akademia Górniczo-Hutnicza im. Stanisława Staszica w Krakowie. Wydawnictwo AGH
Subjects:: GPU
angiogenesis
tumor
Description:: In the paper we present the use of graphics processing units to accelerate the most time-consuming stages of a simulation of angiogenesis and tumor growth. By using advanced CUDA mechanisms such as shared memory, textures and atomic operations, we managed to speed up the CUDA kernels by a factor of 57. However, in our simulation we used the GPU as a co-processor, and data from the CPU was copied back and forth in each phase. This decreased the speedup of the rewritten stages by 40%. We showed that the performance of the entire simulation can be improved by a factor of 10 to 20.
Source:: Computer Science; 2012, 13 (1); 35-48
1508-2806
2300-7036
Appears in:: Computer Science
Content provider:: Biblioteka Nauki

Article

Item 3.

Title:: Assessment of various GPU acceleration strategies in text categorization processing flow
Authors:: Korduła, Ł.
Wielgosz, M.
Karwatowski, M.
Pietroń, M.
Żurek, D.
Wiatr, K.
Links:: https://bibliotekanauki.pl/articles/114132.pdf
Publication date:: 2017
Publisher:: Stowarzyszenie Inżynierów i Techników Mechaników Polskich
Subjects:: GPU
NLP
text categorization
OpenCL
Description:: Automatic text categorization presents many difficulties. Modern algorithms are getting better at extracting meaningful information from human language. However, they often significantly increase the complexity of computations. This increased demand for computational capabilities can be met by using hardware accelerators such as general-purpose graphics cards. In this paper we present a full processing flow for a document categorization system. Gram-Schmidt process signatures calculation up to 12-fold decrease in computing time of system components.
Source:: Measurement Automation Monitoring; 2017, 63, 6; 203-205
2450-2855
Appears in:: Measurement Automation Monitoring
Content provider:: Biblioteka Nauki

Article

Item 4.

Title:: Real Time Processing of Networked Passive Coherent Location Radar System
Authors:: John, M.
Inggs, M.
Petri, D.
Links:: https://bibliotekanauki.pl/articles/226268.pdf
Publication date:: 2011
Publisher:: Polska Akademia Nauk. Czytelnia Czasopism PAN
Subjects:: passive
PCL
real-time
GPU
Description:: A Passive Coherent Location (PCL) Radar system, consisting of spatially distributed transmitters and receivers, is currently being integrated at the University of Cape Town (UCT). The paper investigates the feasibility of real-time processing of PCL system signals using Graphics Processing Units (GPUs), specifically through a study of two distinct clutter cancellation algorithms: ECA (Extensive Cancellation Algorithm) and NLMS (Normalised Least Mean Square). Clutter cancellation is the most computationally demanding part of PCL signal processing. This investigation compares the processing speed-up achieved by GPU over CPU implementations, with very encouraging results.
Source:: International Journal of Electronics and Telecommunications; 2011, 57, 3; 363-368
2300-1933
Appears in:: International Journal of Electronics and Telecommunications
Content provider:: Biblioteka Nauki

Article

Item 5.

Title:: Accelerating SELECT WHERE and SELECT JOIN queries on a GPU
Authors:: Pietroń, M.
Russek, P.
Wiatr, K.
Links:: https://bibliotekanauki.pl/articles/305797.pdf
Publication date:: 2013
Publisher:: Akademia Górniczo-Hutnicza im. Stanisława Staszica w Krakowie. Wydawnictwo AGH
Subjects:: SQL
CUDA
relational databases
GPU
Description:: This paper presents implementations of a few selected SQL operations using the CUDA programming framework on the GPU platform. Nowadays, the GPU's parallel architectures give a high speed-up on certain problems. Therefore, the number of non-graphical problems that can be run and sped up on the GPU keeps increasing. In particular, there has been a lot of research in data mining on GPUs. In many cases it proves the advantage of offloading processing from the CPU to the GPU. At the beginning of our project we chose the set of SELECT WHERE and SELECT JOIN instructions as the most common operations used in databases. We parallelized these SQL operations using three main mechanisms in CUDA: thread group hierarchy, shared memories, and barrier synchronization. Our results show that the implemented highly parallel SELECT WHERE and SELECT JOIN operations on the GPU platform can be significantly faster than the sequential ones in a database system run on the CPU.
Source:: Computer Science; 2013, 14 (2); 243-252
1508-2806
2300-7036
Appears in:: Computer Science
Content provider:: Biblioteka Nauki

Article

Item 6.

Title:: Modeling and rendering of convective cumulus clouds for real-time graphics purposes
Authors:: Kobak, P.
Alda, W.
Links:: https://bibliotekanauki.pl/articles/305238.pdf
Publication date:: 2017
Publisher:: Akademia Górniczo-Hutnicza im. Stanisława Staszica w Krakowie. Wydawnictwo AGH
Subjects:: computer graphics
natural phenomena
GPU programming
Description:: The paper presents a simulation and rendering model of three-dimensional convective cloud evolution. The model is physically based; however, its purpose is graphical. The main stress is put on balancing two parts of the model: the atmosphere simulation with convective motion of air and water vapor, combined with rendering of semi-transparent and light-scattering clouds, in order to achieve realistic animation in real time. We examine and compare two algorithmic approaches based on CPU and GPU computations.
Source:: Computer Science; 2017, 18 (3); 241-268
1508-2806
2300-7036
Appears in:: Computer Science
Content provider:: Biblioteka Nauki

Article

Item 7.

Title:: Exploiting multi-core and many-core parallelism for subspace clustering
Authors:: Datta, Amitava
Kaur, Amardeep
Lauer, Tobias
Chabbouh, Sami
Links:: https://bibliotekanauki.pl/articles/331126.pdf
Publication date:: 2019
Publisher:: Uniwersytet Zielonogórski. Oficyna Wydawnicza
Subjects:: data mining
subspace clustering
multicore processor
many core processor
GPU computing
Description:: Finding clusters in high dimensional data is a challenging research problem. Subspace clustering algorithms aim to find clusters in all possible subspaces of the dataset, where a subspace is a subset of dimensions of the data. But the exponential increase in the number of subspaces with the dimensionality of data renders most of the algorithms inefficient as well as ineffective. Moreover, these algorithms have ingrained data dependency in the clustering process, which means that parallelization becomes difficult and inefficient. SUBSCALE is a recent subspace clustering algorithm which is scalable with the dimensions and contains independent processing steps which can be exploited through parallelism. In this paper, we aim to leverage the computational power of widely available multi-core processors to improve the runtime performance of the SUBSCALE algorithm. The experimental evaluation shows linear speedup. Moreover, we develop an approach using graphics processing units (GPUs) for fine-grained data parallelism to accelerate the computation further. First tests of the GPU implementation show very promising results.
Source:: International Journal of Applied Mathematics and Computer Science; 2019, 29, 1; 81-91
1641-876X
2083-8492
Appears in:: International Journal of Applied Mathematics and Computer Science
Content provider:: Biblioteka Nauki

Article

Item 8.

Title:: The Java profiler based on byte code analysis and instrumentation for many-core hardware accelerators
Authors:: Pietroń, M.
Karwatowski, M.
Wiatr, K.
Links:: https://bibliotekanauki.pl/articles/114614.pdf
Publication date:: 2015
Publisher:: Stowarzyszenie Inżynierów i Techników Mechaników Polskich
Subjects:: virtual machine
CUDA
GPU
profiling
parallel computing
Description:: One of the most challenging issues in the case of many-core and multi-core architectures is how to exploit their potential computing power in legacy systems without a deep knowledge of their architecture. The analysis of static dependences and dynamic data dependences of a program run can help to identify independent paths that could be computed by individual parallel threads. The statistics of data reuse and data size are also crucial in adapting the application to a GPU many-core hardware architecture because of its specific memory hierarchies. The proposed profiling system performs static data analysis and computes dynamic dependencies for Java programs, and recommends the parts of the source code with the highest potential for parallelization on the GPU. Such an analysis can also provide a starting point for automatic parallelization.
Source:: Measurement Automation Monitoring; 2015, 61, 7; 385-387
2450-2855
Appears in:: Measurement Automation Monitoring
Content provider:: Biblioteka Nauki

Article

Item 9.

Title:: Analiza efektywności sprzętowych implementacji algorytmów detekcji twarzy w obrazach cyfrowych
Effectiveness analysis of hardware implementations of face detection algorithms in digital images
Authors:: Wujek, P.
Pełka, R.
Links:: https://bibliotekanauki.pl/articles/156623.pdf
Publication date:: 2012
Publisher:: Stowarzyszenie Inżynierów i Techników Mechaników Polskich
Subjects:: face detection
FPGA
GPU
GPGPU
Description:: This paper describes a comparison of hardware implementations of a face detection algorithm on three different platforms: (1) a classic CPU implementation (Matlab), (2) an implementation using programmable logic, an FPGA with a PowerPC hardware processor [1], and (3) a CPU-based version with GPU acceleration. These three versions have been experimentally tested and compared in terms of the required hardware resources and operating speed, which is of great importance in most practical applications. We also discuss the advantages and drawbacks of these three approaches to hardware implementation of face detection algorithms. In particular, we formulate some important conditions that the analyzed image must meet to obtain the optimum effectiveness of the face detection algorithm implemented on each platform. Finally, we show that GPU acceleration can combine the advantages of the classic CPU with the parallel computing available on an FPGA. With the proposed solution, the skin color detection time for the CPU with GPU acceleration is over 100 times shorter than for the classical CPU solution. As the programmable device we used a Virtex-4 FPGA chip from Xilinx, and as the GPU accelerator we used an nVidia GeForce 8600 GT graphics card.
Source:: Pomiary Automatyka Kontrola; 2012, Vol. 58, no. 7, 7; 581-583
0032-4140
Appears in:: Pomiary Automatyka Kontrola
Content provider:: Biblioteka Nauki

Article

Item 10.

Title:: Using shared memory as a cache in cellular automata water flow simulations on GPUs
Authors:: Topa, P.
Młocek, P.
Links:: https://bibliotekanauki.pl/articles/305529.pdf
Publication date:: 2013
Publisher:: Akademia Górniczo-Hutnicza im. Stanisława Staszica w Krakowie. Wydawnictwo AGH
Subjects:: cellular automata
GPU computation
modelling physical phenomena
Description:: Graphics processors (GPU – Graphic Processor Units) have recently gained a lot of interest as an efficient platform for general-purpose computation. The Cellular Automata approach, which is inherently parallel, gives the opportunity to implement high performance simulations. This paper presents how shared memory in the GPU can be used to improve performance for Cellular Automata models. In our previous works, we proposed algorithms for the Cellular Automata model that use only the GPU global memory. Using a profiling tool, we found bottlenecks in our approach. In this paper, we introduce modifications that take advantage of fast shared memory. The modified algorithm is presented in detail, and the results of profiling and performance tests are demonstrated. Our unique achievement is comparing the efficiency of the same algorithm working with global and shared memory.
Source:: Computer Science; 2013, 14 (3); 385-401
1508-2806
2300-7036
Appears in:: Computer Science
Content provider:: Biblioteka Nauki

Article

Item 11.

Title:: G-DNA – a highly efficient multi-GPU/MPI tool for aligning nucleotide reads
Authors:: Frohmberg, W.
Kierzynka, M.
Blazewicz, J.
Gawron, P.
Wojciechowski, P.
Links:: https://bibliotekanauki.pl/articles/200827.pdf
Publication date:: 2013
Publisher:: Polska Akademia Nauk. Czytelnia Czasopism PAN
Subjects:: DNA assembly preprocessing
sequence alignment
GPU computing
Description:: DNA/RNA sequencing has recently become a primary way researchers generate biological data for further analysis. Assembling algorithms are an integral part of this process. However, some of them require pairwise alignment to be applied to a great many reads. Although several efficient alignment tools have been released over the past few years, including those taking advantage of GPUs (Graphics Processing Units), none of them directly targets high-throughput sequencing data. As a result, a need arose to create software that could handle such data as effectively as possible. G-DNA (GPU-based DNA aligner) is the first highly parallel solution that has been optimized to process nucleotide reads (DNA/RNA) from modern sequencing machines. Results show that the software reaches up to 89 GCUPS (Giga Cell Updates Per Second) on a single GPU and as a result it is the fastest tool in its class. Moreover, it scales up well on multi-GPU systems, including MPI-based computational clusters, where its performance is counted in TCUPS (Tera CUPS).
Source:: Bulletin of the Polish Academy of Sciences. Technical Sciences; 2013, 61, 4; 989-992
0239-7528
Appears in:: Bulletin of the Polish Academy of Sciences. Technical Sciences
Content provider:: Biblioteka Nauki

Article

Item 12.

Title:: Smoothed particle hydrodynamics simulations using graphics processing units
Authors:: Szewc, K.
Links:: https://bibliotekanauki.pl/articles/1954445.pdf
Publication date:: 2014
Publisher:: Politechnika Gdańska
Subjects:: fluid dynamics
CFD
particle methods
SPH
GPU calculations
Description:: Smoothed Particle Hydrodynamics (SPH) is a fully Lagrangian, particle-based technique for fluid-flow modeling. As a gridless method, it appears to be a natural approach to simulate multi-phase flow with complex geometries. Since SPH involves a large set of short-range particle-particle interactions, numerical implementations present a high degree of spatial data locality and a significant number of independent computations. Therefore, the numerical code can be easily written in a massively parallel manner. The main purpose of this study is to discuss the issues related to the implementation of the SPH method for computation using Graphics Processing Units (GPU). The study is supported by two-dimensional validation cases: the lid-driven cavity and oscillation of a droplet. The obtained results show good accuracy of the method, as well as high numerical efficiency of its GPU implementation.
Source:: TASK Quarterly. Scientific Bulletin of Academic Computer Centre in Gdansk; 2014, 18, 1; 67-80
1428-6394
Appears in:: TASK Quarterly. Scientific Bulletin of Academic Computer Centre in Gdansk
Content provider:: Biblioteka Nauki

Article

Item 13.

Title:: GPU implementation of atomic fluid MD simulation
Authors:: Dawid, Aleksander
Links:: https://bibliotekanauki.pl/articles/2197547.pdf
Publication date:: 2022
Publisher:: Politechnika Gdańska
Subjects:: MD simulation
GPU
atomic fluid
MD parallel algorithm
Description:: A computer simulation of an atomic fluid on a GPU was implemented using the CUDA architecture. It was shown that the programming model for efficient numerical computing applications was changing with the development of the CUDA architecture. The introduction of the L2 cache decreased the latency between the global GPU memory and the registers. The performed MD simulation using the global memory and registers showed that the average acceleration relative to the CPU reached 80 times for single-precision calculations. Usually, the shared block memory gives much better results for this kind of calculation. We have found that using the shared memory gives acceleration over 116 times in comparison to the CPU. It is about 49% faster than using the global memory and registers. It is shown here that the performance of generally available graphics cards for double-precision calculations is significantly lower than for single-precision calculations. The recorded double-precision acceleration relative to the CPU in our experiment averaged 6 and 7 times for the global and shared memory, respectively. We performed these calculations on two different CUDA-enabled device systems.
Source:: TASK Quarterly. Scientific Bulletin of Academic Computer Centre in Gdansk; 2022, 26, 1; 25-37
1428-6394
Appears in:: TASK Quarterly. Scientific Bulletin of Academic Computer Centre in Gdansk
Content provider:: Biblioteka Nauki

Article

Item 14.

Title:: G-PAS 2.0 - an improved version of protein alignment tool with an efficient backtracking routine on multiple GPUs
Authors:: Frohmberg, W.
Kierzynka, M.
Blazewicz, J.
Wojciechowski, P.
Links:: https://bibliotekanauki.pl/articles/201593.pdf
Publication date:: 2012
Publisher:: Polska Akademia Nauk. Czytelnia Czasopism PAN
Subjects:: pairwise alignment
GPU computing
alignment with backtracking procedure
Description:: Several highly efficient alignment tools have been released over the past few years, including those taking advantage of GPUs (Graphics Processing Units). G-PAS (GPU-based Pairwise Alignment Software) was one of them, albeit with a couple of interesting features that made it unique. Nevertheless, in order to adapt it to a new computational architecture, some changes had to be introduced. In this paper we present G-PAS 2.0, a new version of the software for performing high-throughput alignment. Results show that the new version is nearly a quarter faster on the same hardware, reaching over 20 GCUPS (Giga Cell Updates Per Second).
Source:: Bulletin of the Polish Academy of Sciences. Technical Sciences; 2012, 60, 3; 491-494
0239-7528
Appears in:: Bulletin of the Polish Academy of Sciences. Technical Sciences
Content provider:: Biblioteka Nauki

Article

Item 15.

Title:: Digital image correlation - universal tools versus custom solutions
Cyfrowa Korelacja Obrazu - Uniwersalne narzędzia a rozwiązania użytkownika
Authors:: Marciniak, T.
Bujnowski, S.
Lutowski, Z.
Boroński, D.
Giesko, T.
Links:: https://bibliotekanauki.pl/articles/257990.pdf
Publication date:: 2010
Publisher:: Sieć Badawcza Łukasiewicz - Instytut Technologii Eksploatacji - Państwowy Instytut Badawczy
Subjects:: correlation methods
DIC
GPU
CUDA
multithreading
Description:: The development of optoelectronics and the increasing processing speed of processors encourage the use of various measurement methods based on data obtained from digital cameras. The article presents test results on the time consumption of the normalised 2D correlation function. Correlation is one of the most commonly used functions in image processing. It searches for a correlation between two images and requires a relatively large number of calculations. The speed of this function was examined in a variety of tools, such as MatLab, MatLab with the Parallel Computing library, the OpenCV library, and OpenCV ver. 2.2 with the CUDA platform extension. During the tests, image processing speeds were determined, especially for images acquired with high-resolution cameras. The conclusions presented concern the application possibilities of correlation methods, including multithreaded processing.
Source:: Problemy Eksploatacji; 2010, 4; 19-28
1232-9312
Appears in:: Problemy Eksploatacji
Content provider:: Biblioteka Nauki

Article

Item 16.

Title:: Heterogeneous GPU&CPU cluster for High Performance Computing in cryptography
Authors:: Marks, M.
Jantura, J.
Niewiadomska-Szynkiewicz, E.
Strzelczyk, P.
Góźdź, K.
Links:: https://bibliotekanauki.pl/articles/305288.pdf
Publication date:: 2012
Publisher:: Akademia Górniczo-Hutnicza im. Stanisława Staszica w Krakowie. Wydawnictwo AGH
Subjects:: parallel computing
HPC
clusters
GPU computing
OpenCL
cryptography
cryptanalysis
Description:: This paper addresses issues associated with distributed computing systems and the application of mixed GPU&CPU technology to data encryption and decryption algorithms. We describe a heterogeneous cluster, HGCC, formed by two types of nodes: an Intel processor with an NVIDIA graphics processing unit and an AMD processor with an AMD graphics processing unit (formerly ATI), and a novel software framework that hides the heterogeneity of our cluster and provides tools for solving complex scientific and engineering problems. Finally, we present the results of numerical experiments. The considered case study is concerned with parallel implementations of selected cryptanalysis algorithms. The main goal of the paper is to show the wide applicability of the GPU&CPU technology to large scale computation and data processing.
Source:: Computer Science; 2012, 13 (2); 63-79
1508-2806
2300-7036
Appears in:: Computer Science
Content provider:: Biblioteka Nauki

Article

Item 17.

Title:: The comparison of parallel sorting algorithms implemented on different hardware platforms
Authors:: Żurek, D.
Pietroń, M.
Wielgosz, M.
Wiatr, K.
Links:: https://bibliotekanauki.pl/articles/305317.pdf
Publication date:: 2013
Publisher:: Akademia Górniczo-Hutnicza im. Stanisława Staszica w Krakowie. Wydawnictwo AGH
Subjects:: parallel algorithms
GPU
OpenMP
CUDA
sorting networks
merge-sort
Description:: Sorting is a common problem in computer science. There are a lot of well-known sorting algorithms created for sequential execution on a single processor. Recently, many-core and multi-core platforms have enabled the creation of wide parallel algorithms. We have standard processors that consist of multiple cores and hardware accelerators, like the GPU. Graphics cards, with their parallel architecture, provide new opportunities to speed up many algorithms. In this paper, we describe the results from the implementation of a few different parallel sorting algorithms on GPU cards and multi-core processors. Then, a hybrid algorithm is presented, consisting of parts executed on both platforms (a standard CPU and a GPU). In recent literature about the implementation of sorting algorithms on the GPU, a fair comparison between many-core and multi-core platforms is lacking. In most cases, these works report the execution time of sorting algorithms on the GPU platform and on a single CPU core.
Source:: Computer Science; 2013, 14 (4); 679-691
1508-2806
2300-7036
Appears in:: Computer Science
Content provider:: Biblioteka Nauki

Article

Item 18.

Title:: A Novel FE/MC-based Mathematical Model of Mushy Steel Deformation with GPU Support
Authors:: Hojny, Marcin
Dębiński, Tomasz
Links:: https://bibliotekanauki.pl/articles/2106557.pdf
Publication date:: 2022
Publisher:: Polska Akademia Nauk. Czytelnia Czasopism PAN
Subjects:: FEM
Monte Carlo
extra-high temperatures
soft-reduction
GPU
Description:: The paper presents the results of work leading to the construction of a spatial hybrid model based on finite element (FE) and Monte Carlo (MC) methods, allowing the computer simulation of physical phenomena accompanying steel sample testing at temperatures that are characteristic of the soft-reduction process. The proposed solution includes local density variations at the level of the mechanical solution (the incompressibility condition was replaced with the condition of mass conservation), and at the same time simulates the grain growth in a comprehensive resistance heating process combined with local remelting followed by free/controlled cooling of the tested sample. Simulation of grain growth in the entire computing domain would not be possible without the support of GPU processors. There was a 59-fold increase in the computing speed on the GPU compared to single-threaded computing on the CPU. The study was complemented by examples of experimental and computer simulation results, showing the correctness of the adopted model assumptions.
Source:: Archives of Metallurgy and Materials; 2022, 67, 2; 735-742
1733-3490
Appears in:: Archives of Metallurgy and Materials
Content provider:: Biblioteka Nauki

Article

Item 19.

Title:: Sequential Classification of Palm Gestures Based on A* Algorithm and MLP Neural Network for Quadrocopter Control
Authors:: Wodziński, M.
Krzyżanowska, A.
Links:: https://bibliotekanauki.pl/articles/221525.pdf
Publication date:: 2017
Publisher:: Polska Akademia Nauk. Czytelnia Czasopism PAN
Subjects:: machine learning
shortest path
sequential data
quadrocopter
GPU
CUDA
Description:: This paper presents an alternative approach to sequential data classification, based on traditional machine learning algorithms (neural networks, principal component analysis, multivariate Gaussian anomaly detector) and finding the shortest path in a directed acyclic graph, using the A* algorithm with a regression-based heuristic. Palm gestures were used as an example of the sequential data and a quadrocopter was the controlled object. The study includes the creation of a conceptual model and the practical construction of a system using the GPU to ensure real-time operation. The results present the classification accuracy of the chosen gestures and a comparison of the computation time between the CPU- and GPU-based solutions.
Source:: Metrology and Measurement Systems; 2017, 24, 2; 265-276
0860-8229
Appears in:: Metrology and Measurement Systems
Content provider:: Biblioteka Nauki

Article

Item 20.

Title:: Akceleracja obliczeń kryptograficznych z wykorzystaniem procesorów GPU
[Acceleration of cryptographic computations using GPU processors]
Links:: Studia Bezpieczeństwa Narodowego 2014, no. 6, pp. 341-357
Contributors:: Bęza, Patryk. Author
Gocławski, Jakub. Author
Mral, Paweł. Author
Sapiecha, Piotr. Author
Buda, Michał. Author
Publication date:: 2014
Subjects:: Cryptology
Ciphers
Graphics processor (GPU)
Scientific journal article
Description:: Bibliography and list of online sources on pages 355-357.
Content provider:: Bibliografia CBW

Article

Item 21.

Title:: Równoległa realizacja przykładowego algorytmu genetycznego z wykorzystaniem akceleratorów GPU
[Parallel implementation of an example genetic algorithm using GPU accelerators]
Authors:: Ratuszniak, P.
Stasiak, A.
Łańcucki, R.
Links:: https://bibliotekanauki.pl/articles/118416.pdf
Publication date:: 2018
Publisher:: Politechnika Koszalińska. Wydawnictwo Uczelniane
Subjects:: genetic algorithm
parallel programming
computing acceleration
GPU accelerators
CUDA
travelling salesman problem
GPU
Description:: The paper presents a practical implementation of a local desktop application that runs an example genetic algorithm using GPU accelerators. In this case, a genetic algorithm was used to solve a typical optimization problem, the travelling salesman problem. Additionally, Nvidia CUDA programming technology was used to exploit the power of the graphics card in the application.
Source:: Zeszyty Naukowe Wydziału Elektroniki i Informatyki Politechniki Koszalińskiej; 2018, 13; 63-78
1897-7421
Appears in:: Zeszyty Naukowe Wydziału Elektroniki i Informatyki Politechniki Koszalińskiej
Content provider:: Biblioteka Nauki

Article

Skocz do pozycji: 22.

Tytuł:: Oskarżenie o współpracę z wywiadem polskim jako narzędzie GPU/NKWD USRS w czystkach w Armii Sowieckiej w latach dwudziestych i trzydziestych XX wieku. Przypadek Henryka Wieczffińskiego i polski wątek sprawy kryptonim „Wiesna”
Accusation of Collaboration with Polish Intelligence as a Tool of the GPU/NKVD of the Ukrainian SSR in the Soviet Army Purges in the 1920s and 1930s. The Case of Henryk Wieczffiński and a Polish Theme in the "Vesna" Case
Autorzy:: Kokin, Serhii
Powiązania:: https://bibliotekanauki.pl/articles/20265847.pdf
Data publikacji:: 2022
Wydawca:: Instytut Pamięci Narodowej, Komisja Ścigania Zbrodni przeciwko Narodowi Polskiemu
Tematy:: Henryk Wieczffiński
Józef Piłsudski
Ukrainian Soviet Socialist Republic
OGPU
GPU of Ukraine
"Vesna" case
Ukraińska Socjalistyczna Republika Sowiecka
GPU Ukrainy
sprawa „Wiesna”
Opis:: Porażka Sowietów w wojnie polsko-bolszewickiej w 1920 r. oznaczała upadek ich nadziei na szybki i udany eksport rewolucji proletariackiej do Europy Środkowej i Zachodniej. Symbolem tej ciężkiej klęski geopolitycznej na wiele lat stał się dla bolszewików Józef Piłsudski, toteż walka z jego dziedzictwem w Ukraińskiej Socjalistycznej Republice Sowieckiej (USRS) należała do najważniejszych zadań komunistycznych służb bezpieczeństwa w latach dwudziestych i trzydziestych XX w. Pokazowe pod tym względem były liczne rozpracowania operacyjne i duże grupowe sprawy karne wszczęte w tym okresie przez organy OGPU, w taki czy inny sposób zawsze bowiem był w nich obecny „polski ślad”. Przykładem takiej działalności jest likwidacja przez GPU Ukrainy Wszechzwiązkowej Wojskowo-Oficerskiej Organizacji Kontrrewolucyjnej (sprawa „Wiesna”), w której ramach został aresztowany, a następnie rozstrzelany (12 lipca 1931 r. w Charkowie) por. Henryk Wieczffiński, lotnik obserwator, szef sztabu 20 Brygady Lotniczej Ukraińskiego Okręgu Wojskowego.
The defeat of the Soviets in the Polish-Bolshevik War in 1920 marked the collapse of their hopes for the rapid and successful export of the proletarian revolution to Central and Western Europe. Józef Piłsudski became a long-time symbol of this grave geopolitical defeat for the Bolsheviks, so the fight against his legacy in the Ukrainian Soviet Socialist Republic (Ukrainian SSR) was one of the most important tasks of the Communist security services in the 1920s and 1930s. Demonstrative in this respect were the numerous operational examinations and large collective criminal cases initiated during this period by the OGPU authorities, for in one way or another there was always a "Polish trace" present. An example of such activity is the liquidation of the All-Union Military-Officer Counterrevolutionary Organisation by the GPU of Ukraine (the "Vesna" case), as part of which Lt. Henryk Wieczffiński, an observer pilot, chief of staff of the 20th Aviation Brigade of the Ukrainian Military District, was arrested and subsequently executed (12 July 1931 in Kharkov).
Źródło:: Aparat Represji w Polsce Ludowej 1944–1989; 2022, 20; 375-392
1733-6996
Pojawia się w:: Aparat Represji w Polsce Ludowej 1944–1989
Dostawca treści:: Biblioteka Nauki

Artykuł


Skocz do pozycji: 23.

Tytuł:: Stereoscopic video chroma key processing using NVIDIA CUDA
Autorzy:: Sagan, J.
Powiązania:: https://bibliotekanauki.pl/articles/106272.pdf
Data publikacji:: 2013
Wydawca:: Uniwersytet Marii Curie-Skłodowskiej. Wydawnictwo Uniwersytetu Marii Curie-Skłodowskiej
Tematy:: NVIDIA CUDA
chroma key processing
GPU
CPU
stereoscopic images
Opis:: In this paper, I use the NVIDIA CUDA technology to perform the chroma key algorithm on stereoscopic images. NVIDIA CUDA allows parallel algorithms to run on the GPU. The input data are stereoscopic images with a monochromatic background and a destination background image. The output is the combination of the inputs produced by chroma keying. I compare the algorithm's efficiency between GPU and CPU execution.
Źródło:: Annales Universitatis Mariae Curie-Skłodowska. Sectio AI, Informatica; 2013, 13, 1; 81-87
1732-1360
2083-3628
Pojawia się w:: Annales Universitatis Mariae Curie-Skłodowska. Sectio AI, Informatica
Dostawca treści:: Biblioteka Nauki

Artykuł


Skocz do pozycji: 24.

Tytuł:: Real-time motion tracking using optical flow on multiple GPUs
Autorzy:: Mahmoudi, S. A.
Kierzynka, M.
Manneback, P.
Kurowski, K.
Powiązania:: https://bibliotekanauki.pl/articles/200476.pdf
Data publikacji:: 2014
Wydawca:: Polska Akademia Nauk. Czytelnia Czasopism PAN
Tematy:: Lucas-Kanade method
sparse optical flow
multiple GPU computations
Opis:: Motion tracking algorithms are widely used in computer vision research. However, new video standards, especially those with high resolutions, mean that current implementations, even when running on modern hardware, no longer meet the needs of real-time processing. To overcome this challenge, several GPU (Graphics Processing Unit) computing approaches have recently been proposed. Although they demonstrate the great potential of the GPU platform, hardly any of them can process high-definition video sequences efficiently. Thus, a need arose to develop a tool able to address this problem. In this paper we present software that implements optical flow motion tracking using the Lucas-Kanade algorithm. It is also integrated with the Harris corner detector, so the algorithm can perform sparse tracking, i.e. tracking of the meaningful pixels only. This substantially lowers the computational burden of the method. Moreover, both parts of the algorithm, i.e. corner selection and tracking, are implemented on the GPU and, as a result, the software is immensely fast, allowing for real-time motion tracking on videos in Full HD or even 4K format. To deliver the highest performance, it also supports multiple-GPU systems, where it scales up very well.
Źródło:: Bulletin of the Polish Academy of Sciences. Technical Sciences; 2014, 62, 1; 139-150
0239-7528
Pojawia się w:: Bulletin of the Polish Academy of Sciences. Technical Sciences
Dostawca treści:: Biblioteka Nauki

Artykuł


Skocz do pozycji: 25.

Tytuł:: A Novel GPU-Enabled Simulator for Large Scale Spiking Neural Networks
Autorzy:: Szynkiewicz, P.
Powiązania:: https://bibliotekanauki.pl/articles/307680.pdf
Data publikacji:: 2016
Wydawca:: Instytut Łączności - Państwowy Instytut Badawczy
Tematy:: GPU computing
OpenCL programming technology
parallel simulation
spiking neural networks
Opis:: The understanding of the structural and dynamic complexity of neural networks is greatly facilitated by computer simulations. An ongoing challenge for simulating realistic models is, however, computational speed. In this paper a framework for modeling and parallel simulation of biologically inspired large-scale spiking neural networks on high-performance graphics processors is described. This tool is implemented in the OpenCL programming technology. It enables simulation studies with three neuron models: Integrate-and-fire, Hodgkin-Huxley and Izhikevich. The results of extensive simulations are provided to illustrate the operation and performance of the presented software framework. Particular attention is given to the computational speed-up factor.
Źródło:: Journal of Telecommunications and Information Technology; 2016, 2; 34-42
1509-4553
1899-8852
Pojawia się w:: Journal of Telecommunications and Information Technology
Dostawca treści:: Biblioteka Nauki

Artykuł


Informacja

Wyszukujesz frazę "GPU" wg kryterium: Temat
