Wszystkie pola: similarity search - Katalog OPAC zbiorów

Skocz do pozycji: 1.

Tytuł:: Character frequency-based approach for searching for substrings of circular patterns and their conjugates in online text
Autorzy:: Prasad, Vinod
Powiązania:: https://bibliotekanauki.pl/articles/2097972.pdf
Data publikacji:: 2021
Wydawca:: Akademia Górniczo-Hutnicza im. Stanisława Staszica w Krakowie. Wydawnictwo AGH
Tematy:: pattern matching in strings
circular pattern matching
similarity search
substring search
Opis:: A fundamental problem in computational biology is dealing with circular patterns. The problem consists of finding a pattern and its rotations in a database. In this paper, we present two online algorithms. The first algorithm reports all of the substrings (factors) of a given pattern in an online text. Then, without losing efficiency, we extend the algorithm to process the circular rotations of the pattern. For a given pattern P of size M and a text T of size N, the extended algorithm reports all of the locations in the text where a substring of Pc is found where Pc is one of the rotations of P. For an alphabet size σ using O(M) space, the desired goals are achieved in an average O(MN/σ) time, which is O(N) for all patterns with length M ≤ σ. Traditional string-processing algorithms make use of advanced data structures such as suffix trees and automaton. The experimental results we have provided show that basic data structures such as arrays can be used in text-processing algorithms without compromising efficiency.
Źródło:: Computer Science; 2021, 22 (2); 235-250
1508-2806
2300-7036
Pojawia się w:: Computer Science
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 2.

Tytuł:: Virtual screening strategies in drug design – methods and applications
Autorzy:: Bielska, E.
Lucas, X.
Czerwoniec, A.
Kasprzak, J.M.
Kaminska, K.H.
Bujnicki, J.M.
Powiązania:: https://bibliotekanauki.pl/articles/80139.pdf
Data publikacji:: 2011
Wydawca:: Polska Akademia Nauk. Czytelnia Czasopism PAN
Tematy:: virtual screening
drug design method
application
drug discovery
similarity search
drug development
high-throughput screening
medical chemistry
chemical compound
experimental approach
Źródło:: BioTechnologia. Journal of Biotechnology Computational Biology and Bionanotechnology; 2011, 92, 3
0860-7796
Pojawia się w:: BioTechnologia. Journal of Biotechnology Computational Biology and Bionanotechnology
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 3.

Tytuł:: Finding similar documents in web search results
Identyfikowanie dokumentów podobnych w wynikach wyszukiwania w sieci WWW
Autorzy:: Kużelewska, U.
Powiązania:: https://bibliotekanauki.pl/articles/341131.pdf
Data publikacji:: 2012
Wydawca:: Politechnika Białostocka. Oficyna Wydawnicza Politechniki Białostockiej
Tematy:: grupowanie wyników wyszukiwania
podobieństwo dokumentów
grupowanie snippetów
web search results clustering
documents similarity
snippets clustering
Opis:: Searching the Web is a challenging task. According to the Zamir and Etzioni’s definition, Internet is “unorganized, unstructured and decentralized place”. Although there are powerful search engines available, the number of indexed web pages exceeds 1 trillion [20] and still grows. Most of the search engines return list of documents from their bases sorted according to their relevance to a search query. Such approach is not the best, because the returned list is very long and may contain documents not related to the query. To increase efficiency of a searching process one may identify groups of similar documents from result list. One of the tools to do it are traditional clustering algorithms. The article presents clustering Web search results directly from a search engine as well as sets created from results for different queries. Documents were grouped using the following methods: EM and XMeans.
Przeszukiwanie sieci WWW jest niezmiernie trudnym zadaniem. Według Zamira i Etzioniego Internet to "miejsce bez struktury, niezorganizowane i zdecentralizowane". Chociaz istnieją potężne narzędzia w postaci wyszukiwarek internetowych, ich użycie staje się z czasem trudniejsze, gdyż ilość zaindeksowanych stron internetowych przekracza 1 bln [20] i nadal rośnie. Większość wyszukiwarek generuje wyniki posortowane według ich zgodności z treścią zapytania w postaci bardzo długich list. Takie podejście nie jest najlepszym rozwiązaniem z powodu rozmiaru list oraz zawierania w nich dokumentów nie związanych z zapytaniem. W celu zwiększenia efektywności przeszukiwania Internetu można ˙ zastosowac grupowanie podobnych dokumentów z generowanej przez wyszukiwarki listy wyników. Jednym z takich narzędzi są tradycyjne algorytmy grupujące. W artykule przedstawiono wyniki grupowania dokumentów bezpośrednio z listy zwróconej przez wyszukiwarkę oraz zbiorów dokumentów utworzonych z wyników wyszukiwania dla kilku zapytań. Wykorzystano następujące metody grupujące: EM i XMeans.
Źródło:: Zeszyty Naukowe Politechniki Białostockiej. Informatyka; 2012, 9; 61-76
1644-0331
Pojawia się w:: Zeszyty Naukowe Politechniki Białostockiej. Informatyka
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 4.

Tytuł:: Automatic alignment of intramodal tomographic data using s-distance approach
Autorzy:: Król, Z.
Powiązania:: https://bibliotekanauki.pl/articles/334027.pdf
Data publikacji:: 2003
Wydawca:: Uniwersytet Śląski. Wydział Informatyki i Nauki o Materiałach. Instytut Informatyki. Zakład Systemów Komputerowych
Tematy:: rejestracja
podobieństwo pomiaru
topografia obszaru przeszukiwania
registration
similarity measure
s-distance
cross-metric
intramodality data
search space topography
Opis:: The alignment of volumetric datasets is an important problem in the processing of medical data. It is a prerequisite to numerous image based applications in diagnostic and therapeutic routines. In this paper, a new method is proposed for matching of 3D intramodality medical images. Our approach is based on some generalization of feature distance definition. Analogous to the standard surface matching, our algorithm uses also the chamfer distance like metric to define the quality of match function, however, the evaluation of the distance map is performed in a different way. The s-distance method is a step towards an automatic extraction of features, where each feature’s role in the registration process is weighted based on its relative statistical or spatial significance. As an alternative to the user-dependent non-automatic registration methods this approach offers a good assessment of similarity in the intramodality case. The elimination of less significant features in the registration process has resulted in a greatly improved efficiency over the voxel-based methods. Studying certain properties of the search space topography provides some insights into the performance of the proposed method as well as the standard registration algorithms in the rigid body registration problem.
Źródło:: Journal of Medical Informatics & Technologies; 2003, 6; IT3-11
1642-6037
Pojawia się w:: Journal of Medical Informatics & Technologies
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Informacja

Wyszukujesz frazę "similarity search" wg kryterium: Wszystkie pola

Źródło danych

Dostawca treści

Kolekcja

Rok wydania

Wydawca

Temat

Autor

Typ dokumentu

Język