Temat: web mining - Katalog OPAC zbiorów

Skocz do pozycji: 1.

Tytuł:: Towards Finding Scholarly Articles in Internet Using Hadoop MapReduce with Oozie Workflow
Autorzy:: Jurkiewicz, J.
Nowiński, A.
Powiązania:: https://bibliotekanauki.pl/articles/115951.pdf
Data publikacji:: 2013
Wydawca:: Fundacja na Rzecz Młodych Naukowców
Tematy:: Hadoop
web mining
scientific content finding
web page classification
Opis:: An article focuses on the new methods for automatic processing and analysis of the scientific papers. It covers the very first part of this task – discovery and harvesting of scientific publications from the internet. Article is focused on discovery and analysis of the html documents to identify publication resources. Usage of data from Common Crawl project allows operating on large subset of the web pages without a need to perform an expensive crawl of the WWW. We present methods for automatic identification of pages describing scholarly documents in WWW network using html meta headers. Presented set of rules applied to the data achieves reasonable quality. A system based on these tools is also presented. It allows easy operating and transferring output to the COntent ANalysis SYStem(CoAnSys) - a processing and analysis system developed in ICM. For achieving this goal set of MapReduce tasks running with Hadoop And Ozzie has been used. The quality and efficiency of described rules are discussed. Finally future challenges for our system are presented.
Źródło:: Challenges of Modern Technology; 2013, 4, 4; 3-6
2082-2863
2353-4419
Pojawia się w:: Challenges of Modern Technology
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 2.

Tytuł:: Mailing Lists Archives Analyzer
Autorzy:: Rzecki, K.
Riegel, M.
Powiązania:: https://bibliotekanauki.pl/articles/93058.pdf
Data publikacji:: 2006
Wydawca:: Uniwersytet Przyrodniczo-Humanistyczny w Siedlcach
Tematy:: e-mail header
data analyzing
web mining
Opis:: Article describes chance to explore data hidden in headers of e-mails taken from archive of mailing lists. Scientist part of the article presents a way of transforms information enclosed in Internet resources, explains idea of mailing lists archive and points out knowledge can be taken from. Technical part presents implemented and working system analyzing headers of e-mail messages stored in mailing lists archives. Some example results of this experiment are also given.
Źródło:: Studia Informatica : systems and information technology; 2006, 1(7); 117-125
1731-2264
Pojawia się w:: Studia Informatica : systems and information technology
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 3.

Tytuł:: The impact of relevance feedback on web-based information retrieval for horizon scanning applications
Autorzy:: Palomino, Marco A.
Taylor, Tim
McBride, Geoff
Mortimer, Hugh
Owen, Richard
Depledge, Michael
Powiązania:: https://bibliotekanauki.pl/articles/432410.pdf
Data publikacji:: 2013
Wydawca:: Wydawnictwo Uniwersytetu Ekonomicznego we Wrocławiu
Tematy:: horizon scanning
web mining
strategic planning
search engines
Opis:: Horizon scanning is being increasingly regarded as an instrument to support strategic decision making. It requires the systematic examination of data to identify potential threats and opportunities to improve resilience and decrease risk exposure. Horizon scanning may benefit from various retrieval techniques to augment the acquisition of data, though this involves a search for novel and emerging issues without knowing them beforehand. To optimise such a search, we propose the use of relevance feedback, which involves human interaction in the retrieval process so as to improve the results. As a proof-of-concept demonstration, we have carried out a horizon scanning exercise which showed that our utilisation of relevance feedback for horizon scanning applications was able to maintain the retrieval of relevant documents constant over the entire length of the experiment, without any reduction. This represents an improvement over previous studies where relevance feedback was not considered.
Źródło:: Informatyka Ekonomiczna; 2013, 2(28); 77-99
1507-3858
Pojawia się w:: Informatyka Ekonomiczna
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 4.

Tytuł:: Mining indirect association rules for web recommendation
Autorzy:: Kazienko, P.
Powiązania:: https://bibliotekanauki.pl/articles/907860.pdf
Data publikacji:: 2009
Wydawca:: Uniwersytet Zielonogórski. Oficyna Wydawnicza
Tematy:: reguły asocjacji
system zalecany
badanie sieci
association rules
indirect association rules
recommender system
web mining
web usage mining
Opis:: Classical association rules, here called 'direct', reflect relationships existing between items that relatively often co-occur in common transactions. In the web domain, items correspond to pages and transactions to user sessions. The main idea of the new approach presented is to discover indirect associations existing between pages that rarely occur together but there are other, 'third' pages, called transitive, with which they appear relatively frequently. Two types of indirect associations rules are described in the paper: partial indirect associations and complete ones. The former respect single transitive pages, while the latter cover all existing transitive pages. The presented IDARM* Algorithm extracts complete indirect association rules with their important measure-confidence-using pre-calculated direct rules. Both direct and indirect rules are joined into one set of complex association rules, which may be used for the recommendation of web pages. Performed experiments revealed the usefulness of indirect rules for the extension of a typical recommendation list. They also deliver new knowledge not available to direct ones. The relation between ranking lists created on the basis of direct association rules as well as hyperlinks existing on web pages is also examined.
Źródło:: International Journal of Applied Mathematics and Computer Science; 2009, 19, 1; 165-186
1641-876X
2083-8492
Pojawia się w:: International Journal of Applied Mathematics and Computer Science
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 5.

Tytuł:: Asymptotic trust algorithm: extension for reputation systems in online auctions
Autorzy:: Leszczyński, K.
Zakrzewicz, M.
Powiązania:: https://bibliotekanauki.pl/articles/206059.pdf
Data publikacji:: 2011
Wydawca:: Polska Akademia Nauk. Instytut Badań Systemowych PAN
Tematy:: online auction sites
reputation system
trust management
web mining
Opis:: Online auctions have become a big business and the number of auction site users is growing rapidly. These virtual marketplaces give traders a lot of opportunities to find a contracting party. However, lack of physical contact between users decreases the degree of trust. Auction portals require an efficient mechanism for building trust between participants, whereas most of them provide simple participation counts for reputation rating. Moreover, a single opinion has virtually no effect on a big online store that already has many reputation points, so buyers are very hesitant to give negative feedback for fear of retaliation. Consequently, almost no negative feedback is provided1. In this paper we introduce a new trust system called Asymptotic Trust Algorithm (ATA) which prevents many fraud attempts and still is both simple and easy to understand for most users. Our new method can be applied in addition to the participation counts systems currently used by Allegro, eBay and most of other online auction sites because it does not require any additional information other than positive, negative or neutral feedback on transactions. Most importantly, ATA encourages users to submit unbiased comments, regardless of the number of previous transactions.
Źródło:: Control and Cybernetics; 2011, 40, 3; 651-666
0324-8569
Pojawia się w:: Control and Cybernetics
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 6.

Tytuł:: A Comprehensive study: - Sarcasm detection in sentimental analysis
Autorzy:: Ratawal, Yamini
Tayal, Devendra
Powiązania:: https://bibliotekanauki.pl/articles/1159725.pdf
Data publikacji:: 2018
Wydawca:: Przedsiębiorstwo Wydawnictw Naukowych Darwin / Scientific Publishing House DARWIN
Tematy:: Sentimental analysis
Web mining
deep learning
machine learning
opinion mining
text mining
Opis:: Sarcasm detection is one of the active research area in sentimental analysis. However this paper talks about one of the recent issue in sentimental analysis that us sarcasm detection. In our work, we have described different techniques used in sarcasm detection that helps a novice researcher in efficient way. This paper represent different methodologies of carrying out research in this field.
Źródło:: World Scientific News; 2018, 113; 1-9
2392-2192
Pojawia się w:: World Scientific News
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 7.

Tytuł:: A k-Nearest Neighbors Method for Classifying User Sessions in E-Commerce Scenario
Autorzy:: Suchacka, G.
Skolimowska-Kulig, M.
Potempa, A.
Powiązania:: https://bibliotekanauki.pl/articles/308645.pdf
Data publikacji:: 2015
Wydawca:: Instytut Łączności - Państwowy Instytut Badawczy
Tematy:: data mining
e-commerce
k-Nearest Neighbors
k-NN
log file analysis
online store
R-project
supervised classification
web mining
Web store
Web traffic
Web usage mining
Opis:: This paper addresses the problem of classification of user sessions in an online store into two classes: buying sessions (during which a purchase confirmation occurs) and browsing sessions. As interactions connected with a purchase confirmation are typically completed at the end of user sessions, some information describing active sessions may be observed and used to assess the probability of making a purchase. The authors formulate the problem of predicting buying sessions in a Web store as a supervised classification problem where there are two target classes, connected with the fact of finalizing a purchase transaction in session or not, and a feature vector containing some variables describing user sessions. The presented approach uses the k-Nearest Neighbors (k-NN) classification. Based on historical data obtained from online bookstore log files a k-NN classifier was built and its efficiency was verified for different neighborhood sizes. A 11-NN classifier was the most effective both in terms of buying session predictions and overall predictions, achieving sensitivity of 87.5% and accuracy of 99.85%.
Źródło:: Journal of Telecommunications and Information Technology; 2015, 3; 64-69
1509-4553
1899-8852
Pojawia się w:: Journal of Telecommunications and Information Technology
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 8.

Tytuł:: Data mining
Autorzy:: Morzy, Tadeusz
Powiązania:: https://bibliotekanauki.pl/articles/703139.pdf
Data publikacji:: 2007
Wydawca:: Polska Akademia Nauk. Czytelnia Czasopism PAN
Tematy:: data mining
data analysis
evolution of information technology
association analysis
classification
clustering
Web mining
Opis:: Recent advances in data capture, data transmission and data storage technologies have resulted in a growing gap between more powerful database systems and users' ability to understand and effectively analyze the information collected. Many companies and organizations gather gigabytes or terabytes of business transactions, scientific data, web logs, satellite pictures, textreports, which are simply too large and too complex to support a decision making process. Traditional database and data warehouse querying models are not sufficient to extract trends, similarities and correlations hidden in very large databases. The value of the existing databases and data warehouses can be significantly enhanced with help of data mining. Data mining is a new research area which aims at nontrivial extraction of implicit, previously unknown and potentially useful information from large databases and data warehouses. Data mining, also referred to as database mining or knowledge discovery in databases, can help answer business questions that were too time consuming to resolve with traditional data processing techniques. The process of mining the data can be perceived as a new way of querying – with questions such as ”which clients are likely to respond to our next promotional mailing, and why?”. The aim of this paper is to present an overall picture of the data mining field as well as presents briefly few data mining methods. Finally, we summarize the concepts presented in the paper and discuss some problems related with data mining technology.
Źródło:: Nauka; 2007, 3
1231-8515
Pojawia się w:: Nauka
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 9.

Tytuł:: Analiza kontekstu zachowań e-klientów w zależności od dynamiki zmian w nawigacji internetowej względem przeprowadzanych akcji marketingowych
E-customers behaviors context analysis based on the dynamics of changes in web navigation due to marketing action performed
Autorzy:: Dziczkowski, Grzegorz
Juszczuk, Przemysław
Powiązania:: https://bibliotekanauki.pl/articles/593212.pdf
Data publikacji:: 2015
Wydawca:: Uniwersytet Ekonomiczny w Katowicach
Tematy:: Analiza kontekstu
Analiza zachowań
Web usage mining
Context analysis
Customer behavior analysis
Opis:: Handel internetowy pozwala na automatyzację wielu procesów marketingowych oraz na pozyskanie cennych danych o zachowaniu klientów i ich nawigacji na stronach internetowych. Przy użyciu technik eksploracji danych można uzyskać pełną analizę zachowań klienta oraz przeprowadzić segmentację populacji. Sam proces segmentacji populacji nie pozwala jednak na określenie celu klienta, gdyż proces nawigacji jest zmienny w czasie i zależny od zewnętrznych czynników. Określenie celu i zrozumienie potrzeby klienta wymusza wprowadzenie analizy kontekstu zachowań e-klienta. Artykuł przedstawia analizę zachowań e-klientów, segmentację populacji oraz analizę kontekstu zachowań względem przeprowadzanych akcji marketingowych.
E-commerce allows to automate marketing processes and to gain valuable data about customer behavior and their navigation on the website. Using data mining techniques, we can get a complete analysis of customer behavior and to segment the po-pulation. However, population segmentation process does not identify the customer, because the navigation process is unpredictable over time and depends on external factors. This article presents an analysis of the behavior of e-customer, segmentation of population and analysis of the context of population behavior towards marketing actions.
Źródło:: Studia Ekonomiczne; 2015, 216; 26-36
2083-8611
Pojawia się w:: Studia Ekonomiczne
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 10.

Tytuł:: Web-based software system for processing bilingual digital resources
Autorzy:: Dutsova, Ralitsa
Powiązania:: https://bibliotekanauki.pl/articles/677188.pdf
Data publikacji:: 2014
Wydawca:: Polska Akademia Nauk. Instytut Slawistyki PAN
Tematy:: aligned corpus
concordance
data mining
dictionary entry
digital dictionary
search tool
web-interface
web-application
Opis:: Web-based software system for processing bilingual digital resourcesThe article describes a software management system developed at the Institute of Mathematics and Informatics, BAS, for the creation, storing and processing of digital language resources in Bulgarian. Independent components of the system are intended for the creation and management of bilingual dictionaries, for information retrieval and data mining from a bilingual dictionary, and for the presentation of aligned corpora. A module which connects these components is also being developed. The system, implemented as a web-application, contains tools for compilation, editing and search within all components.
Źródło:: Cognitive Studies; 2014, 14
2392-2397
Pojawia się w:: Cognitive Studies
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 11.

Tytuł:: Pozyskiwanie i analiza danych na temat ofert pracy z wykorzystaniem big data
The collection and analysis of the data on job advertisements with the use of big data
Autorzy:: Maślankowski, Jacek
Powiązania:: https://bibliotekanauki.pl/articles/962829.pdf
Data publikacji:: 2019
Wydawca:: Główny Urząd Statystyczny
Tematy:: big data
text mining
web scraping
rynek pracy
labour market
Opis:: Celem artykułu jest zaprezentowanie korzyści wynikających z wykorzystania na potrzeby statystyki publicznej (rynku pracy) narzędzi do automatycznego pobierania danych na temat ofert pracy zamieszczanych na stronach internetowych zaliczanych do zbiorów big data, a także związanych z tym wyzwań. Przedstawiono wyniki eksperymentalnych badań z wykorzystaniem metod web scrapingu oraz text miningu. Analizie poddano dane z lat 2017 i 2018 pochodzące z najpopularniejszych portali z ofertami pracy. Odwołano się do danych Głównego Urzędu Statystycznego (GUS) zbieranych na podstawie sprawozdania Z-05. Przeprowadzona analiza prowadzi do wniosku, że web scraping może być stosowany w statystyce publicznej do pozyskiwania danych statystycznych z alternatywnych źródeł, uzupełniających istniejące bazy danych statystycznych, pod warunkiem zachowania spójności z istniejącymi badaniami.
The goal of this paper is to present, on the one hand, the benefits for official statistics (labour market) resulting from the use of web scraping methods to gather data on job advertisements from websites belonging to big data compilations, and on the other, the challenges connected to this process. The paper introduces the results of experimental research where web-scraping and text-mining methods were adopted. The analysis was based on the data from 2017–2018 obtained from the most popular jobsearching websites, which was then collated with Statistics Poland’s data obtained from Z-05 forms. The above-mentioned analysis demonstrated that web-scraping methods can be adopted by public statistics services to obtain statistical data from alternative sources complementing the already-existing databases, providing the findings of such research remain coherent with the results of the already-existing studies.
Źródło:: Wiadomości Statystyczne. The Polish Statistician; 2019, 64, 9; 60-74
0043-518X
Pojawia się w:: Wiadomości Statystyczne. The Polish Statistician
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 12.

Tytuł:: Web-based Digital Lexicographic Bilingual Resources
Autorzy:: Dutsova, Ralitsa
Powiązania:: https://bibliotekanauki.pl/articles/676996.pdf
Data publikacji:: 2015
Wydawca:: Polska Akademia Nauk. Instytut Slawistyki PAN
Tematy:: web-application
bilingual resources
lexicographic resources
digital resources
dictionary
aligned corpus
data mining
Opis:: Web-based Digital Lexicographic Bilingual ResourcesThe paper presents briefly a web-based system for creation and management of bilingual resources with Bulgarian as one of the paired language. This is useful and easy to use tool for collection and management of a large amount of different linguistic knowledge. The system uses two sets of natural language data: bilingual dictionary and aligned text corpora
Źródło:: Cognitive Studies; 2015, 15
2392-2397
Pojawia się w:: Cognitive Studies
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 13.

Tytuł:: Application of data mining techniques to find relationships between the dishes offered by a restaurant for the elaboration of combos based on the preferences of the diners
Autorzy:: Vazquez, Rosa Maria
Bonilla, Edmundo
Sanchez, Eduardo
Atriano, Oscar
Berruecos, Cinthya
Powiązania:: https://bibliotekanauki.pl/articles/118001.pdf
Data publikacji:: 2019
Wydawca:: Polskie Towarzystwo Promocji Wiedzy
Tematy:: data mining
association rules
apriori algorithm
combos
Web Service
eksploracja danych
reguły asocjacji
algorytm a priori
kombinacje
Opis:: Currently, blended food has been a common menu item in fast food restaurants. The sales of the fast-food industry grow thanks to several sales strategies, including the “combos”, so, specialty, regional, family and buffet restaurants are even joining combos’ promotions. This research paper presents the implementation of a system that will serve as support to elaborate combos according to the preferences of the diners using data mining techniques to find relationships between the different dishes that are offered in a restaurant. The software resulting from this research is being used by the mobile application Food Express, with which it communicates through webservices. References
Źródło:: Applied Computer Science; 2019, 15, 2; 73-88
1895-3735
Pojawia się w:: Applied Computer Science
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 14.

Tytuł:: Portal mapowy jako źródło informacji o terenie pogórniczym na przykładzie Zagłębia Wałbrzyskiego
Geoportal as the source of geographical information about post-mining area based on the example of the Wałbrzych Coal Basin
Autorzy:: Olejnik, K.
Powiązania:: https://bibliotekanauki.pl/articles/122249.pdf
Data publikacji:: 2015
Wydawca:: Politechnika Wrocławska. Wydział Geoinżynierii, Górnictwa i Geologii. Instytut Górnictwa
Tematy:: Zagłębie Wałbrzyskie
teren pogórniczy
GIS
portal mapowy
open source
Wałbrzych Coal Basin
post-mining area
Web Mapping
Opis:: Portal mapowy jest doskonałą formą wizualizacji wszystkich elementów z obszaru dawnego Zagłębia Wałbrzyskiego pozostałych po przemyśle wydobywczym. Dane wykorzystane do przygotowania opracowania pochodziły głównie z map topograficznych i archiwalnych opracowań górniczych. Realizacja omawianego portalu opiera się na tak zwanej architekturze trójwarstwowej. Proces budowy rozpoczęto od przygotowania danych wektorowych w postaci warstw tematycznych. Następnie, według zastosowanego schematu, stworzono strukturę składającą się z bazy danych oraz serwera sieciowego, który umożliwił wyświetlanie obiektów w aplikacji, przeznaczonej dla użytkownika. Całość wykorzystanego oprogramowania jest dystrybuowana na licencjach typu open source.
The post-mining area of Wałbrzych Coal Basin is still undergoing revitalization and contains traces of many mining activities. Web mapping process presented in this paper is a fine way of visualization of all surface objects of the mining industry. Data used in the project has been collected from various sources like topographic maps and mining documentations. Implementation of the geoportal was based on three-tier architecture. The whole process started from preparing spatial data which occurs in vector layer form. The key step was the construction of three connected layers which are: database, server application and presentation trier designed for clients. All of the software used is distributed on open source licenses.
Źródło:: Hereditas Minariorum; 2015, 2; 161-174
2391-9450
2450-4114
Pojawia się w:: Hereditas Minariorum
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Skocz do pozycji: 15.

Tytuł:: Algorithm CFP-SFPwith parallel processing
Autorzy:: Kujawiak, M.
Powiązania:: https://bibliotekanauki.pl/articles/92930.pdf
Data publikacji:: 2008
Wydawca:: Uniwersytet Przyrodniczo-Humanistyczny w Siedlcach
Tematy:: association rules
data mining
web logs
a priori
a priori TID
a priori hybrid algorithm
FP-Tree
Opis:: Existing algorithms for finding association rules do not implement parallel processing. This paper proposes CFP-SFP (Creating Frequent Patterns with Set from Frequent Patterns) algorithm with parallel processing. The research involves running CEP-SEP algorithm with one thread and a dozen or so threads that are executed simultaneously. The research was conducted on a computer with one processor and dual-core processor.
Źródło:: Studia Informatica : systems and information technology; 2008, 1(10); 87-93
1731-2264
Pojawia się w:: Studia Informatica : systems and information technology
Dostawca treści:: Biblioteka Nauki

Artykuł

Zmień widok

na półce

Informacja

Wyszukujesz frazę "web mining" wg kryterium: Temat

Źródło danych

Dostawca treści

Kolekcja

Rok wydania

Wydawca

Temat

Autor

Typ dokumentu

Język