Browsing by Supervisor "Kalamboukis, Theodore"

Now showing 1 - 2 of 2

Aspect based sentiment analysis
Vasilakos-Filippakos, Kyriakos; Athens University of Economics and Business, Department of Informatics; Kalamboukis, Theodore
This work investigates aspect-based sentiment analysis by implementing the classical algorithm of Hu & Liu presented in their paper “Mining Opinion Features in Customer Reviews”. Each step of the algorithm, including the related Apriori algorithm embedded in the process, is explained in detail, highlighting the functionality of each part. Every phase of the process is then revisited, commenting on the problems encountered and proposing solutions as additions and modifications to the original algorithm. Furthermore, a web user interface was developed that allows several parameters to be customized and helps visualize the results and evaluation measures. The evaluation is based on the datasets provided by Hu & Liu and on datasets from the SemEval 2014 workshop. Apart from the common metrics (recall, precision), a new metric is introduced, the Cumulative Distance Discount Gain (CDDG), which measures the similarity between two ranked lists of results. Finally, the evaluation results and the preferred configuration parameters are discussed, along with various observations and some thoughts for future work.
Using clustering to aid text classification of single-labelled datasets
Kyriakopoulou, Antonia; Athens University of Economics and Business, Department of Informatics; Kalamboukis, Theodore
Supervised and unsupervised learning have been the focus of critical research in the areas of machine learning and artificial intelligence. In the literature, these two streams flow independently of each other, despite their close conceptual and practical connections. This dissertation demonstrates that unsupervised learning algorithms, i.e. clustering, can provide us with valuable information about the data and help in the creation of high-accuracy text classifiers. In the case of clustering, the aim is to extract a kind of “structure” from a given sample of objects. The reasoning behind this is that if some structure exists in the objects, it is possible to take advantage of this information and find a short description of the data, exploiting the dependence or association between index terms and documents. This concise representation of the whole dataset can be properly incorporated in the existing data representation. The use of prior knowledge about the nature of the dataset helps in building a more efficient classifier for this set. This approach does not capture all the intricacies of text; however, in some domains this technique substantially improves text classification accuracy. In this vein, a study of the interaction between supervised and unsupervised learning has been carried out. We have studied and implemented models that apply clustering in multiple ways and in conjunction with classification to construct robust text classifiers. Extensive experimentation has shown the effectiveness of using clustering to boost text classification performance. Additionally, preliminary experiments on some of the most important applications of text classification, such as Spam Mail Filtering, Spam Detection in Social Bookmarking Systems, and Sentence Boundary Disambiguation, have shown promising enhancements by exploiting the proposed models.