Intrinsic evaluation of word embeddings for Turkish
dc.authorid | 0000-0002-4253-8920 | en_US |
dc.contributor.author | Agun, Hayri Volkan | |
dc.contributor.author | Yilmazel, O. | |
dc.date.accessioned | 2021-03-20T20:26:57Z | |
dc.date.available | 2021-03-20T20:26:57Z | |
dc.date.issued | 2020 | |
dc.department | BTÜ, Mühendislik ve Doğa Bilimleri Fakültesi, Bilgisayar Mühendisliği Bölümü | en_US |
dc.description | Newcastle University | en_US |
dc.description | 4th International Symposium on Computer Science and Intelligent Control, ISCSIC 2020 -- 17 November 2020 through 19 November 2020 -- -- 167082 | en_US |
dc.description.abstract | Word embeddings are evaluated through intrinsic and extrinsic tests. Similarity and analogy test are mainly preferred for intrinsic evaluation and natural language processing tasks such as named entity recognition and question answering are prefferred for extrinsic evaluation. Although there are various intrinsic evaluation datasets for English, the datasets for Turkish are very limited and measuring the degree of similarity and relatedness between words without specifying the type of semantic relation. In this paper, we propose an intrinsic evaluation dataset for evaluating different semantic relations other than a synonym, antonym, hypernym, and meronym as well as morphological relations of individual Turkish words. Moreover, we benchmark three publicly available word-embedding models on the proposed dataset and discuss agglutinative characteristics of the Turkish language for language modeling. © 2020 ACM. | en_US |
dc.identifier.doi | 10.1145/3440084.3441184 | en_US |
dc.identifier.isbn | 9781450388894 | |
dc.identifier.scopus | 2-s2.0-85101695574 | en_US |
dc.identifier.scopusquality | N/A | en_US |
dc.identifier.uri | http://doi.org/10.1145/3440084.3441184 | |
dc.identifier.uri | https://hdl.handle.net/20.500.12885/1360 | |
dc.indekslendigikaynak | Scopus | en_US |
dc.institutionauthor | Agun, Hayri Volkan | |
dc.language.iso | en | en_US |
dc.publisher | Association for Computing Machinery | en_US |
dc.relation.ispartof | ACM International Conference Proceeding Series | en_US |
dc.relation.publicationcategory | Konferans Öğesi - Uluslararası - Kurum Öğretim Elemanı | en_US |
dc.rights | info:eu-repo/semantics/closedAccess | en_US |
dc.subject | Deep Learning | en_US |
dc.subject | GAN | en_US |
dc.subject | Infrared Images | en_US |
dc.subject | Object Detection | en_US |
dc.title | Intrinsic evaluation of word embeddings for Turkish | en_US |
dc.type | Conference Object | en_US |