PDF] YAWN: A Semantically Annotated Wikipedia XML Corpus
Por um escritor misterioso
Descrição
YAWN, a system to convert the well-known and widely used Wikipedia collection into an XML corpus with semantically rich, self-explaining tags, is presented. The paper presents YAWN, a system to convert the well-known and widely used Wikipedia collection into an XML corpus with semantically rich, self-explaining tags. We introduce algorithms to annotate pages and links with concepts from the WordNet thesaurus. This annotation process exploits categorical information in Wikipedia, which is a high-quality, manually assigned source of information, extracts additional information from lists, and utilizes the invocations of templates with named parameters. We give examples how such annotations can be exploited for high-precision queries.
Publishing Scholarly Editions
PDF) Proceedings of the NAACL HLT 2010 Workshop on Computational Linguistics and Writing: Writing Processes and Authoring Aids}
PDF) Proceedings of the NAACL HLT 2010 Workshop on Computational Linguistics and Writing: Writing Processes and Authoring Aids}
Characterizing the hypergraph-of-entity and the structural impact of its extensions, Applied Network Science
Characterizing the hypergraph-of-entity and the structural impact of its extensions, Applied Network Science
PDF) Proceedings of the NAACL HLT 2010 Workshop on Computational Linguistics and Writing: Writing Processes and Authoring Aids}
PDF) 8th Symposium on Languages, Applications and Technologies, SLATE 2019, June 27-28, 2019, Coimbra, Portugal
PDF) Hypergraph-of-entity
PDF) The World Within Wikipedia: An Ecology of Mind
Information_retrieval_and_extraction_IIIT
Characterizing the hypergraph-of-entity and the structural impact of its extensions, Applied Network Science
de
por adulto (o preço varia de acordo com o tamanho do grupo)