Hyphe, a Curation-Oriented Approach to Web Crawling for the Social Sciences

Content

Title
Hyphe, a Curation-Oriented Approach to Web Crawling for the Social Sciences
Date submitted
6 July 2016 at 04:14:04 +00:00
Date
May 2016
Is referenced by
RAVZ8ATV
Abstract
The web is a field of investigation for the social sciences, and platform-based studies have long proven their relevance. However, the generic web is rarely studied in itself, even though it contains crucial aspects of the embodiment of social actors: personal blogs, institutional websites, hobby-specific media… We realized that some sociologists see existing web crawlers as “black boxes” unsuitable for research, even though they are willing to study the broad web. In this paper we present Hyphe, a crawler developed with and for social scientists, with an innovative “curation-oriented” approach. We discuss the problems of using web-mining techniques in social science research and show how to overcome them with specific features such as step-by-step corpus building and a memory structure that lets researchers dynamically redefine the granularity of their “web entities”.
Is part of
International AAAI Conference on Web and Social Media
Publisher
Köln, Germany
Association for the Advancement of Artificial Intelligence
Source
HAL Archives Ouvertes
is compiled by
Lucky Semiosis
is in semantic relation with
crawler
web mining
Complexity
331
Date modified
8 September 2023 at 06:52:58 +00:00
Complexity details
Physique,1,,,,,18,18
Physique,2,,,,,30,60
Actant,2,,,,,5,10
Concept,1,,,,,17,17
Concept,2,,,,,32,64
Rapport,1,1,Physique,Concept,properties,17,17
Rapport,1,1,Physique,Physique,values,17,17
Rapport,1,1,Physique,Actant,dcterms:creator,4,4
Rapport,2,2,Actant,Concept,properties,24,48
Rapport,2,2,Actant,Physique,values,24,48
Rapport,1,1,Physique,Actant,cito:isCompiledBy,1,1
Rapport,1,1,Physique,Concept,skos:semanticRelation,2,2
Rapport,2,2,Concept,Concept,properties,6,12
Rapport,2,2,Concept,Physique,values,6,12
Rapport,1,1,Physique,Physique,uri,1,1
Complexity totals
Physique,2,1,2,48,78
Actant,1,2,2,5,10
Concept,2,1,2,49,81
Rapport,10,1,2,102,162
Existence,15,1,2,204,331
343
Date modified
8 January 2024 at 10:51:47 +00:00
Complexity details
dimension,niv (sujet),niv objet,sujet,objet,prédicat,nb,c
Physique,1,,,,,18,18
Physique,2,,,,,35,70
Actant,1,,,,,1,1
Actant,2,,,,,5,10
Concept,1,,,,,12,12
Concept,2,,,,,33,66
Rapport,1,1,Actant,Concept,properties,12,12
Rapport,1,1,Actant,Physique,values,17,17
Rapport,1,1,Actant,Physique,owner,1,1
Rapport,2,1,Actant,Physique,oa:hasSource,2,4
Rapport,2,2,Actant,Concept,properties,31,62
Rapport,2,2,Actant,Physique,values,31,62
Rapport,1,1,Actant,Actant,dcterms:creator,4,4
Rapport,1,1,Actant,Actant,cito:isCompiledBy,1,1
Rapport,1,1,Actant,Concept,skos:semanticRelation,2,2
Rapport,1,1,Actant,Physique,uri,1,1
Complexity totals
Physique,2,1,2,53,88
Actant,2,1,2,6,11
Concept,2,1,2,45,78
Rapport,10,1,2,102,166
Existence,16,1,2,206,343
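
The totals listed for 8 January 2024 appear to be plain aggregates of the detail rows of the same date. The following Python sketch recomputes them under that reading. It is an illustration, not the method used by the system that produced this record: the function name `complexity_totals` is hypothetical, and the interpretation of the total columns (number of detail rows, minimum subject level, maximum subject level, sum of `nb`, sum of `c`) is an assumption inferred from the figures. In every detail row, `c` also happens to equal `nb` multiplied by the subject level.

```python
import csv
from io import StringIO

# Detail rows copied from the record dated 8 January 2024.
DETAILS = """\
dimension,niv (sujet),niv objet,sujet,objet,prédicat,nb,c
Physique,1,,,,,18,18
Physique,2,,,,,35,70
Actant,1,,,,,1,1
Actant,2,,,,,5,10
Concept,1,,,,,12,12
Concept,2,,,,,33,66
Rapport,1,1,Actant,Concept,properties,12,12
Rapport,1,1,Actant,Physique,values,17,17
Rapport,1,1,Actant,Physique,owner,1,1
Rapport,2,1,Actant,Physique,oa:hasSource,2,4
Rapport,2,2,Actant,Concept,properties,31,62
Rapport,2,2,Actant,Physique,values,31,62
Rapport,1,1,Actant,Actant,dcterms:creator,4,4
Rapport,1,1,Actant,Actant,cito:isCompiledBy,1,1
Rapport,1,1,Actant,Concept,skos:semanticRelation,2,2
Rapport,1,1,Actant,Physique,uri,1,1
"""


def complexity_totals(details_csv):
    """Aggregate detail rows into (rows, min level, max level, nb, c) per dimension."""
    totals = {}
    for row in csv.DictReader(StringIO(details_csv)):
        level, nb, c = int(row["niv (sujet)"]), int(row["nb"]), int(row["c"])
        rows, lo, hi, sum_nb, sum_c = totals.get(
            row["dimension"], (0, level, level, 0, 0)
        )
        totals[row["dimension"]] = (
            rows + 1, min(lo, level), max(hi, level), sum_nb + nb, sum_c + c
        )
    # Assumption: "Existence" is the grand total across all dimensions.
    parts = list(totals.values())
    totals["Existence"] = (
        sum(p[0] for p in parts),
        min(p[1] for p in parts),
        max(p[2] for p in parts),
        sum(p[3] for p in parts),
        sum(p[4] for p in parts),
    )
    return totals


totals = complexity_totals(DETAILS)
for dimension, values in totals.items():
    print(dimension, *values, sep=",")
```

Under these assumptions the printed rows match the totals listed in the record, ending with an overall complexity of 343.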
Collections
Zotero

Annotations

[Annotation #86593] 2022-08-05 01:22:47 Samuel Szoniecky tagging
Value Purpose
The automatic Zotero tagger tagged the document Hyphe, a Curation-Oriented Approach to Web Crawling for the Social Sciences with the tag crawler
classifying
Target selector Selector type
o:Item
[Annotation #86597] 2022-08-05 01:23:06 Samuel Szoniecky tagging
Value Purpose
The automatic Zotero tagger tagged the document Hyphe, a Curation-Oriented Approach to Web Crawling for the Social Sciences with the tag web mining
classifying
Target selector Selector type
o:Item