File:Citation Detective WikiWorkshop2020.pdf
From Wikimedia Commons, the free media repository
Original file (1,275 × 1,650 pixels, file size: 595 KB, MIME type: application/pdf, 5 pages)
Summary
Description | English: Machine learning models designed to improve citation quality in Wikipedia, such as text-based classifiers detecting sentences needing citations (“Citation Need” models), have received a lot of attention from both the scientific and the Wikimedia communities. However, due to their highly technical nature, the accessibility of such models is limited, and their usage is generally restricted to machine learning researchers and practitioners. To fill this gap, we present Citation Detective, a system designed to periodically run Citation Need models on a large number of articles in English Wikipedia, and release public, usable, monthly data dumps exposing sentences classified as missing citations. By making Citation Need models usable to the broader public, Citation Detective opens up new opportunities for research and applications. We provide an example of a research direction enabled by Citation Detective, by conducting a large-scale analysis of citation quality in Wikipedia, showing that article citation quality is positively correlated with article quality, and that articles in Medicine and Biology are the most well sourced in English Wikipedia. |
Date | |
Source | Own work |
Author | Miriam (WMF) |
Other versions | |
Licensing
I, the copyright holder of this work, hereby publish it under the following license:
This file is licensed under the Creative Commons Attribution-Share Alike 4.0 International license.
- You are free:
- to share – to copy, distribute and transmit the work
- to remix – to adapt the work
- Under the following conditions:
- attribution – You must give appropriate credit, provide a link to the license, and indicate if changes were made. You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use.
- share alike – If you remix, transform, or build upon the material, you must distribute your contributions under the same or compatible license as the original.
File history
| Date/Time | Dimensions | User | Comment |
---|---|---|---|---|
current | 10:38, 28 February 2020 | 1,275 × 1,650, 5 pages (595 KB) | Miriam (WMF) | User created page with UploadWizard |
File usage on other wikis
The following other wikis use this file:
- Usage on en.wikisource.org
Metadata
Short title | Citation Detective: a Public Dataset to Improve and Quantify Wikipedia Citation Quality at Scale |
---|---|
Image title | |
Author | Ai-Jou Chou, Guilherme Gonçalves, Sam Walton, and Miriam Redi |
Keywords | |
Software used | LaTeX with acmart 2020/02/08 v1.69 Typesetting articles for the Association for Computing Machinery and hyperref 2019/11/10 v7.00c Hypertext links for LaTeX |
Conversion program | pdfTeX-1.40.20 |
Encrypted | no |
Page size | 612 x 792 pts (letter) |
Version of PDF format | 1.5 |