Commons:Village pump

Shortcut: COM:VP

Welcome to the Village pump

This page is used for discussions of the operations, technical issues, and policies of Wikimedia Commons. Recent sections with no replies for 7 days and sections tagged with {{Section resolved|1=--~~~~}} may be archived; for old discussions, see the archives; the latest archive is Commons:Village pump/Archive/2024/10.

Please note:


  1. If you want to ask why unfree/non-commercial material is not allowed at Wikimedia Commons or if you want to suggest that allowing it would be a good thing, please do not comment here. It is probably pointless. One of Wikimedia Commons’ core principles is: "Only free content is allowed." This is a basic rule of the place, as inherent as the NPOV requirement on all Wikipedias.
  2. Have you read our FAQ?
  3. For changing the name of a file, see Commons:File renaming.
  4. Any answers you receive here are not legal advice and the responder cannot be held liable for them. If you have legal questions, we can try to help but our answers cannot replace those of a qualified professional (i.e. a lawyer).
  5. Your question will be answered here; please check back regularly. Please do not leave your email address or other contact information, as this page is widely visible across the internet and you are liable to receive spam.

# 💭 Title 💬 👥 🙋 Last editor 🕒 (UTC)
1 Hosting HDR images as JPEG with gain map 1 1 C.Suthorn 2024-11-01 07:41
2 Google's semi-censorship of Wikimedia Commons must end 33 11 Adamant1 2024-10-31 21:34
3 Admin action rational 18 7 L. Beck 2024-10-25 07:34
4 Patent search 1 1 Richard Arthur Norton (1958- ) 2024-10-16 00:43
5 Picture of the Year 2022 finalist with an undeclared fake background: what should be done? 11 7 Giles Laurent 2024-10-25 20:38
6 Clear Category:Symbols of municipalities in Japan used in Wikipedia articles with vector versions available 3 2 Jmabel 2024-10-25 18:42
7 Mass uploads works very bad for me 4 3 4300streetcar 2024-10-30 01:25
8 110 Million files 1 1 PantheraLeo1359531 2024-10-27 15:20
9 Commons talk:Nudity categories 4 3 Jmabel 2024-10-29 01:40
10 I messed up making a mass deletion request 2 2 RoyZuo 2024-11-01 18:57
11 Flickr license and license in embedded metadata differ 14 5 RobbieIanMorrison 2024-10-30 21:56
12 Final Reminder: Join us in Making Wiki Loves Ramadan Success 0 0
13 Your input... 9 4 Enhancing999 2024-11-01 07:50
14 MediaWiki_talk:Gadget-Cat-a-lot.js 2 2 Prototyperspective 2024-10-29 12:10
15 https://ocr.wmcloud.org/ 6 5 Enhancing999 2024-10-31 21:33
16 Views through mobile phones 4 4 ReneeWrites 2024-10-30 20:21
17 Category:Musical groups by genre 5 2 Jmabel 2024-10-31 01:02
18 Almost 400k files need license review 15 8 MGA73 2024-11-01 19:40
19 Help interpreting photographs from the Velvet Revolution in Prague in 1989 4 2 RobbieIanMorrison 2024-11-01 15:59
20 Obtuse bot created categories 13 9 Adamant1 2024-11-01 18:42
21 Speedy deletion: F3. Derivative work of non-free content 4 3 Yann 2024-11-01 13:47
22 Commons Gazette 2024-11 1 1 RoyZuo 2024-11-01 19:15
23 Derivative works (FOP etc.) 2 2 Richard Arthur Norton (1958- ) 2024-11-02 01:17
Centralized discussion
See also: Village pump/Proposals   ■ Archive

SpBot archives all sections tagged with {{Section resolved|1=~~~~}} after 1 day and sections whose most recent comment is older than 7 days.

September 23

Hosting HDR images as JPEG with gain map

The tools for creating and displaying High Dynamic Range (HDR) images are starting to mature. HDR displays can render much brighter highlights than before, which leads to a big qualitative improvement in an image. Software for HDR production, and web-browser support, are becoming widespread. (Note that this is distinct from the tone-mapped HDR images you may have seen for the past decade or so.)

This post is partly a response to User:Hym3242 and User:PantheraLeo1359531 in Commons:Village pump/Archive/2024/08#Can I upload bt2020nc/bt2020/smpte2084(PQ) HDR AVIF images to commons and use them in wikipedia articles?. I was wondering the same thing, so I uploaded a couple files to see how well Commons would support them. They are formatted as JPEG with a gain map. The promise of this format is that it is backward-compatible with systems that process and serve standard JPEG. The base image is a JPEG, usable on any device. HDR information is inserted in the file as metadata. In the worst case HDR metadata is lost, resulting in a standard image. In the best case HDR metadata is preserved, the end-user has an HDR-capable display and web browser, and the image looks great.

My test results are at Category:HDR gain-mapped images. Both images survived the process of uploading and rendering previews. HDR metadata was stripped from preview images, but preserved in the original uploads. If you have a newish HDR screen and a compliant web browser, the originals of this house and this church will appear brighter than usual. The effect on the house is subtle, limited to where sunlight hits white paint. The effect on the church is more dramatic: the windows should appear much brighter than the rest of the interior.
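For anyone who wants to check whether a given file (for example an original versus a generated preview) still carries the gain-map metadata, here is a rough sketch of a byte-level heuristic. It assumes the file uses the Adobe-style gain-map XMP namespace `http://ns.adobe.com/hdr-gain-map/1.0/`; files written by other encoders may mark the gain map differently, so a negative result is not conclusive. The function names are made up for illustration.

```python
# Heuristic sketch: does a JPEG appear to carry gain-map metadata?
# Assumption: the encoder wrote XMP using the Adobe-style gain-map namespace below.
# Other gain-map encodings may not contain this string at all.
GAIN_MAP_XMP_NAMESPACE = b"http://ns.adobe.com/hdr-gain-map/1.0/"
JPEG_START_OF_IMAGE = b"\xff\xd8"


def looks_like_gain_map_jpeg(data: bytes) -> bool:
    """Return True if the bytes start like a JPEG and mention the gain-map namespace."""
    if not data.startswith(JPEG_START_OF_IMAGE):
        return False
    return GAIN_MAP_XMP_NAMESPACE in data


def check_file(path: str) -> bool:
    """Read a file from disk and apply the same heuristic."""
    with open(path, "rb") as handle:
        return looks_like_gain_map_jpeg(handle.read())
```

Running `check_file` on a downloaded original and on one of its previews would be one way to repeat the comparison described above, under the stated assumption about the namespace.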

Most users of Commons images will see one of the smaller standard files, so for now the benefits of publishing this sort of content are limited. Are there any downsides to publishing it on Commons?

This post isn't marked as a proposal, because hosting these images on Commons works already. At a later date, when the standards are settled and the hardware is widely available, it would be nice to preserve HDR metadata in the generated preview images. — Preceding unsigned comment added by Semiautonomous (talk • contribs) 23:51, 23 September 2024 (UTC)[reply]

A phab task would need to be created for "include gain map of images into thumbs" - C.Suthorn (@Life_is@no-pony.farm - p7.ee/p) (talk) 07:41, 1 November 2024 (UTC)[reply]

October 14

Google's semi-censorship of Wikimedia Commons must end

Please see meta:Community Wishlist/Wishes/Do something about Google & DuckDuckGo search not indexing media files and categories on Commons. I think we can and should do something about Google not indexing most files (including all videos) and category pages on Commons. Prototyperspective (talk) 15:42, 14 October 2024 (UTC)[reply]

It is a private company and, if not violating the law, they can do whatever (...) they want. If they choose to ignore stuff on Commons, that's fine. Alexpl (talk) 20:02, 14 October 2024 (UTC)[reply]
I was not saying it's illegal. That may be fine according to law. I wonder if it's fine to Commons that users' contributions are just blacked out and not available to people. Prototyperspective (talk) 21:39, 14 October 2024 (UTC)[reply]
Huge file sizes for photos are a cost factor when it comes to processing and are almost never worth it anyway. I don't blame them for not wanting photos with three-digit megabyte sizes to show up whenever somebody types in a generic search term. Alexpl (talk) 14:13, 15 October 2024 (UTC)[reply]
This seems offtopic. 1. Most files on WMC are not many MBs large and this is not about some particular few large files. 2. It only shows gstatic thumbnails in Google Search, not the whole image, and it's the same for DDG and other search engines.
It's absurd to argue that Google's storage or processing would have notable issues that, out of the millions of indexed websites, make WMC one whose media is not findable.
You can of course defend anti-WMC practices, although I don't understand why Commons contributors would be supportive of that, but this point does not make sense, partly because this isn't about the <0.1% of WMC files that are large image files to begin with. Prototyperspective (talk) 14:33, 15 October 2024 (UTC)[reply]
This is not the first time I have seen you try to dismiss comments with which you disagree as "off topic", when they are not. Please do not do that. Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 15:46, 15 October 2024 (UTC)[reply]
I said it seems offtopic, and I did not dismiss the comment but addressed it comprehensively. When I say it seems offtopic, that is for example because I may have misunderstood it and/or the user may want to clarify how it would be ontopic. I do wonder why you're so sensitive about me using the word offtopic. The user did say something but did not explain how it relates to this subject, and clarifying that with clear language is, I think, more constructive than beating around the bush. Prototyperspective (talk) 16:41, 15 October 2024 (UTC)[reply]
There already is a thumbnail for every file here anyway so not even any need to create any anew. Prototyperspective (talk) 15:30, 15 October 2024 (UTC)[reply]
See also meta:Talk:Community Wishlist/Wishes/Do something about Google & DuckDuckGo search not indexing media files and categories on Commons. Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 20:41, 14 October 2024 (UTC)[reply]
There is a commercial interest in steering the search results to commercial and social websites. These generate clicks; Commons does not. I do have the impression that Google is much more interested in the SDC of files than in the Commons categories. Every effort should be made to fill in the P:P180. Google certainly uses the labels in Wikidata as a data feed for the search engines. They are also used for training the translation software. Smiley.toerist (talk) 10:12, 15 October 2024 (UTC)[reply]
Wikipedia itself is indexed rather highly on Google search results though. And it does index images that are used in Wikipedia articles, but this treatment isn't extended to the other Wikimedia projects. (I can't speak for other media files however). ReneeWrites (talk) 18:26, 15 October 2024 (UTC)[reply]
Yes, Wikipedia is, but not Commons, the second largest Wikimedia project, with a type of content that lots of people are interested in, watch and search for (media of all kinds). It does not index any video on here (at least in my tests I could not find any so far, even when searching for the exact title), and images, I think, are only indexed when they're used in Wikipedia articles, and even then they are often missing from the main results. One part of the proposal is systematic tests/investigations so there is some data on this. I think overall the indexing is pretty bad, even when one is searching for a subject for which WMC has lots of high-quality content and the other image results shown are fairly low-quality. One could also focus on the videos. Prototyperspective (talk) 20:32, 15 October 2024 (UTC)[reply]
Google often indexes images that are not in a Wikipedia article. I find plenty if I do specifically an image search. But it doesn't tend to list pages that are mainly an image in its general results, so Commons image pages often don't show in the result if you do a general Google search. - Jmabel ! talk 05:11, 16 October 2024 (UTC)[reply]
Rarely it does, but indexing a random tiny subset of files doesn't change anything about the issue and only makes it harder to notice. I did not find plenty of images for prior searches where I then either used an image not from WMC, even though I know WMC has well-organized images that are at least as good, or used the WMC search. Again, investigations are the first step of what is proposed, so maybe you could share your searches. Images certainly shouldn't show up in the general search results (well, nearly always). I made it clear that this is about the Images and Videos tabs of these sites; only when it comes to category pages is this about the general search results. I currently don't have many good examples. Things I searched for (those may not be the best examples) I think included roughly Rivers from space and Algae blooms from space and Satellite picture of cities at night. This is not about Google & DDG not indexing any files on WMC. Please let me know if that should be clearer in the proposal. It is about them indexing only very few images (and those are not even the most relevant or best) when it should be many (e.g. in searches where WMC has lots of well-organized files), leaving nearly all categories out of the results, and not indexing any videos. Maybe it should be clearer that this isn't necessarily all Google's fault: the investigations may reveal things the Wikimedia community & tech could do to improve its inclusion in external search results. However, such steps depend on investigations and don't mean steps 2 & 3 are invalid; other things could follow up on that step in addition and shape these two. Prototyperspective (talk) 11:30, 16 October 2024 (UTC)[reply]
@Prototyperspective: Colourpicture Publishers. There aren't that many results to begin with, but maybe it's at the top because the category has a description that contains the company's name in it? --Adamant1 (talk) 01:21, 18 October 2024 (UTC)[reply]
  • Yes, that's the kind of investigations I'm proposing are done large scale and in systematic ways (and well visibly e.g. published in diff) so we can identify cases that are well indexed, find out why, and identify cases that should be well-indexed but aren't and so on.
It could be that it's at the top because it contains a long descriptive category description – which most cats however don't really need because the category title is self-explanatory – as well as an infobox with all sorts of data. It's not unlikely also because there's few other websites with info on that subject, especially not recent ones that are linked from other pages. As a result of findings like your example, one could for example conduct tests (and/or check the theory via the dataset) whether it's the company's name in the description that caused the cat to show up this high or the description and consider things like adding category-descriptions (partly automatically via WP article leads and/or Wikidata item description). An open letter doesn't have to be as provocative and confrontational as the title of this thread, one could nicely ask Google & Co to improve their results by considering specific things or identified requested changes. Relevant to that is that Google & Co heavily make use of Wikimedia content in all sorts of ways but this isn't about fairly giving back (some media attention however could be due to that and reference that): it would be about them improving their search results for everyone so it shows media or pages that the person searching would likely find useful (e.g. via considering how many files and how many Wikipedia-used files are contained in the category). (When it comes to videos however it seems like purposeful exclusion.) Prototyperspective (talk) 08:24, 18 October 2024 (UTC)[reply]
Google clearly does take these images into account. I looked up a handful of terms:
Google Images searches
  • hubble extreme deep field (1 top result from WMF projects)
  • pando tree (2 top results from WMF projects)
  • tokyo tower (2 top results from WMF projects)
  • african renaissance monument (2 top results from WMF projects)
  • burj khalifa (2 top results from WMF projects)
  • gutenberg bible (2 top results from WMF projects)
  • ka'ba (7 top results from WMF projects)
  • michelangelo david (3 top results from WMF projects)
  • mount denali (3 top results from WMF projects and 1 from Wikiwand, which mirrors Wikipedia)
  • keyboard (0 top results from WMF projects. In this case, it gave me stores near me to buy keyboards, which makes perfect sense, if you ask me.)
  • hurricane milton (1 top result from WMF projects)
  • vladimir putin (1 top result from WMF projects)
  • mitochondrion (1 top result from WMF projects)
  • october revolution (2 top results from WMF projects)
  • northern lights (0 top results from WMF projects)
  • train (3 top results from WMF projects)
  • barcelona (1 top result from WMF projects)
  • mesopotamia (2 top results from WMF projects)

If you narrow your search to CC images, you get more from Flickr and Commons:

Google Images searches - Narrowed to Creative Commons
  • hubble extreme deep field (4 top results from WMF projects)
  • pando tree (4 top results from WMF projects)
  • tokyo tower (4 top results from WMF projects)
  • african renaissance monument (6 top results from WMF projects)
  • burj khalifa (7 top results from WMF projects)
  • gutenberg bible (4 top results from WMF projects)
  • ka'ba (5 top results from WMF projects, decreased)
  • michelangelo david (6 top results from WMF projects)
  • mount denali (3 top results from WMF projects)
  • keyboard (4 top results from WMF projects)
  • hurricane milton (1 top result from WMF projects)
  • vladimir putin (4 top results from WMF projects)
  • mitochondrion (16(!) top results from WMF projects)
  • october revolution (1 top result from WMF projects, decreased)
  • northern lights (3 top results from WMF projects)
  • train (4 top results from WMF projects)
  • barcelona (2 top results from WMF projects)
  • mesopotamia (5 top results from WMF projects)

I don't believe there even is a problem. Sure, results from WMF projects are only 1 or 2 in many cases, but:

  1. it's not like there was any other site that did have a majority of the top results
  2. you can improve them by searching for CC content
  3. Wikipedia was almost always in the results, even if they didn't have a majority in the top images (which there's no reason it should, might I add). I can't say the same about other results I saw, like Britannica, NatGeo, Adobe Stock, etc.
Google is showing results from Wikipedia, Commons, and even smaller projects like Wikispecies and Wikivoyage, at times. I wouldn't put it past them that they're prioritizing commercial and social sites that run Google Ads (purely speculation on my part, don't take my word for it), but I find it hard to believe that they're straight up censoring, shadowbanning, or otherwise limiting results from WMF projects. Rubýñ (Scold) 17:21, 15 October 2024 (UTC)[reply]
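To make the two lists above easier to compare, here is a small sketch that tallies them. The numbers are copied from the lists as posted; the variable and function names are just for illustration, and "top results" is the poster's informal count from one set of searches, not a measured ranking.

```python
# WMF "top results" per query, copied from the two lists above.
standard_search = {
    "hubble extreme deep field": 1, "pando tree": 2, "tokyo tower": 2,
    "african renaissance monument": 2, "burj khalifa": 2, "gutenberg bible": 2,
    "ka'ba": 7, "michelangelo david": 3, "mount denali": 3, "keyboard": 0,
    "hurricane milton": 1, "vladimir putin": 1, "mitochondrion": 1,
    "october revolution": 2, "northern lights": 0, "train": 3,
    "barcelona": 1, "mesopotamia": 2,
}
creative_commons_search = {
    "hubble extreme deep field": 4, "pando tree": 4, "tokyo tower": 4,
    "african renaissance monument": 6, "burj khalifa": 7, "gutenberg bible": 4,
    "ka'ba": 5, "michelangelo david": 6, "mount denali": 3, "keyboard": 4,
    "hurricane milton": 1, "vladimir putin": 4, "mitochondrion": 16,
    "october revolution": 1, "northern lights": 3, "train": 4,
    "barcelona": 2, "mesopotamia": 5,
}


def total_and_mean(counts: dict[str, int]) -> tuple[int, float]:
    """Return the total number of WMF results and the mean per query."""
    total = sum(counts.values())
    return total, total / len(counts)


def compare(before: dict[str, int], after: dict[str, int]) -> tuple[int, int, int]:
    """Count queries whose WMF result count increased, stayed equal, or decreased."""
    increased = sum(1 for query in before if after[query] > before[query])
    unchanged = sum(1 for query in before if after[query] == before[query])
    decreased = sum(1 for query in before if after[query] < before[query])
    return increased, unchanged, decreased
```

With these figures the totals are 35 WMF results across the 18 standard searches and 83 across the same searches narrowed to Creative Commons; 14 queries went up, 2 stayed the same, and 2 went down.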
I haven't repeated all the searches to test this, but with the ones I did I only got 1 result from WMF, and it was the image in the infobox of the Wikipedia article about the subject. ReneeWrites (talk) 20:29, 15 October 2024 (UTC)[reply]
  • I personally use Ecosia to search for things, and I often just type something into Ecosia rather than search for it here, because I am too lazy to use the convoluted Wikimedia internal search method (yes, using external websites to find something is oftentimes easier than the internal "search" engines on Wikimedia websites). But I noticed that in the past few months Ecosia has been suppressing non-Wikipedia Wikimedia websites more. This seems to coincide with the switch where Ecosia now mixes in Google Search results with those from Microsoft Bing; before this change Ecosia exclusively used Microsoft Bing. I used Microsoft Bing as my main search engine from around 2011 or 2012, switched to Ecosia a couple of years ago (after I saw one of their advertisements on Google YouTube), and I occasionally compare it with Google Search and other search engines. Judging by the fact that Google Search suppresses Wikimedia Commons and Microsoft Bing does this to a lesser extent, I assume that this is likely a deliberate choice by those companies. But it could also be something internal at Wikimedia websites, as all non-article space pages at Wikipedia are also excluded from search engines (meaning that someone cannot find any Wikipedia policy pages unless they look for them within Wikipedia, which I've always found to be a rather odd choice).
Now, we know that Google Search, Microsoft Bing, Ecosia, DuckDuckGo, Yahoo! Search, etc. all heavily rely on Wikidata, so perhaps linking all Wikimedia Commons category pages with Wikidata items might help integrate this website better with search engines. If you think about it, the exclusion of the Wikimedia Commons is exclusively the exclusion of the Wikimedia Commons: I have no trouble finding results from the Wiktionary or Wikivoyage, which probably means that the integration between Wikidata and other Wikimedia websites helps them. Now, I know that "SEO" is considered "a curse word among Wikimedians", but if we want the Wikimedia Commons to show up in search results we most likely do need to link to Wikidata and properly use redirects, alternative titles, translations, etc. in a way that makes sense. For example, alternative titles work on Wikipedia: if you search for "Communist Germany" in a search engine you'll find the DDR, because "Communist Germany" is a redirect at Wikipedia. Meanwhile, we tend to have highly specific titles, and redirects are typically deleted. But my guess is that the main culprit is the lack of Wikidata integration at the Wikimedia Commons. I wonder if files with more optimised structured data also show up in search engine results more, as these are dependent on Wikidata items. Alternatively, we could compare whether categories with or without Wikidata integration show up more in internet search engines. --Donald Trung 『徵國單』 (No Fake News 💬) (WikiProject Numismatics 💴) (Articles 📚) 18:52, 19 October 2024 (UTC)[reply]
Thanks for this interesting info contribution.
  • Comparing indexing results between search engines like this and across time (especially after algorithms were reported to have changed, although that is probably often not announced) could help identify causes and potential mitigation measures.
  • I never noticed or thought about search engines not indexing policy and meta pages of Wikimedia sites (non-WMC); if so, I think that is also something that would be good to change if possible. For example, new editors or readers may search for these with a search engine instead of the internal one. If they search for meta/help pages on Commons, it's quite possible they can't find them, because they don't show up in the search results even when they are in MediaSearch's Categories and Pages tab (issue #8 here).
  • "[Google & Co] all heavily rely on Wikidata": that good integration with Wikidata is a cause of SE indexing (or of good indexing), and that improving that integration would help, are two hypotheses that could be tested. I do not think this is much of a factor, because category pages that are linked to Wikidata items also do not show up, and only a tiny subset (< 0.01%) of files are used in Wikidata items or usable there, while most items are somewhere underneath a category that is linked to a Wikidata item. I think 'it's not linked to a Wikidata item' or 'it doesn't have structured data depicts statements' would be not much more than false excuses (not necessarily deliberate) for not indexing, and I don't see why indexing would rely on or require it, or why that should be expected. Moreover, some categories should probably be well-indexed without being linked to a Wikidata item, or linking them would be inappropriate or at least can't be done at scale(?). For example, Category:Drone videos, with lots of organized content, can't even be found in DuckDuckGo when searching for drone videos wiki (btw I think it should also show up high for searches like free drone videos). The linked proposal however is interesting, but I have doubts that this can be done at scale and that it affects the SE much. Data suggesting that it has any significant effect is also missing. So I don't think it would solve this, e.g. videos on WMC still don't show up in the videos tab and many large categories are already linked.
  • "and properly use redirects, alternative titles, translations, Etc. in a way that makes sense": Agree. One option is to sync ENWP redirects of items to WMC so WMC has the same redirects [i.e. a tool for doing so]. Another is Adding machine translated category titles, and this could also be implemented via redirects and be extended to category descriptions. This however is another case that I don't think should be required for the pages to show up in search results, only to improve them. It's possible that this would solve this, even if it shouldn't be that way, due to how pages are ranked. Note that this may require that the category page is an actual URL with an actual title and not the same URL with some JavaScript dynamically changing the title depending on the user language. Another option, creating redirects of translated titles (Category:Tiere (de; only plural form not singular) currently redirects to Category:Animals), can't be done at scale and may cause issues (such as HotCat autocompletes).
  • In any case such comparison data would be great even if it's just a small factor (I doubt it's the main culprit for the plural indexing issues).
Prototyperspective (talk) 20:03, 19 October 2024 (UTC)[reply]
From everything I've been able to tell, Google does index pages in "Commons" space. For example, do a Google search on "structured data commons" (no quotes). - Jmabel ! talk 16:43, 20 October 2024 (UTC)[reply]
Yes, this is known; e.g. the intro is already about "most" files, not "all" files, as well as the results' ranking/findability. I have yet to see a WMC video in the videos tab, however. Prototyperspective (talk) 16:46, 20 October 2024 (UTC)[reply]
Sorry I misunderstood your comment Jmabel – it's addressing point #2 and you're right on that.
Some examples of low-views useful major categories below. Please comment if anybody knows more in regards to why Videos on WMC are not showing in the Videos tab of Google, DuckDuckGo, etc. Maybe one could ask them or see if there's any other large websites whose videos are not shown there (and why).
  • Category:Our World in Data
  • Category:Sustainable transport
  • Category:Science
  • Category:Drone videos
  • Category:Time-lapse videos
  • Category:Audio files of music
  • Prototyperspective (talk) 17:23, 26 October 2024 (UTC)[reply]
    The 14th most viewed page and the second most viewed category on Commons [1] is also a video category [2]. Views on all Commons pages are quite low; there is nothing special about videos on Commons. GPSLeo (talk) 19:13, 26 October 2024 (UTC)[reply]
    Yes, even the Commons pages with the most views get few views, which is consistent with the problem description in the proposal. I did not suggest there was something special about videos, except that none of them are indexed and shown in the videos tab of the search engines. Prototyperspective (talk) 19:29, 26 October 2024 (UTC)[reply]
    It's a good thing if Google keeps us a relative secret. This is a databank for a select audience that is hopefully using items for creating content, or for research. It's not a social media website for easy access to every airhead in creation; we don't need the level of vandalism that would surely follow.
    As a matter of fact, we scavenge off commercial websites; without them, we would have limited access to new material. It would be detrimental to attempt to replace them; no good would come of it. Broichmore (talk) 12:26, 29 October 2024 (UTC)[reply]
    Even for a "select audience" it's known, used and discoverable far too little. They also use the Videos tab, for example. Moreover, I do not agree with this elitism. Free media and free knowledge is about society overall, not some very small group. With increased use, there would also be more contributors who watch pages. Wikipedia is used much more and is not overrun by vandalism; vandalism probably doesn't increase linearly with increased public use, and even if it did, there can be and are technological means to detect it. The site would not replace commercial websites even if it were far more popular. I do not agree that we scavenge off these either. Prototyperspective (talk) 12:54, 29 October 2024 (UTC)[reply]
    So, to wrap this up: you want to upload stuff on Commons and have it shown in Google's services in a predictable way. This would only make sense for either advertising or some sort of campaigning and that is "no bueno". Alexpl (talk) 15:43, 30 October 2024 (UTC)[reply]
    No this doesn't wrap it up at all and it's entirely unrelated to advertising or some sort of ad-like campaigning. It's also not about a "predictable way". Prototyperspective (talk) 16:03, 30 October 2024 (UTC)[reply]
    Sure. Alexpl (talk) 18:30, 31 October 2024 (UTC)[reply]
    It's too bad the Phabricator ticket is stalled. It doesn't seem like anything else can be done about it outside of that, though. --Adamant1 (talk) 19:15, 31 October 2024 (UTC)[reply]
    I named three specific things in the linked proposal. These things can be done. Prototyperspective (talk) 21:11, 31 October 2024 (UTC)[reply]
    Sure, but I was specifically referring to this discussion, not suggestions you've made in other proposals. Can anything be done about it in this conversation? Probably not. Can things be done about it in other conversations or places? Maybe. But I'm not replying to someone else in another conversation now, am I? --Adamant1 (talk) 21:34, 31 October 2024 (UTC)[reply]

    October 27

    Mass uploads works very bad for me

    Hello. I described my problem in ticket phab:T378276. Has anyone had the same problem in the last few days? MBH 09:22, 27 October 2024 (UTC)[reply]

    I have had longer delays when uploading slightly larger files over the last few days. Maybe there is a connection? --PantheraLeo1359531 😺 (talk) 15:23, 27 October 2024 (UTC)[reply]
    I've also noticed it taking unusually long to upload and process files the last few days. 4300streetcar (talk) 16:21, 27 October 2024 (UTC)[reply]
    Seems to be fixed now 4300streetcar (talk) 01:25, 30 October 2024 (UTC)[reply]

    110 Million files

    Commons is set to have 110 Million files soon. Another milestone :) --PantheraLeo1359531 😺 (talk) 15:20, 27 October 2024 (UTC)[reply]

    Commons talk:Nudity categories

    At Commons talk:Nudity categories#Editorializing?, Prototyperspective and I clearly disagree about a recent edit he made to the project page in question. I don't think the two of us will reach any consensus without the involvement of third parties. Discussion should presumably take place there rather than here (other than the fact that I invited Prototyperspective to comment here if they think my wording in this notification is not neutral). - Jmabel ! talk 19:31, 27 October 2024 (UTC)[reply]

    1. It does not matter much whether it's neutral or not and whether it's "editorializing" since this is an essay page and the essay hatnote itself already clarifies it contains the advice and/or opinions of one or more Commons contributors
    2. Neutrally mentioning objectively relevant information about how other large sites handle this is appropriate and is not editorializing or non-neutral. If you disagree on that, please see the point above. I don't know what the objection is here: adding some info about how this is handled elsewhere (without e.g. saying it should be the same way here or that how other sites handle this is best) is not non-neutral but clearly relevant to this page.
    Prototyperspective (talk) 19:37, 27 October 2024 (UTC)[reply]
    I'm not really clear on the details here but aren't all essays inherently "editorializing" to some degree since they aren't guidelines that were voted on and/or edited by multiple users based on consensus? --Adamant1 (talk) 15:15, 28 October 2024 (UTC)[reply]
    @Adamant1: If you ask that on the discussion thread, rather than here where I gave a notification directing people to the discussion thread, I'll respond substantively. - Jmabel ! talk 01:40, 29 October 2024 (UTC)[reply]

    I messed up making a mass deletion request

    How do I fix it? I edited the template page instead of making a new request by accident. — Preceding unsigned comment added by TansoShoshen (talk • contribs) 21:05, 27 October 2024 (UTC)[reply]

    @TansoShoshen: I've deleted the template page; you can go ahead and re-make your request. Normally you would have gotten an error, since the page is create-protected. However, admin FunkMonk had recently made the same error, so the page happened to exist and you could edit it. Pi.1415926535 (talk) 22:21, 27 October 2024 (UTC)[reply]
    @TansoShoshen consider using com:vfc for mass requests. RoyZuo (talk) 18:57, 1 November 2024 (UTC)[reply]

    October 28

    Flickr license and license in embedded metadata differ

    de:Theresia Crone

    The given image is currently licensed CC‑BY‑2.0 (generic). But the image metadata clearly states CC‑BY‑4.0. Should the licensing here be changed? I prefer the information in the metadata myself. Also, the version 2.0 licenses are at least a decade stale and legally deficient in several respects.

    In addition, I can easily contact the copyright holder and gain explicit permission for CC‑BY‑4.0 should that be necessary.

    Thanks in advance. RobbieIanMorrison (talk) 12:21, 28 October 2024 (UTC)[reply]

    Sorry, I did not realize the item for "copyright" at the top of this page is clickable and not an indicator. But I'll leave this posting here nonetheless. (It would be more intuitive to have little subtabs at the top and not just colored text, a hint!) RobbieIanMorrison (talk) 12:27, 28 October 2024 (UTC)[reply]
    Afaik, what matters is what is written in the license template. The problem is that data in the metadata might become obsolete due to changes, or the metadata is applied automatically to all works by a photographer. Sometimes we have an upload to Flickr with an NC license, but the author later decides to change to CC BY. Then the metadata does not reflect the recent change. (Another example: when a photographer uploads his image to Commons but has an NC license stated in the metadata, that statement becomes obsolete when he declares that he publishes his work under a CC BY license.) There are also many cases where the metadata states that the respective image must not be used without permission from the photographer, but usage rights have since been transferred to another institution, which released the image under a free license, and the metadata does not reflect these changes --PantheraLeo1359531 😺 (talk) 08:00, 29 October 2024 (UTC)[reply]
    In this case, it is interesting whether usage under the conditions of version 2 AND 4 is allowed, as the licenses differ only in version, not necessarily in their restrictions --PantheraLeo1359531 😺 (talk) 08:04, 29 October 2024 (UTC)[reply]
    @PantheraLeo1359531: No easy answer, I guess, in terms of workflows. The tags embedded in the file can easily become obsolete. But — I would strongly argue — that the most liberal license present should still take precedence. And I would suggest that the CC‑BY‑4.0 license is the most liberal with its grant of 96/9/EC database rights. So returning to my original question, I believe the license notice on Wikimedia should be modified to version 4.0. I am going to get technical here, so feel free to stop reading! The SPDX AND logical conjunction operator requires that recipients simultaneously comply with the terms of both or all listed licenses. This is correct, AFAIK, in your example because CC‑BY‑4.0 is simply more permissive than CC‑BY‑2.0. In short, CC‑BY‑2.0 is forward/inbound compatible to CC‑BY‑4.0 (my best info using a quick search was this). Noting also that the CC‑BY‑2.0 does not contain the "or later" version language that some software licenses do. Thanks for your reply. RobbieIanMorrison (talk) 09:16, 29 October 2024 (UTC)[reply]
    Thank you for your answer! I am not an expert on the license details, so I cannot examine further what to do :). Greetings --PantheraLeo1359531 😺 (talk) 09:31, 29 October 2024 (UTC)[reply]
    I spend quite a lot of time advocating for en:open data. RobbieIanMorrison (talk) 10:13, 29 October 2024 (UTC)[reply]
    If they offer two versions of the same named license, any reuser can select whichever they prefer. Just like any other multi-licensing. - Jmabel ! talk 03:40, 30 October 2024 (UTC)[reply]
    @Jmabel: Thanks. Essentially the SPDX OR logical disjunction operator if a need to be explicit was sought. That was not my question. My question was should that image stored on Wikimedia be tagged as CC‑BY‑4.0 and not CC‑BY‑2.0 — version 4.0 being the more favorable license for several reasons (universal, database rights grant, contemporary)? RobbieIanMorrison (talk) 11:46, 30 October 2024 (UTC)[reply]
    @RobbieIanMorrison: No, it should be tagged as both. Generally, this is done as a vertical stack.   — 🇺🇦Jeff G. please ping or talk to me🇺🇦 14:58, 30 October 2024 (UTC)[reply]
    @Jmabel: That is a sensible approach. Some only obliquely relevant comments follow. Why does Flickr apply CC‑BY‑2.0 on images shot in 2022? I could not find a definitive source for CC‑BY‑2.0 being forward compatible to CC‑BY‑4.0. And I spoke to the photographer of the image under discussion recently and he said he would reissue any of his material for Wikipedia under CC‑BY‑4.0 on request (we often end up photographing the same climate protests in Berlin). Thanks for your replies too. RobbieIanMorrison (talk) 15:26, 30 October 2024 (UTC)[reply]
    @RobbieIanMorrison: Flickr never updated this aspect of their offered licensing. I have no solid idea why they have made that choice; most likely they defaulted into a lack of change by not addressing the issue. But you'd really have to ask someone at Flickr why Flickr made a particular decision; I certainly can't speak for them. - Jmabel ! talk 17:42, 30 October 2024 (UTC)[reply]
    This issue was already addressed on Flickr. It seems that they just don't care. [3] [4] Herbert Ortner (talk) 20:37, 30 October 2024 (UTC)[reply]
    @Herbert Ortner: Thanks. Some discussion about file‑specific embedded licenses versus site licenses in that last URL. RobbieIanMorrison (talk) 21:56, 30 October 2024 (UTC)[reply]
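
    The AND/OR distinction raised in this thread can be sketched in a few lines of Python. This is a toy illustration only, not an SPDX parser: the function name reuser_obligations is hypothetical, and it handles nothing beyond a flat expression with a single kind of operator.

    ```python
    def reuser_obligations(expression):
        """Return (mode, licenses) for a flat SPDX-style expression.

        mode is "any" when the licenses are joined by OR (the reuser may
        pick whichever one they prefer) and "all" when they are joined by
        AND (the reuser must comply with every listed license).
        """
        parts = expression.split()
        licenses = parts[0::2]          # tokens at even positions are license ids
        operators = set(parts[1::2])    # tokens at odd positions are operators
        if not operators or operators == {"AND"}:
            return ("all", licenses)
        if operators == {"OR"}:
            return ("any", licenses)
        # Mixed operators and parentheses are outside the scope of this sketch.
        raise ValueError("unsupported expression: " + expression)


    print(reuser_obligations("CC-BY-2.0 OR CC-BY-4.0"))
    ```

    Under the multi-licensing reading given above, the file would correspond to the OR form, so a reuser could choose either version.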

    October 29

    Final Reminder: Join us in Making Wiki Loves Ramadan a Success

    Dear all,

    We’re thrilled to announce the Wiki Loves Ramadan event, a global initiative to celebrate Ramadan by enhancing Wikipedia and its sister projects with valuable content related to this special time of year. As we organize this event globally, we need your valuable input to make it a memorable experience for the community.

    Last Call to Participate in Our Survey: To ensure that Wiki Loves Ramadan is inclusive and impactful, we kindly request you to complete our community engagement survey. Your feedback will shape the event’s focus and guide our organizing strategies to better meet community needs.

    Please take a few minutes to share your thoughts. Your input will truly make a difference!

    Volunteer Opportunity: Join the Wiki Loves Ramadan Team! We’re seeking dedicated volunteers for key team roles essential to the success of this initiative. If you’re interested in volunteer roles, we invite you to apply.

    • Application Link: Apply Here
    • Application Deadline: October 31, 2024

    Explore Open Positions: For a detailed list of roles and their responsibilities, please refer to the position descriptions here: Position Descriptions

    Thank you for being part of this journey. We look forward to working together to make Wiki Loves Ramadan a success!


    Warm regards,
    The Wiki Loves Ramadan Organizing Team 05:11, 29 October 2024 (UTC)

    Your input...

    FYI: Commons talk:Administrators#Userpages: red or blue? Regards, Aafi (talk) 09:39, 29 October 2024 (UTC)[reply]

    "[Administrator should have a] user-page with .. [information] how they could be contacted": Is that a joke? Don't we have talk pages for that?
     ∞∞ Enhancing999 (talk) 10:14, 29 October 2024 (UTC)[reply]
    @Enhancing999, I'm sorry if that sounded somewhat weird. I've made a change and tried to clarify what I exactly mean by it. You're free to comment on that discussion. I posted here for a wider community input and won't be monitoring any responses here. ─ Aafī on Mobile (talk) 10:23, 29 October 2024 (UTC)[reply]
    So my quote is no longer on that page. Ok.
    I wonder if user pages are read as much as some users hope ..
     ∞∞ Enhancing999 (talk) 10:27, 29 October 2024 (UTC)[reply]
    Interestingly, there isn't much info on User:EugeneZelenko's user page (one of the admins/bureaucrats who asked for a user page to be created).
     ∞∞ Enhancing999 (talk) 10:43, 29 October 2024 (UTC)[reply]
    Aren't language skills, user rights status and projects where the user is participating/has participated completely useless? This seems like the bare minimum to me and I don't ask for anything more. EugeneZelenko (talk) 14:34, 29 October 2024 (UTC)[reply]
    For user rights, the information is generally not complete and better left to the relevant MediaWiki function.
    Language skills should be visible on the talk page and most of the time, at least implicitly it is.
     ∞∞ Enhancing999 (talk) 19:34, 29 October 2024 (UTC)[reply]
    Information on a user talk page could be accidentally removed. For example, links to archived talks were deleted a couple of times from my talk page. Archive bots could also move it. So a user page is something more persistent. EugeneZelenko (talk) 14:31, 30 October 2024 (UTC)[reply]
    Same could happen to a user page. Adding it directly to the talk page saves time.
     ∞∞ Enhancing999 (talk) 07:50, 1 November 2024 (UTC)[reply]

    I've put a request there for a tag to also filter changes made with the tool in Wikipedias like it is in Commons, but the page tells me that: Talk pages in this namespace are generally not watched by many users. and sends me here. I understand that this page is only for Commons issues, but I don't know exactly where to ask. Thank you. Gdaniel111 (talk) 12:00, 29 October 2024 (UTC)[reply]

    Since meta:Help:Cat-a-lot doesn't have a talk page, you asked in the right place. The problem is a lack of developers/development, and I proposed several concrete, readily adoptable solutions to that here: mw:Please increase MediaWiki development capacity further.
    Prototyperspective (talk) 12:10, 29 October 2024 (UTC)[reply]

    Thanks again to the person who posted the link to https://ocr.wmcloud.org/ for me. I am rerunning news articles where Newspaper.com could not transcribe their own articles or could not properly distinguish the columns of material and jumbled the transcribed text. The Google OCR was able to transcribe the previously unreadable articles and even transcribed handwritten cursive writing. Thanks again. RAN (talk) 21:35, 29 October 2024 (UTC)[reply]

    This image comes up blank
    Any suggestions on what to do?--Trade (talk) 21:28, 30 October 2024 (UTC)[reply]
    @Trade: What language is that? Setting it to Korean, it transcribes something, although it doesn't look quite right. I'd think with such a tiny amount of text it'd be easier to just type it, rather than using OCR at all! :) Sam Wilson 00:43, 31 October 2024 (UTC)[reply]
    That would require me to know Korean in the first place Trade (talk) 00:46, 31 October 2024 (UTC)[reply]
    @Trade: According to Google Translate it's "Nano Cola" in Korean, which makes sense. --Adamant1 (talk) 03:12, 31 October 2024 (UTC)[reply]
    Selecting the lower half gives a result. The tool seems mostly helpful for long texts, but still, it works even on this.
     ∞∞ Enhancing999 (talk) 21:33, 31 October 2024 (UTC)[reply]

    October 30

    Views through mobile phones

    Musicians seen via a mobile phone screen and directly

    Do we have a category for images like the one above? Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 09:51, 30 October 2024 (UTC)[reply]

    technically? Category:Mobile phone screenshots. Alexpl (talk) 14:52, 30 October 2024 (UTC)[reply]
    @Alexpl: Please use the colon trick per internal links to form Category:Mobile phone screenshots.   — 🇺🇦Jeff G. please ping or talk to me🇺🇦 18:42, 30 October 2024 (UTC)[reply]
    There's no category for that currently, but you can find enough images that depict something similar to justify creating a new category for it. I'm not sure what to call it though. ReneeWrites (talk) 20:21, 30 October 2024 (UTC)[reply]

    There seems to be some inconsistency between the use of the term "music groups" and "musical groups". Anyone know which is correct?--Trade (talk) 21:05, 30 October 2024 (UTC)[reply]

    Both are perfectly valid English. - Jmabel ! talk 21:22, 30 October 2024 (UTC)[reply]
    Maybe, but arbitrarily having one set of categories use one spelling and another a different spelling makes for a complete mess. Trade (talk) 21:26, 30 October 2024 (UTC)[reply]
    Fine, but the choice is arbitrary. You asked which was "correct", and both are acceptable English. - Jmabel ! talk 01:01, 31 October 2024 (UTC)[reply]
    FWIW, "music groups" may be easier for non-native speakers, since we would refer to "rock music groups" and "jazz music groups", not "rock musical groups" and "jazz musical groups". - Jmabel ! talk 01:02, 31 October 2024 (UTC)[reply]

    October 31

    Almost 400k files need license review

    I just did a search of Category:License review needed and subcategories and saw almost 400k files!!!

    The result is that some of those files have been marked for review for years and the source dies before anyone reviews the file. Then we have two choices:

    1. Mark the file for deletion (just like what is standard for recent files that fail upload)
    2. Keep the file

    I'm sure reviewers feel tempted to skip such old files because it does not feel right to delete a file that could have been saved if it had been reviewed right after it was uploaded.

    The good news is that many of those files might actually not need a "normal" review to confirm the license. For example, a bot can verify that a video has the right license, but it can't check whether the video contains any derivative work. So it might help if we could somehow sort the files into those that urgently need a review and those that can wait. If anyone has ideas, feel free to fix the problem.

    If a file is checked 1 or 10 years after upload and the source is no longer available, we could create a template like {{Grandfathered old file}} that says that the uploader claims the file is freely licensed but we can't verify that (now).

    If we do so, then we could move files that can't be reviewed out of the normal review categories, and hopefully it will be easier for reviewers to keep up with new uploads. It's like link rot. We can't fix what is already broken but we can focus on new files.

    Question is if that is an acceptable solution? Or does someone have a better idea? --MGA73 (talk) 16:04, 31 October 2024 (UTC)[reply]

    Delete the files. Otherwise, we create a playground for underworked attorneys to hassle Wikimedia/Foundation for years - before we ultimately have to delete those files anyway. Alexpl (talk) 16:55, 31 October 2024 (UTC)[reply]
    There are 30k+ files from Finna.fi which could be reviewed by software if somebody would like to write a script that compares each image to the image in Finna and confirms that the licence is correct. I could even write a script for that if somebody wants to run it. (Note: I participated in uploading the images.) I suppose that there are other images uploaded from well-formed repositories with an API which could be reviewed automatically too. --Zache (talk) 17:20, 31 October 2024 (UTC)[reply]
    I don't see how (all) files can/should be deleted as long as there is no obvious violation of guidelines or laws (and probably a huge amount of files is good (and several files are in use etc. etc.)) --PantheraLeo1359531 😺 (talk) 17:36, 31 October 2024 (UTC)[reply]
    Where exactly are those "400k" files? There are e.g. ~110,000 files in subcats of CAT:URAA (which includes +600 artist categories whose works are potentially affected by URAA paranoia), or ~130,000 files in CAT:PD-Art (PD-old default) (which are in 95% of cases obvious PD-old-70 or similar). There are 'only' 70,000 files using the actual {{LicenseReview}} template, and from my experience it doesn't seem to be the case that those files are more likely to be copyright violations than any other file on Commons (pretty much the opposite is the case). ~TheImaCow (talk) 17:56, 31 October 2024 (UTC)[reply]
    @TheImaCow: I agree that many files do not require an actual review, but there are other review templates than LicenseReview. For example YouTube, Flickr and GODL-India. That is why I said it might help if we sort the categories into files that should be reviewed, where someone confirms that the file is on some website with some license, and files that need some other review, where we do not need to compare the file to some website. --MGA73 (talk) 18:07, 31 October 2024 (UTC)[reply]
    • @Alexpl: underworked attorneys could have done that already if they wanted. Some of the files have been here for many years. If the files were uploaded by users with a good upload history I would not worry that much. If uploaded by someone with only one upload, or with 10 uploads where 9 were deleted as copyvios, I would worry much more. In any case, if someone sends a takedown notice then I’m sure the file would be deleted even if it had a template saying the file was claimed to be free but sadly not reviewed in time. --MGA73 (talk) 05:59, 1 November 2024 (UTC)[reply]
    A bot could identify files, that have a source, that is archived in archive.org or archive.is or both and add this information to the talk page of the file. Files without an archive version could get priority for review. --C.Suthorn (@Life_is@no-pony.farm - p7.ee/p) (talk) 07:05, 1 November 2024 (UTC)[reply]
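
    The prioritisation suggested in the comment above could look roughly like the following Python sketch. Everything named here is hypothetical: the PendingFile record, its fields and the review_queue function are illustrative, and the archived flag is assumed to have been filled in beforehand by a separate lookup against archive.org or archive.is that is not shown.

    ```python
    from dataclasses import dataclass


    @dataclass
    class PendingFile:
        title: str
        source_url: str
        archived: bool  # assumed result of an earlier archive lookup (not shown)


    def review_queue(files):
        """Order files so that those without an archived source come first.

        sorted() is stable and False sorts before True, so files within
        each group keep their original relative order.
        """
        return sorted(files, key=lambda f: f.archived)


    pending = [
        PendingFile("File:A.jpg", "https://example.org/a", archived=True),
        PendingFile("File:B.jpg", "https://example.org/b", archived=False),
        PendingFile("File:C.jpg", "https://example.org/c", archived=False),
    ]
    for f in review_queue(pending):
        print(f.title, "archived" if f.archived else "NOT archived")
    ```

    Files whose source has no archived copy are the ones at risk of becoming unreviewable, which is why they are placed at the front of the queue.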
    • That is simply most (or so I think) files uploaded with video2commons, for example. I don't know why you suggest deletion. They definitely should not be deleted just because somehow a license review tag was added. Most files simply do not have such a tag but are likewise not license reviewed; there is no reason for deleting files that have this template set. Once again I strongly disagree with Alexpl, but I also don't understand why he would even comment something like that.
    • For license review, please prioritize those files that are in use. Various tools like GLAMorgan can be used to see files that are in use that are in category Category:License review needed. This tag / category is useful for that but maybe it should be used more sparingly, e.g. only for uploads by new users or a subset of video2commons uploads and/or the reviewing could be automated.
    Prototyperspective (talk) 12:02, 1 November 2024 (UTC)[reply]
    Here's one further idea: a link archival bot for external links on Commons (anywhere but especially in the source field of {{Information}}). There have been many requests & proposals for this in the Community Wishlists and so on but they are usually focused on Wikipedia. It seems like on Wikipedia lots of this is being done. Not so much on Commons except for vid2commons which seems to request an IA-archival for every video/audio import. This recent Wishlist proposal has "All projects" specified so its scope includes Commons; probably more could and should be done: Automatic Archiving of Cited Web Pages in Web Archive. Prototyperspective (talk) 17:27, 1 November 2024 (UTC)[reply]

    Thank you for all the ideas. It would be great if they could be implemented. :-)

    I mentioned a template earlier and I made an example of how it might look:

    This image was originally posted to a website and claimed to be licensed under a free license. An administrator or reviewer <user> tried on the <date> to confirm that the above/below mentioned license was valid. However the file was not available on the specified source so the copyright status could not be confirmed. Administrator/reviewer found no indications that the copyright claim can't be trusted. If you disagree you can start a deletion request and state your reasons.

    I think such a template would be useful because it will make it possible to get the file out of the review category, and at the same time it tells everyone that there is no reason to ask for a new review. --MGA73 (talk) 16:48, 1 November 2024 (UTC)[reply]

    1.  Support such a template.
    2. we need a bot to go through files with a youtube source and test if the youtube source is ccby. when no, fail the review; when yes, mark it with a template that says something like "bot xx confirms that the given source youtubeURL is ccby" and auto categorises to a category "youtube files reviewed by bot". if a human reviews after the bot review, it gets categorised to "youtube files reviewed by bot and reviewer".
    3. we also need bots/some better automatic processes for all the iranian news photos.
    RoyZuo (talk) 18:50, 1 November 2024 (UTC)[reply]
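
    The decision rule in point 2 above can be written down as a small Python sketch. It covers only the pass/fail and categorisation logic; fetching the license from YouTube is not shown, the function name bot_review is hypothetical, and the string "CC-BY" stands in for however a real bot would represent the license it finds at the source.

    ```python
    BOT_CATEGORY = "youtube files reviewed by bot"
    BOT_AND_HUMAN_CATEGORY = "youtube files reviewed by bot and reviewer"


    def bot_review(source_license, human_reviewed=False):
        """Return (passed, category) for one file.

        source_license is the license found at the YouTube source
        (assumed to be looked up elsewhere). A file fails the review
        unless the source is CC-BY; a passing file is categorised
        according to whether a human has also reviewed it.
        """
        if source_license != "CC-BY":
            return (False, None)
        if human_reviewed:
            return (True, BOT_AND_HUMAN_CATEGORY)
        return (True, BOT_CATEGORY)


    print(bot_review("CC-BY"))
    print(bot_review("Standard YouTube License"))
    ```

    A sketch like this only confirms the license stated at the source; whether the uploader on YouTube actually holds the rights would still need human judgement.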
    Re 2.: Agree. However, it's not so simple: often people upload videos they don't have rights for under CCBY or only mean the music is CCBY but not the video. Sometimes, a different license is specified in the file description but usually that's just CCBYSA or CCBY4.0 instead of CCBY3.0. Sometimes, a license may be specified in the description but not in the file metadata but I think this is an edge case that shouldn't be a problem. Lastly, some files were CCBY at the time of upload but had this changed later on or the video is down. In any case, I don't think most of these 400 k files are videos from youtube. Prototyperspective (talk) 19:08, 1 November 2024 (UTC)[reply]
    All the special cases can be handled in a DR started by the bot, or by the uploader replacing the failed review template with one that says "this youtube file fails bot review but is actually good so a human please review it".
    as long as a bot starts working and continues non stop, any new youtube uploads will be handled shortly after upload. then it's the uploader's responsibility to explain all those special cases (changed licence, taken down video...). if they cant do that in like 1 or 2 days after upload, the file deserves speedy deletion.
    https://commons.wikimedia.org/w/index.php?search=incategory:License_review_needed+youtube 17545 / 76125 = 23%. RoyZuo (talk) 19:31, 1 November 2024 (UTC)[reply]
    •  Comment There was an attempt earlier at Commons:Bots/Requests/EatchaBot 3 / Category:Arranged license review project to make review easier. I think it did help but it has now stopped. Maybe there are some ideas or code that can be of use for future bots. I also like the idea Zache mentioned about having a bot confirm that files from Finna match the source. It is probably not possible to make one bot that can solve all problems, but it will help if one or more bots can do some tasks and reduce the number of files that humans have to work on. --MGA73 (talk) 19:40, 1 November 2024 (UTC)[reply]

    Help interpreting photographs from the Velvet Revolution in Prague in 1989

    Protest rally with participants holding up two fingers

    I recently dug out some 35 year old negatives, had them scanned and post‑processed, and uploaded them to Wikimedia yesterday. See this Wikimedia category.

    If you can help with information about the context and circumstances of these various images, shot during the high point of the Velvet Revolution, can you please edit the Discussion tabs of the respective images or use the Discussion tab for the aforementioned category.

    I am currently talking to the Czech National Archives about this material too.

    With reference to that thumbnail, the crowd also took out their house keys and rattled them to symbolize freedom. I was there as a tourist (and these pictures probably better qualify as holiday snaps but I nonetheless ticked the educational box on upload). TIA, RobbieIanMorrison (talk) 16:47, 31 October 2024 (UTC)[reply]

    Suggestion: Post this question also on Commons:Hospoda U Commons (the Czech village pump), perhaps they might help you. JopkeB (talk) 14:36, 1 November 2024 (UTC)[reply]
    @JopkeB: Thanks, will do that now. RobbieIanMorrison (talk) 15:53, 1 November 2024 (UTC)[reply]
    See here: Commons:Hospoda U Commons#Help interpreting photographs from the Velvet Revolution in Prague in 1989. RobbieIanMorrison (talk) 15:59, 1 November 2024 (UTC)[reply]

    November 01

    Obtuse bot created categories

    Apparently User:Gzen92Bot has been mass creating thousands of categories that only contain a couple of images and basing the names of the categories on the file names. Category:"Papier dominoté. Damier alternant le motif du dé, face cinq, un carré plein, deux carrés avec deux fleurs stylisées différentes, un carré avec un motif " géométrique ", sur fond vert pâle - btv1b10576326x being one of thousands of examples. People can look through Category:Files from Gallica needing categories (images) to find a ton more. Creating 20 word categories based on purely descriptive file names seems sub-suboptimal at best though. More so given that it's being done en masse and through automated editing. I'm not really sure what to do about it though, since I'm not an expert on bots. Nor am I even sure it's an issue to begin with. But it does seem like a needlessly obtuse way to do things. So does anyone else have an opinion about it or know what can be done to fix the issue, assuming it even is one? --Adamant1 (talk) 04:51, 1 November 2024 (UTC)[reply]

    @Adamant1: I fully agree. Creation of >7,000 uncategorized and possibly-nonsense categories is not appropriate. Doubly so given that this does not seem to be an approved task for the bot. I have blocked the bot until/unless the task is approved.
    @Gzen92: This is the third time your bot has been blocked for operating with an unapproved task. Per Commons:Bots#Permission to run a bot, it is not optional to seek approval for bot tasks. Pi.1415926535 (talk) 05:46, 1 November 2024 (UTC)[reply]
    @Adamant1: As a regular user with some background in research data management, I completely agree as well. Thanks for pursuing the matter. RobbieIanMorrison (talk) 06:53, 1 November 2024 (UTC)[reply]
    Gee .. what's the cleanup plan for these?
     ∞∞ Enhancing999 (talk) 07:48, 1 November 2024 (UTC)[reply]
    Please delete all the subcategories of Category:Files from Gallica needing categories (images). Prototyperspective (talk) 11:56, 1 November 2024 (UTC)[reply]
    Strong oppose towards such mass deletions. These categories appear to contain similar images, which can greatly aid the manual, proper categorisation on Commons - these categories may or may not be deleted once the images in them have been properly categorized. ~TheImaCow (talk) 16:24, 1 November 2024 (UTC)[reply]
    Most of them contain just 2 images. The files would be upmerged. Prototyperspective (talk) 17:20, 1 November 2024 (UTC)[reply]
    @Adamant1, Pi.1415926535, and Enhancing999: I continued uploading following Commons:Bots/Requests/Gzen92Bot-4, but I agree with the additional categories. I will make a new request (I will indicate the link here soon). This raises questions: there are millions of files to upload and it cannot be done manually, so from how many files should a category be created? How to name the categories (other than with the name of the file)? Following the decision I could easily empty the categories. Gzen92 (talk) 08:19, 1 November 2024 (UTC)[reply]
    If you are not able to categorize the photos properly when uploading such an amount of photos you should slow down the upload process and create them manually. GPSLeo (talk) 08:29, 1 November 2024 (UTC)[reply]
    Categorisation of images on Commons is not a requirement when uploading images & it shouldn't be - especially not for batch/GLAM uploads. A category such as "Images to check" is sufficient & often much better than automated categorisation. There are still thousands of content categories with random junk in them that was dumped there by automatic categorisation from ten years ago which needs to be cleaned up. A bunch of images, or also a bunch of 500,000 images waiting in a "to check/to categorize" category don't hurt anyone whatsoever, as opposed to poorly done automatic categorisation. ~TheImaCow (talk) 16:24, 1 November 2024 (UTC)[reply]
    I made the request. Gzen92 (talk) 17:26, 1 November 2024 (UTC)[reply]
    I'm not sure if it's practical in this case but the way I'd do it is to categorize the images by subject. For instance "maps from Gallica", "books from Gallica", Etc. Etc. Then people sub-categorize the images beyond that if they want to. But at least it doesn't lead to a bunch of random categories. --Adamant1 (talk) 18:42, 1 November 2024 (UTC)[reply]
    •  Comment I'm not a fan of mass creation of categories with very few files in them (generally I do not like categories with very few files and I prefer to have 20 photos of John Doe in one category rather than to have 10 categories of John Doe in 2020, John Doe in 2021 or John Doe wearing a yellow hat looking west). But now that they are created I agree with TheImaCow that it might be better to keep them until better categories are created. --MGA73 (talk) 18:04, 1 November 2024 (UTC)[reply]

    Speedy deletion: F3. Derivative work of non-free content

    I tried to figure out how to correctly nominate it for speedy deletion, but alas I could not figure out how. Grorp (talk) 05:03, 1 November 2024 (UTC)[reply]

    @Grorp: There is an important distinction between trademark (an identifying idea) and copyright (a creative expression). For example, the content of a novel is copyrighted, while the title or certain character names may be trademarked for marketing purposes. Commons primarily concerns itself with copyright, as it directly affects whether we can host a file. In this case, the symbol itself is too geometrically simple to be copyrighted. For non-copyright restrictions like trademarks that do not affect Commons but may affect reuse elsewhere, we sometimes use templates like {{Trademark}} as courtesy notices on the file pages.
    There may be other reasons for us to not keep the file - in this case, it may be out of scope - but F3 is not applicable here. Pi.1415926535 (talk) 05:33, 1 November 2024 (UTC)[reply]
    @Pi.1415926535: Well, I'm no intellectual property expert, but I do know that the trademark holder, the Church of Scientology, is particularly litigious... and anti-LGBTQ. And someone created this LGBTQ symbol and placed it in Wikipedia article Scientology and homosexuality, most likely as trolling/provocation/agitation... putting Wiki in the middle and smack dab in the crosshairs. Though it was quickly removed from the article, there is no need of keeping such in Wikicommons. I am only familiar with deletion process in English Wikipedia, and not in Wikicommons. How fast does that process usually go? Grorp (talk) 13:24, 1 November 2024 (UTC)[reply]
    Yes, trademark is not a copyright restriction. However I wonder what educational use there could be for this file. I warned the uploader about scope and copyright violations. Yann (talk) 13:47, 1 November 2024 (UTC)[reply]

    Commons Gazette 2024-11

    Volunteer staff changes

    In October 2024, 1 sysop was elected. Currently, there are 180 sysops.

    Other news


    Edited by RoyZuo.


    Commons Gazette is a monthly newsletter of the latest important news about Wikimedia Commons, edited by volunteers. You can also help with editing!

    --RoyZuo (talk) 19:15, 1 November 2024 (UTC)[reply]

    Derivative works (FOP etc.)

    1. does commons want derivative works (dw) that are currently not compatible with com:l, especially photos taken in no-FOP countries?
    2. were there users that got blocked for uploading such dw?

    --RoyZuo (talk) 19:24, 1 November 2024 (UTC)[reply]

    • Yes, they are wanted because one day they will be in the public domain. We hide the images and add an undelete date. There should be a mechanism in place where you can hide an image yourself and add the undelete date. --RAN (talk) 01:17, 2 November 2024 (UTC)[reply]
    I don't know if it's necessarily in line with the guidelines, but I'm a big proponent of people uploading copyrighted works under the guise of documenting and then deleting them with undeletion dates. At the end of the day this is as much about documenting who created certain works and when they will become PD as it is a place to host freely licensed media. That's at least how I see it. There's no harm in uploading something purely to have it deleted so it can be restored once the copyright expires though. --Adamant1 (talk) 02:35, 2 November 2024 (UTC)[reply]

    November 02