Commons:Batch uploading

COM:BATCH


Request a batch upload	Current requests	Past batch uploads	Failed batch uploads

Commons Batch Uploading is a project to centralize the uploading of a collection of files, that have released their work as PD or any Commons compatible license. The files would be assigned to a bot operator who would see how the request would be fulfilled.

Before you request a batch upload here, please read the guide to batch uploading first.

See w:Wikipedia:Public domain image resources for potential future batch uploads.

Related project: Commons:Library back up project aims to upload books in public domain from libraries of all languages.

Requests

Videos by Psych2Go

The videos of the Channel "Psych2Go"

Source to upload from

https://www.youtube.com/@Psych2Go

License

Description

The videos addresses different psychological-related topics; often with references. The topics are based on psych health, love and social relationships, and more.

PantheraLeo1359531 😺 (talk) 12:17, 13 November 2024 (UTC)[reply]

Opinions

Assigned to	Progress	Bot name	Category
			Category:Videos by Psych2Go

Suntooth on iNaturalist

Source to upload from

https://www.inaturalist.org/observations?user_id=suntooth

License

CC-BY-SA

Description

The linked profile is mine, and I upload all my observations as CC-BY-SA. I'd like to import all of my observation images to Commons in bulk, and audio observations too if possible. An example image would be https://inaturalist-open-data.s3.amazonaws.com/photos/432698136/original.jpeg (from this observation), and an example audio file would be https://static.inaturalist.org/sounds/1204663.wav?1726706336 (from this observation).

Ideally, all images would have the category for their identified species if the observation is classed as research-grade, and would also be added to Category:Photos taken by User:Suntooooth (or Category:Non-photos created by User:Suntooooth for audio) for personal tracking purposes. Other relevant categories are Category:Media from iNaturalist and Category:Media from iNaturalist with obscured location. Templates to use include {{INaturalistreview}} and {{INaturalist}}.

As of posting this request, there are 94 observations (some of which have multiple images), so it's not a huge amount, but still too many to do by hand. User:Kaldari/iNaturalist2Commons, Wiki Loves iNaturalist, and iNat2Wiki exist, and may be useful for reference (none allow bulk uploads of a whole account's media as far as I can tell). Suntooooth (talk) 20:34, 19 September 2024 (UTC)[reply]

Opinions

Assigned to	Progress	Bot name	Category

George Michell Bengal Photographs

Source to upload from

All digitised photographs have been uploaded to this Google Drive

License

Creative Commons Attribution-Share Alike 4.0 International

George Michell as the sole owner of the physical photographic slides has released the copyright of that work under the above license.

Detailed discussion with the Wikimedia VRT agent and link to the ticket is available in this section of the Discussion tab of the Wikimedia Grant page.

Description

1. All the metadata including file names and locations is in this Google Sheet.

2. Photos are within folders named after sites photographs were taken. The metadata file has references to locations.

Do the media URLs follow a pattern: Yes, metadata file has details.
Does the site have an API: Not sure. It is a Google Drive folder.
What else could ease uploading: Metadata file.
Did you contact the site owner: Photos are uploaded to my personal Google Drive.
Is there a template that could be used on the file description pages, or should one be created: In the metadata file.

AmitGuha (talk) 18:02, 7 June 2024 (UTC)[reply]

Hello @AmitGuha. Rows 469 and 470 has no Creator. Should it be assumed to be George Michell or be left blank? -- DaxServer (talk) 09:48, 20 September 2024 (UTC)[reply]

Are the GPS coordinates: of the building or of the photographer? -- DaxServer (talk) 10:31, 20 September 2024 (UTC)[reply]

Would you be able to provide some Wikimedia Commons categories for the files?

By default, they'd added to Category:George Michell and Category:Centre for Studies in Social Sciences -- DaxServer (talk) 10:42, 20 September 2024 (UTC)[reply]

Is Commons:Batch uploading/George Michell Bengal Temples request a duplicate? -- DaxServer (talk) 11:08, 20 September 2024 (UTC)[reply]

There are quite some discrepancies in the Sheets compared to the Drive. Please refine them, it's becoming harder for me to validate -- DaxServer (talk) 14:40, 20 September 2024 (UTC)[reply]

Many thanks @DaxServer for the comments above.

1. I will fix all of the missing creator and discrepancies on sheets vs drive and respond here.

2. Yes, I will provide additional Wikimedia Commons categories

I have found some other errors as well. I will need a couple of weeks to fix this and will respond and tag you here when I'm done.

On the other questions:

1. The GPS coordinates are of the buildings

2. Commons:Batch uploading/George Michell Bengal Temples is a duplicate and can be removed

Thanks again! AmitGuha (talk) 21:51, 23 September 2024 (UTC)[reply]

Opinions

Assigned to	Progress	Bot name	Category

Yevgeny Khaldei photographs

Photographs in public domain by famous Soviet photographer Yevgeny Khaldei.

Source to upload from

https://russiainphoto.ru/search/years-1937-1953/?query=&author_ids=171

License

Public domain - {{PD-Russia}}{{Yevgeny Khaldei as TASS photographer}} (2,453 files)

Description

Do the media URLs follow a pattern?
- Description page: for example https://russiainphoto.ru/photos/180701/, but not all files are in order in the catalog numbering system
- Description pages contain plenty of metadata such as title, caption, source museum, date taken, place taken and keywords, which should be included
- Full size images: for example https://735606.selcdn.ru/thumbnails/photos/2017/09/04/ztjzboam0rskxuxz_1024.jpg, however not every file was uploaded on the same day and some files have urls like https://735606.selcdn.ru/thumbnails/photos/9/6/o/96o52b30d244f515_1024.jpg
Does the site have an API? - Yes
What could ease uploading? The source code for every image page brought up by search query [1] includes source code with this text.

<div class="share share_size_small b-photo__share-block" data-description="" data-title="Подвеска авиабомб в самолет Пе-2, 1941 год" data-url="https://russiainphoto.ru/photos/180589/" data-image="https://735606.selcdn.ru/thumbnails/photos/2017/09/04/qpip0gtkuggujqqo_1024.jpg"></div>

- Information from the field data-title should be the image title
- data-url is the description url
- data-image is the link to the full size image
Did you contact the site owner? - Not necessary
Is there a template that could be used on the file description pages, or should one be created? - Not necessary

Kges1901 (talk) 17:37, 8 August 2024 (UTC)[reply]

Opinions

Assigned to	Progress	Bot name	Category

U.S. Army Corps of Engineers Digital Visual Library

Source to upload from

U.S. Army Corps of Engineers Digital Visual Library - https://usace.contentdm.oclc.org/digital/search
Provides almost 50,000 media files by the Corps of Engineers in the subjects of

About USACE (405 files)
Booklets, Manuals & Guides (2,109 files)
Fish & Wildlife reports (883 files)
Histories (309 files)
IWR reports (1231 files)
Laws & Congressional reports (1,302 files)
Magazines & Newsletters (148 files)
Maps & Drawings (1,373 files)
Media (438 Videos, were already uploaded to YouTube)
Photographs (17,022 files)
Project management reports (12,075 files)
Public Notices and Jurisdictional Determination Forms (5,310 files)
Regulatory Information (438 files)
Technical reports (5,378 files)

License

Public domain - {{PD-USGov-Military-Army-USACE}}

Description

Do the media URLs follow a pattern? - site is based on the "CONTENTdm® Digital Collection Management System"
- Description page: https://usace.contentdm.oclc.org/digital/collection/<collection ID>/id/<ContentDM record ID>/rec/<running number for related records>
- Full size downlaod linkhttps://usace.contentdm.oclc.org/digital/download/collection/<collection ID>/id/<ContentDM record ID>/size/full
- Description pages contain plenty of metadata such as title, caption, sub-collection, dates published/digitized, subject keywords, general keywords, physical location/description, "Data entered by"/uploader, language, usage rights (always PD)/use credits (not mandatory), record number/ContentDM record number, which should be included
Does the site have an API? - I don't know
What else could ease uploading?
- Unrelated general thing I noted is that the majority of files have a very recent digitisation date (after 2020)
Did you contact the site owner? Not nessescary
Is there a template that could be used on the file description pages, or should one be created?
- Something like {{LOC-image}} could be created

TheImaCow (talk) 09:31, 3 August 2024 (UTC)[reply]

IIIF (JSON) API:

Collection Manifest: https://usace.contentdm.oclc.org/iiif/2/<collection ID>/manifest.json
Item Manifest: https://usace.contentdm.oclc.org/iiif/2/<collection ID>:<ContentDM record ID>/manifest.json
- For better annotated description metadata indexed by unique identifier keys: https://usace.contentdm.oclc.org/digital/api/collections/<collection ID>/items/< ContentDM record ID>/false
Item info: https://usace.contentdm.oclc.org/iiif/2/<collection ID>:<ContentDM record ID>/info.json
Download: https://usace.contentdm.oclc.org/digital/iiif/2/<collection ID>:<ContentDM record ID>/full/max/0/default.<format> with format offerings specified in the Item Info API above. JPEG is presumed to be available for all files.

-- DaxServer (talk) 18:14, 3 August 2024 (UTC)[reply]

@TheImaCow Could you create the template based on the LOC-image as you suggested? Do you have any suggestions on the categorization schema? I suppose the files go into specific collection category. Apart from this, other categories?

I can start working on this and file for a bot task approval when ready. -- DaxServer (talk) 18:17, 3 August 2024 (UTC)[reply]

This looks very good, thanks for your work already!

I created {{USACE image}} based on similar templates, where parameter 1 is the <collection ID> and parameter 2 the <ContentDM record ID>.

I think categorisation can be done like the USGov FEMA import from 15 years ago, categorization mirroring the collectionID (see subcats of Category:Images from FEMA called "Images from FEMA, category XXX" - for example we use something like "Images from USACE, Fish & Wildlife reports collection". I will create such categories in a moment.

+ a category like the recently created Category:Images from NPGallery to check on all files that can be removed when the actual content categories have been added manually. TheImaCow (talk) 19:23, 3 August 2024 (UTC)[reply]

@DaxServer I created Category:Images from USACE+subcats for the individual collections + Category:Images from USACE to check.

I noted that above you mentioned "JPEG is presumed to be available for all files."

This appears to be correct, however, a large portion of the media are PDF files/scanned documents, where, when requesting a JPG, only the title page would be uploaded, nonsense obivously. I am not sure how to get the PDF using the way described above, as I can't add a .pdf into the .<format> nor is PDF stated under the aviable file types at info.json

However adding /api/ and appending /download to the URL downloads the proper PDF https://usace.contentdm.oclc.org/digital/api/collection/p16021coll7/id/22887/download-example, but there might be a better way.

(Files where the manifest states "file type" "pdf" should not uploaded as JPG) TheImaCow (talk) 20:08, 3 August 2024 (UTC)[reply]

Of course, yes. I was only referring to images and probably should have said "presumed to be available for all images" instead 😅 -- DaxServer (talk) 10:02, 4 August 2024 (UTC)[reply]

BTW, do you want to create a {{USACE file}} or {{USACE work}} or some sorts for these pdfs? Additionally, the {{USACE image}} can just invoke the former with a "image" argument so the templating is not duplicated? -- DaxServer (talk) 10:04, 4 August 2024 (UTC)[reply]

Here's what I'll use:

Collection API - https://usace.contentdm.oclc.org/digital/api/search/collection/p16021coll7/page/2/maxRecords/50
Item API - https://usace.contentdm.oclc.org/digital/api/collections/p16021coll7/items/22884/false
Download API - https://usace.contentdm.oclc.org/digital/api/collection/p16021coll7/id/22884/download

FWIW their API needs some overhaul into fixing things and streamlining -- DaxServer (talk) 13:41, 4 August 2024 (UTC)[reply]

I added an "image=yes/no" parameter to the template where it either states "This document..." or "This image...". (And "This work..." if the parameter is missing) Can be added based on filetype, but I don't think that it really matters. TheImaCow (talk) 14:39, 4 August 2024 (UTC)[reply]

I've downloaded the entire metadata into my OpenRefine. I think this would enable us to strategize the work.

Would you be able to devise a schema for the {{Information}} so that we can add as much information as we can? I'm also studying the structure and can share my thoughts once I process them. -- DaxServer (talk) 19:08, 4 August 2024 (UTC)[reply]

Is this file under PD? The rights ask to consult the West Point Museum - https://usace.contentdm.oclc.org/digital/collection/p15141coll5/id/505 -- DaxServer (talk) 10:43, 6 August 2024 (UTC)[reply]

Yes, as it is from 1847, 177 years old. It's creator died 1889, so {{PD-art-100-expired}} applies. Are there more items which are not labeled as PD? TheImaCow (talk) 11:01, 6 August 2024 (UTC)[reply]

There are a few, ~30 or so, that are deemed restrictive and do not provide access to content. Ex - https://usace.contentdm.oclc.org/digital/collection/p16021coll6/id/1100 -- DaxServer (talk) 13:05, 6 August 2024 (UTC)[reply]

I propose to use {{Photograph}} for images/{{Book}} for PDFs instead of {{Information}}

USACE to commons/examples

Label to Commons Template
ContentDM label	{{Photograph}} value	{{Book}} value
Title	Title=	Title=
Alternative title	X	Subtitle=
Description	description=	description=
Sub-collection	collection=	collection=
Organizational author/Organizational creator	author=	author=
Digital Publisher	publisher=	X
Publisher	X	publisher=
Local place, State/Province, Country	depicted=<Local place>, <State/Province>, <Country>	city=
Subject	other_fields_2=`{{Information field\|name=Subject\|value=''XXXX<br>XXXX''}}`	other_fields=`{{Information field\|name=Subject\|value=''XXXX<br>XXXX''}}`
Keywords	can be omitted	can be omitted
Notes	notes=	X
Physical location	department=	X
Physical description	medium=	X
Document location/Format extend/File Type/File size	irrelevant for us	irrelevant for us
Resolution	other_fields_3=`{{Information field\|name=Resolution\|value=''XXXX''}}`	other_fields_3=`{{Information field\|name=Resolution\|value=''XXXX''}}`
Data entered by/Rights/Contributed by/Disposition/CONTENTdm number/CONTENTdm file name	redundant	redundant
Record number	X redundant, not an actual record number	X
Use credits	Example	Example
Report type	X	genre=
Publisher	X	publisher=
Date created(img)/published(book)	date=	date=
Date digitized	other_fields=`{{Information field\|name=Date digitized\|value={{date\|XXXX}}}}`	other_fields=`{{Information field\|name=Date digitized\|value=''{{date\|XXXX}}''}}`
Location	not existent (?)	not useful
Language	X	language=
Personal creator	photographer=	X

Image

== Summary ==

117 Cameron destroyed buildings

Photographer

Cameron, Harry F.

Title

117 Cameron destroyed buildings

Publisher

United States. Army. Corps of Engineers. Office of History

Description

Montebourg, Normandy [destroyed buildings].

Depicted place

Montebourg (France), Normandy, France

Subject

World War, 1939-1945
Buildings
War damage
Villages

Date

1944

Medium

2" x 2" color 35 mm slides in cardboard mounts

Current location

Cameron Collection, Rows 1-2

Accession number

This image was released by the U.S. Army Corps of Engineers, the military engineering branch of the United States Army.

This tag does not indicate the copyright status of the attached work. A normal copyright tag is still required. See Commons:Licensing.

Notes

Col. Harry F. Cameron, Jr., commanded the 164th Engineer Battalion in Europe during World War II. Title created by staff.

Resolution

600 dpi

Source

U.S. Army Corps of Engineers Digital Library

Date digitized

2023

Book

== Summary ==

Carolina Power and Light Company, Mayo Electric Generating Plant: Final environmental impact statement

Title

Carolina Power and Light Company, Mayo Electric Generating Plant: Final environmental impact statement

Publisher

US Army, Corps of Engineers, Wilmington District

Genre

Environmental impact statement

Language

eng

Subject

Coal-fired power plants
Environmental impact statements
Electric power-plants

Publication date

September 1978

Accession number

This document was released by the U.S. Army Corps of Engineers, the military engineering branch of the United States Army.

This tag does not indicate the copyright status of the attached work. A normal copyright tag is still required. See Commons:Licensing.

Place of publication

North Carolina

Resolution

Bitonal 1 bit/600 dpi, Greyscale 8 bit/300 dpi, Color 24 bit/300 dpi

Source

U.S. Army Corps of Engineers Digital Library

Date digitized

2021

Dates need to be converted from YYYY-MM-DD to YYYY|MM|DD for the date template, multiple subjects need to be converted from XXXX;YYYY to XXXX<br>YYYY
Hope I didn't forget anything important. TheImaCow (talk) 21:38, 4 August 2024 (UTC)[reply]

Here are the contentTypes:

application/octet-stream 82
application/pdf 29550
application/url 487
audio/mpeg 1
image/jp2 17124
image/jpeg 5
image/tiff 141
restricted 31
video/mp4 45

Some notes and questions:

The octet-stream contents are mainly ppt/x, doc/x, exe, pdf, zip. I think I'll skip these, if necessary can be manually uploaded.
Here are the URL types: https://usace.contentdm.oclc.org/digital/collection/p16021coll11/id/3967
For TIFF files, I'll upload them as they're lossless and the community seem to prefer them
Do you know which info template to use for video instead of {{Photograph}} ?
Do we just note the photographer and/or publisher to be text or do we create a Creator: and/or Institution: namespace entries and use that to enrich info?
Do we put the {{USACE image}} template in accession number field?

-- DaxServer (talk) 13:32, 6 August 2024 (UTC)[reply]

Since jp2 format is not supported on Commons, I'll use IIIF download to fetch jpeg version -- DaxServer (talk) 14:05, 6 August 2024 (UTC)[reply]

Actually, all of the images are provided in TIFF, I'll upload them instead. -- DaxServer (talk) 15:07, 6 August 2024 (UTC)[reply]

I would leave out everything that is not image/PDF, audio, URL, exe, "restricted", videos too. Creating creator templates for authors would be very good, is there a way to show the authors with the most files? (prob. everything over 100 or so is something we can create a template for)

USACE image template: I picked the accession number field, because I think it fits the best. The objects don't have a identifier other than the two IDs assigned by the record mangament system - which are then present as part of the template.

TIF issue: I compared some 15 objects from various collections JPG vs TIF using both GIMP/Windows Photo Viewer, and the image quality is always 1:1 the same, pixel by pixel the same. See https://i.imgur.com/iowJb9X.png for a quick comparison, left JPG right TIF. (files used are p15141coll5/3453 and p15141coll5/10765)

The TIFs here are in no way better quality, therefore we don't loose anything when using the JPGs. (there is apparently consensous to not create JPG duplicates, but nothing againest uploading only JPGs) TheImaCow (talk) 22:35, 6 August 2024 (UTC)[reply]

Here are the most used "Personal creator"s:

Cameron, Harry F. - 315
Rowland, Chester A. - 140
Jordan, Jonas - 83
Wood - 38
Ryan, Robert H. - 31
Knuppel, Lee - 26
O'Sullivan, Timothy H. - 26
Boswell, Ray - 17
Majors - 14
Wu, Andy - 14
Garver, Cpt. - 11

Re TIFs, I'll upload the JPEGs instead per your observation. -- DaxServer (talk) 13:33, 7 August 2024 (UTC)[reply]

For the depicted places, Local place and State can have multiple values. Ex: https://usace.contentdm.oclc.org/digital/collection/p16021coll6/id/1918 How should we approach that? -- DaxServer (talk) 13:47, 7 August 2024 (UTC)[reply]

The only creator I could find anything about is Timothy H. O'Sullivan, who was a known/notable photographer, template already at {{Creator:Timothy H. O'Sullivan}}. Unfortunaly I couldn't find anything reliable about any of the other people. Sometimes people with the same name were apparently involved in e.g. the Vietnam War, but their photos are from WW2 and there is no mention about them being in WW2, and similar situations.

Different states: I don't think that this is very common, and since it is on a PDF, I think we can ignore then location then - don't think it's worth creating a special case for that.

What I noticed at the same file is the very long title. Commons files names are up to 240 bytes long, this title would have ~3600 bytes. However this file has an much shorter "alternate title" field "River & harbor annual reports, 1883-1892". We should use the "alternate title" as the main title/file name here, and the "title" as "|description=" (as there is no description otherwise either). If there are too long file names/titles without an alternate title, we should cap the title with "..." or so. TheImaCow (talk) 08:15, 8 August 2024 (UTC)[reply]

What needs to be done with the Use credits? I didn't get what you meant by Example -- DaxServer (talk) 11:53, 8 August 2024 (UTC)[reply]

Oops, "example" was the placeholder when generating the table preset using the source editor, and I forget replacing it.

The "credit_line=" parameter in the photograph/book templates should be used for "Use credit" values. TheImaCow (talk) 23:16, 8 August 2024 (UTC)[reply]

The location seems pretty complicated and not just a simple combination of <Local place> <State> <Country>. It requires significant effort to cleanup and streamline. Do you have any thoughts on how to avoid that? -- DaxServer (talk) 07:42, 13 August 2024 (UTC)[reply]

I would suggest to use three seperate fields for Place/State/Country then. However, the Artwork template supports only up to four custom fields (using {{Information field}}/other_fields/other_fields_1/2/3 parameters). Three of these custom parameters are already used (Date digizized, resolution, subjects), so either we add support for more custom parameters to {{Artwork}} -no clue how to-, or we limit to using only the "Local place" value. (Using the other_fields_1= parameter, which has not been used so far).

It appears that the Country/State values can be easily inferred from the local place value anyway. ~TheImaCow (talk) 10:03, 13 August 2024 (UTC)[reply]

@TheImaCow Unfortunately I won't be able to work on the Place/State/Country field. That would be upto someone else to take up on. If that is okay with you, let me know and I can conclude the templating and do a test run. -- DaxServer (talk) 07:42, 30 August 2024 (UTC)[reply]

Yes, than we'll just continue without it. ~TheImaCow (talk) 11:01, 30 August 2024 (UTC)[reply]

I did a test run. Can you check - Special:ListFiles/CuratorBot -- DaxServer (talk) 19:19, 1 September 2024 (UTC)[reply]

Looks good. There is this file, which is watermarked and noted "Contact A&M for usage rights.". Searching for files where there is something noted about "usage rights" only returns this file and this image, which should be excluded.

Otherwise, metadata, categories, etc. looks really good. ~TheImaCow (talk) 20:20, 1 September 2024 (UTC)[reply]

Is there something specific to add in SDC? Other bots seem to add quite some basic statements, so we don't have to do that ourselves. -- DaxServer (talk) 20:44, 1 September 2024 (UTC)[reply]

Are there any inferred categories that can be added during the upload? (asked at the bot request) -- DaxServer (talk) 07:36, 2 September 2024 (UTC)[reply]

Re SDC, I don't really know much about that. We could probably add something like "copyright public domain", "filetype image/jpg", or similar, but since there are already other bots doing that everywhere, I personally don't think its worth the effort.

Re categories, I replied there.

Re language template (at the bot request), the values we should place inside {{EN|1=XXXX}} are Description, Subjects, Notes, Title, Collection, Author, Publisher, Genre. ~TheImaCow (talk) 19:47, 2 September 2024 (UTC)[reply]

Subject InfoField	English: Strategy, Sustainability, Planning
Subject InfoField	English: Strategy; Sustainability; Planning
Subject InfoField	English: Strategy Sustainability Planning

Which format would you prefer for the subject? -- DaxServer (talk) 07:48, 3 September 2024 (UTC)[reply]

I'd say the second one (with ";") ~TheImaCow (talk) 00:13, 5 September 2024 (UTC)[reply]

Current state is very good, however, It looks like uploads of the "Photographs" category have stopped at around the letter "R" ([2]), a few thousand images are still missing there. ~TheImaCow (talk) 16:50, 12 October 2024 (UTC)[reply]

Opinions

Assigned to	Progress	Bot name	Category
DaxServer (talk · contribs)	In progress	CuratorBot (talk · contribs) Commons:Bots/Requests/CuratorBot (3) Task #3	Category:Images from USACE

Landesarchiv Baden-Württemberg

The Baden-Württemberg state archive (w:de:Landesarchiv Baden-Württemberg) provides access to over 25,000,000 digitized files documenting the history of the german state of Baden-Württemberg. The majority of the files are digitized/microfilmed documents, but the collection also includes plenty of images.

Source to upload from

https://www2.landesarchiv-bw.de/ofs21/home.php is the starting point of the archive. However, to ease navigation for this purpose, I made a list of the (in my opinion) contents most intresting for commons, mostly photos and maps:

Staatsarchiv Freiburg / State archive Freiburg

Series ID	Series Name	Link	Date	Digitized files	Notes on digitized files
A 25/1	Landgericht Freiburg	[3]	(1823-) 1879-1945 (-1963)	1,903	court files
A 47/1	Staatsanwaltschaft beim Sondergericht Freiburg	[4]	1933-45 (Nazi German)	103,265	court documents, generally nazi german specific crimes such as desertion
A 66/1	Direktorium des Dreisamkreises	[5]	1800-1830	238	digitized materials are largely architectural plans of buildings
B 685/1	Bezirksamt Achern	[6]		55
B 686/1	Bezirksamt Appenweier	[7]		8	plats
B 694/1	Bezirksamt Breisach	[8]		205
B 695/1	Landratsamt Donaueschingen	[9]		119
B 698/5	Landratsamt Emmendingen	[10]		1,302	plats, building plans
B 700/1	Bezirksamt Engen	[11]		32
B 701/1	Bezirksamt Ettenheim	[12]		59
B 702/1	Landratsamt Freiburg	[13]		1,213
B 704/1	Bezirksamt Gengenbach	[14]		49
B 711/1	Bezirksamt Jestetten	[15]		102	building plans
B 713/1	Landratsamt Kehl	[16]		1,191
B 714/1	Bezirksamt Kenzingen	[17]		181
B 715/1	Landratsamt Konstanz	[18]		5,067
B 717/2	Landratsamt Lahr	[19]		5,422
B 719/1	Landratsamt Lörrach	[20]		3,564
B 723/1	Bezirksamt Meßkirch	[21]		86
B 725/1	Landratsamt Müllheim	[22]		327
B 726/1	Landratsamt Neustadt	[23]		211
B 727/12	Bezirksamt Oberkirch	[24]		117
B 728/1	Landratsamt Offenburg	[25]		151
B 729/1	Bezirksamt Pfullendorf	[26]		2
B 730/1	Bezirksamt Radolfzell	[27]		248
B 731/1	Bezirksamt Rheinbischofsheim	[28]		18
B 733/1	Landratsamt Säckingen	[29]		3,759
B 734/1	Bezirksamt Salem	[30]		52
B 735/1	Bezirksamt St. Blasien	[31]		14
B 738/1	Bezirksamt Schönau	[32]		46
B 740/1	Bezirksamt Schopfheim	[33]		2,603
B 741/1	Bezirksamt Staufen	[34]		32
B 743/1	Landratsamt Stockach	[35]		16
B 744/1	Bezirksamt Stühlingen	[36]		10
B 747/1	Landratsamt Überlingen	[37]		185
B 748/1	Landratsamt Villingen	[38]		162
B 749/1	Bezirksamt Waldkirch	[39]		25
B 750/14	Landratsamt Waldshut	[40]	(1544 - 1804) 1805 - 1952 (1953 – 1989)	167
G 1000/1	Landwirtschaftsämter und landwirtschaftliche Schulungseinrichtungen Kreis Waldshut	[41]	1934-1977	3,667	largely images
K 345	Karten und Pläne aus Bezirks- und Landratsamtsbeständen	[42]	ca. 1810 - ca. 1968	1780
T 1 (Zugang 2005/0058)	Nachlass Allgeier, Sepp	[43]		5,087
T 1 (Zugang 2009/0034)	Nachlass Baudendistel, Peter	[44]		659
T 1 (Zugang 1975/0001)	Nachlass Blankenhorn, Erich	[45]		23,098
T 1 (Zugang 2015/0038)	Nachlass Bolanz, Julius Friedrich	[46]		100
T 1 (Zugang 2010/0034)	Nachlass Drischel, Dr. Friedrich	[47]		106
T 1 (Zugang 2008/0032)	Nachlass Eckert, Klaus	[48]		193
T 1 (Zugang 2014/0036)	Nachlass Herr, Konrad	[49]		123
T 1 (Zugang 2017/0006)	Nachlass Hoppe, Oskar I	[50]		117
T 1 (Zugang 1974/0013)	Nachlass Lorenz, Adolf	[51]		845
T 1 (Zugang 1999/0046)	Nachlass Middendorff, Wolf	[52]		276
T 1 (Zugang 1992/0347)	Nachlass Schinzinger, Fridolin	[53]		717
T 1 (Zugang 2003/0065)	Nachlass Tröndle, Wilhelm I	[54]		212
W 110/1	Plakatsammlung I	[55]		401
W 134 (Glaspl.)	Sammlung Willy Pragher I: Glasplattennegative	[56]	1926-92	4,933
W 134 (Filmnegative I)	Sammlung Willy Pragher I: Filmnegative I, Bildordner 3-307	[57]	1926-92	62,133
W 134 (Filmnegative II)	Sammlung Willy Pragher I: Filmnegative II, Bildordner 311-787	[58]	1926-92	43,209
W 134 (Filmnegative III)	Sammlung Willy Pragher I: Filmnegative III, Bildordner 804-1921	[59]	1926-92	39,172
W 134 (Mittelformatdiapositive)	Sammlung Willy Pragher I: Mittelformatdiapositive	[60]	1926-92	5,218
W 134 (Kleindiapositive)	Sammlung Willy Pragher I: Kleindiapositive	[61]	1926-92	1,641
W 140	Fotosammlung Marlis Decker	[62]	1976-2005	5,838
W 145/1	Bildsammlung v. Lamezan – Hundt	[63]	ca. 1801 - ca. 1953	147
W 145/2	Bildsammlung Karl Fritz	[64]	1890 – 1950	414
W 145/3	Bildsammlung Losch-Olson	[65]	1921-1943	302
W 145/4	Fotosammlung Georg Corcodel	[66]	1880 - ca. 1945	1048
W 145/5	Fotosammlung Norbert Armbruster	[67]	2012-2015	27
W 251	Kriegsbriefsammlung Badische Zeitung Freiburg	[68]	1914-1918	167
W 307	Sammlung Karl Fritz	[69]	1870-1982	6,245
				TOTAL 340,208

Generallandesarchiv Karlsruhe / General state archive Karlsruhe

Series ID	Series Name	Link	Date	Digitized files	Notes on digitized files
46	Haus- und Staatsarchiv: I. Personalia	[70]	1693	67	military plans
Hfk Planbände	Hausfideikommiss, Planbände 1-28	[71]	1600s	116	military plans
69	Baden, Sammlung 1995: Fotosammlung I	[72]	ca. 1850 - 1923 (-1951)	9,284	historic buildings
498-1	Glasnegative Wilhelm Kratt (1869-1949) - Landesamt für Denkmalpflege, Außenstelle Karlsruhe	[73]	1900-1935	8,926	historic buildings
498-2	Denkmalpflege - Sammlung Karl Weysser	[74]	1858-1902	1,011	historic buildings
57-3	Badisches Staatstheater Karlsruhe – Fotos	[75]	late 1800s - 2009	10,521
456 F 1	Armee-Oberkommando 7	[76]	World War I	45	military photos
456 F 2	Armee-Abteilung A	[77]	World War I	55	military photos
456 F 3	Armeeabteilung B	[78]	World War I	1,467	military photos
456 F 5	Generalkommando XIV. Armeekorps: Frieden und Abwicklung	[79]	World War I	35	military photos
456 F 6	Generalkommando XIV. Armeekorps: Krieg und Abwicklung	[80]	World War I	278	military photos
456 F 7	Generalkommando XIV. Reservekorps	[81]	World War I	282	military photos
456 F 8	Stellvertretendes Generalkommando XIV. Armeekorps	[82]	World War I	2	military photos
456 F 11	28. Infanterie-Division: Feld	[83]	World War I	112	military photos
456 F 12	29. Infanterie-Division: Feld	[84]	World War I	158	military photos
456 F 13	56. Infanterie-Division	[85]	World War I	218	military photos
456 F 14	200. Infanterie-Division	[86]	World War I	48	military photos
456 F 15	222. Infanterie-Division (später Oberbaustab 222)	[87]	World War I	51	military photos
456 F 16	28. Reserve-Division	[88]	World War I	24	military photos
456 F 17	75. Reserve-Division	[89]	World War I	30	military photos
456 F 18	8. Landwehr-Division	[90]	World War I	65	military photos
456 F 19	12. Landwehr-Division (später Ostsee-Division und Deutscher General in Finnland)	[91]	World War I	118	military photos
456 F 20	55. Infanterie-Brigade: Feld	[92]	World War I	50	military photos
456 F 22	56. Infanterie-Brigade: Feld	[93]	World War I	20	military photos
456 F 23	57. Infanterie-Brigade: Feld	[94]	World War I	102	military photos
456 F 24	58. Infanterie-Brigade: Feld	[95]	World War I	29	military photos
456 F 25	84. Infanterie-Brigade	[96]	World War I	12	military photos
456 F 26	112. und 513. Infanterie-Brigade	[97]	World War I	64	military photos
456 F 29	56. Reserve-Infanterie-Brigade	[98]	World War I	1	military photos
456 F 31	55. Landwehr-Infanterie-Brigade	[99]	World War I	84	military photos
456 F 32	56. Landwehr-Infanterie-Brigade	[100]	World War I	284	military photos
456 F 74	28. und 29. Kavallerie-Brigade Inspektion der Ersatz-Eskadronen des XIV. Armeekorps	[101]	World War I	5	military photos
456 F 82	Artillerie-Kommandeur 28	[102]	World War I	97	military photos
456 F 83	Artillerie-Kommandeur 29	[103]	World War I	3	military photos
456 F 84	Artillerie-Kommandeur 56	[104]	World War I	176	military photos
456 F 85	Artillerie-Kommandeur 75, 85, 126, 127, 142, 222	[105]	World War I	49	military photos
456 F 137	Oberleitung Grenzschutz Baden-Schweiz	[106]	World War I	30	military photos
456 F 34	Füsilier-Regiment 40	[107]	World War I	76	military photos
456 F 35	Leib-Grenadier-Regiment 109	[108]	World War I	225	military photos
456 F 36	Grenadier-Regiment 110	[109]	World War I	117	military photos
456 F 37	Infanterie-Regiment 111	[110]	World War I	69	military photos
456 F 38	Infanterie-Regiment 112	[111]	World War I	77	military photos
456 F 39	Infanterie-Regiment 113	[112]	World War I	88	military photos
456 F 40	Infanterie-Regiment 114	[113]	World War I	45	military photos
456 F 41	Infanterie-Regiment 142	[114]	World War I	7	military photos
456 F 42	Infanterie-Regiment 169	[115]	World War I	27	military photos
456 F 43	Infanterie-Regiment 170	[116]	World War I	69	military photos
456 F 44	Infanterie-Regiment 185	[117]	World War I	40	military photos
456 F 46	Infanterie-Regiment 469	[118]	World War I	25	military photos
456 F 47	Infanterie-Regiment 470	[119]	World War I	19	military photos
456 F 49	Ersatz-Infanterie-Regiment 28	[120]	World War I	5	military photos
456 F 50	Ersatz-Infanterie-Regiment 29	[121]	World War I	20	military photos
456 F 51	Reserve-Infanterie-Regiment 40	[122]	World War I	75	military photos
456 F 53	Reserve-Infanterie-Regiment 109	[123]	World War I	117	military photos
456 F 54	Reserve-Infanterie-Regiment 110	[124]	World War I	147	military photos
456 F 55	Reserve-Infanterie-Regiment 111	[125]	World War I	255	military photos
456 F 56	Reserve-Infanterie-Regiment 249	[126]	World War I	8	military photos
456 F 59	Landwehr-Infanterie-Regiment 109	[127]	World War I	1	military photos
456 F 63	Landsturm-Infanterie-Bataillone des XIV. Armeekorps	[128]	World War I	20	military photos
456 F 64	Sturmbataillon 7	[129]	World War I	43	military photos
456 F 65	Sturmbataillon 16	[130]	World War I	49	military photos
456 F 66	Feldrekruten-Depot der 28. und 29. Infanterie-Division	[131]	World War I	30	military photos
456 F 70	Feldrekruten-Depot der 8. Landwehr-Division	[132]	World War I	1	military photos
456 F 72	Maschinengewehr-Scharfschützen-Abteilungen, Maschinengewehr-Instandsetzungs-Werkstätten, Maschinengewehr-Schulen	[133]	World War I	8	military photos
456 F 81	Jäger-Regimente 2 und 3, Gebirgsersatz-Abteilung des II. Bataillons des Jäger-Regiments 3, Jäger-Bataillon Stephanus	[134]	World War I	71	military photos
456 F 77	Dragoner-Regiment 22	[135]	World War I	2	military photos
456 F 86	Feldartillerie-Regiment 14	[136]	World War I	118	military photos
456 F 87	Feldartillerie-Regiment 30	[137]	World War I	5	military photos
456 F 88	Feldartillerie-Regiment 50	[138]	World War I	224	military photos
456 F 89	Feldartillerie-Regiment 66	[139]	World War I	272	military photos
456 F 90	Feldartillerie-Regiment 76	[140]	World War I	19	military photos
456 F 91	Feldartillerie-Regimenter 104, 259, 261, 270 Feldartillerie-Abteilungen 298, 1003, 1007 Feldartillerie-Batterien 854, 855, 856, 859, 964 Infanterie-Geschütz-Batterien 3 und 9 Nahkampf-Batterie 220	[141]	World War I	20	military photos
456 F 92	Gebirgsartillerie-Abteilung 5 (mit Gebirgs-Batterien 14, 15, 16), Gebirgsartillerie-Abteilung 6 (mit Gebirgs-Batterien 1, 2, 17), Ersatz-Abteilung der Gebirgsartillerie-Abteilungen 5 und 6, Gebirgskanonen-Batterie "Rennecke"	[142]	World War I	5	military photos
456 F 93	Reserve-Feldartillerie-Regimenter 28, 29, 52, 55	[143]	World War I	8	military photos
456 F 94	Landwehr-Feldartillerie-Regimenter 4, 12, 15	[144]	World War I	225	military photos
456 F 96	Fußartillerie-Regiments-Stäbe 119 und 224	[145]	World War I	1	military photos
456 F 97	Fußartillerie-Regiment 14	[146]	World War I	19	military photos
456 F 98	Fußartillerie-Regiment 24	[147]	World War I	5	military photos
456 F 99	Fußartillerie-Bataillone 33, 61, 68, 98, 151	[148]	World War I	7	military photos
456 F 101	Licht- und Schallmeßtrupps	[149]	World War I	589	military photos
456 F 102	Reserve-Fußartillerie-Regiment 14	[150]	World War I	138	military photos
456 F 103	Reserve-Fußartillerie-Regiment 24	[151]	World War I	34	military photos
456 F 104	Landwehr-Fußartillerie-Bataillone 14, 50, 59, Landsturm-Fußartillerie-Bataillon XIV. A. K.	[152]	World War I	5	military photos
456 F 105	Stabsoffizier der Pioniere Nr. 52, Pionier-Bataillone und Pionier-Bataillonsstäbe, Pionier-Kompanien, Reserve-, Landwehr- und Landsturm-Pionier-Kompanien, Brückentrains des XIV. Armeekorps, selbständige Pionierformationen	[153]	World War I	600	military photos
456 F 106	Minenwerfer-Bataillone, Minenwerfer-Kompanien, selbständige Minenwerferformationen	[154]	World War I	293	military photos
456 F 108	Nachrichtentruppen	[155]	World War I	104	military photos
456 F 119	Kraftfahr-Formationen	[156]	World War I	70	military photos
456 F 110	Train-Formationen des XIV. Armeekorps und Militärische Prüfungsstelle Pforzheim	[157]	World War I	28	military photos
456 F 114	Feldlazarette, Reserve-Feldlazarette und Landwehr-Feldlazarette	[158]	World War I	16	military photos
456 F 115	Sanitäts-Kompanien	[159]	World War I	8	military photos
456 F 117	Krankentransport-Abteilungen, Sanitätsdepots, Lazarettzüge, Krankensichtungsstellen, Krankensammelstellen	[160]	World War I	3	military photos
456 F 118	Durchgangslager-, Garnisons-, Kriegsgefangenen-, Offiziers-, Reserve- und Vereinslazarette	[161]	World War I	15	military photos
456 F 112	Pferdedepots, -lazarette und -sammelstellen	[162]	World War I	17	military photos
456 F 109	Mobile Etappen-Kommandanturen	[163]	World War I	37	military photos
456 F 123	Armierungsformationen und Etappen-Hilfsbataillone	[164]	World War I	131	military photos
456 F 146	Etappen-Inspektion 7	[165]	World War I	9	military photos
456 F 147	Etappen-Inspektion 19	[166]	World War I	35	military photos
456 F 148	Etappen-Inspektion 28	[167]	World War I	81	military photos
456 F 113	Sanitätsamt	[168]	World War I	85	military photos
456 F 124	Proviantämter/-depots, Durchgangslager, Verpflegungsanstalten, Schlächtereien, Kriegsgefangenenstellen, Material- und Veterinärdepots	[169]	World War I	3	military photos
456 F 128	Garnisonkommandos	[170]	World War I	10	military photos
456 F 129	Garnisonverwaltungen und Militärbauamt Rastatt	[171]	World War I	3	military photos
456 F 150	Intendantur des Generalkommandos und des Stellvertretenden Generalkommandos sowie die Abwicklungsintendantur des XIV. Armeekorps	[172]	World War I	6	military photos
456 F 151	Feldintendanturen	[173]	World War I	6	military photos
456 F 143	Kriegstagebücher: Artillerie-, Pionier-, Nachrichten-, Train-, Armierungs-, Sanitäts- und Etappenformationen	[174]	World War I	69	military photos
456 G 2	Fotosammlung der Offiziere des XIV. Armeekorps	[175]	World War I	341	military photos
421 K 1	Eisenbahndirektion/Bundesbahndirektion Karlsruhe: Planrollen	[176]	1832-1993	554	railroad doumentation
J-B	Ansichten von Orten und Landschaften	[177]	1400s-2011	2,988
J-D	Geschichtliche Ereignisse	[178]		171
J-E	Belagerungen und Schlachten	[179]		123
J-G	Revolution 1848-1849 in Baden und in der Pfalz	[180]		108
J-S Karikaturen	Karikaturen zum Vormärz und zur Revolution 1848 - 1849	[181]		265
J-H	Karikaturen	[182]		18
J-L	Mode, Trachten und Berufsgruppen	[183]		200
F-S Neuwirth	Fotografien von Abgeordneten des badischen Landtags	[184]		41
F-S Paulcke	Fotosammlung des Prof. Wilhelm Paulcke (1873-1949)	[185]	1869-1950	476
F-S Wochenschau	Propagandafotos aus dem Ersten Weltkrieg	[186]	1917-1918	255
F-S Schmeiser	Fotografien von Abgeordneten des badischen Landtags	[187]	1908 - 1933	120
J-S Velten	Sammlung Velten: Künstlerpostkarten	[188]	1897-1901	88	postcards
G Technische Pläne I	Technische Pläne I (allgemein), eine Auswahlpräsentation	[189]		40
G Technische Pläne III	Technische Pläne III: Patentschriften	[190]		112
H	Gemarkungspläne	[191]	1500s-1900s	9248	maps
H-1	Gemarkungspläne 1:10000 (farbig)	[192]	1857-1935	1532	maps
O	Neuere Flugblätter	[193]		77
P	Plakate	[194]		15
S Thomas Kellner	Sammlung des Buchhändlers und Antiquars Thomas Kellner	[195]	1494 – 2003	1,016	Photo & map collection
				TOTAL 56,766

Grundbuchzentralarchiv Kornwestheim / Plat book central archive Kornwestheim
238,424 digized files online, only a tiny fraction of the entire archive. Questionable if in commons scope.

Staatsarchiv Ludwigsburg / State Archive Ludwigsburg

Series ID	Series Name	Link	Date	Digitized files	Notes on digitized files
E 18 III	Hof-/Landes-/Staatstheater Stuttgart: Fotos und Graphiken	[196]	1800-ca. 1960	6,995
EL 18 IV	Staatsarchiv Ludwigsburg: Fotosammlung	[197]	1989	1,028
EL 20/4 III a	Regierungspräsidium Stuttgart: Glasplatten zu Straßen- und Wasserbau (bis 9x12 cm)	[198]		430
EL 20/4 III b	Regierungspräsidium Stuttgart: Glasplatten zu Straßen- und Wasserbau (13x18 cm)	[199]		395
EL 20/4 III c	Regierungspräsidium Stuttgart: Glasplatten zu Straßen- und Wasserbau (18x24 cm)	[200]		103
EL 20/4 III d	Regierungspräsidium Stuttgart: Glasplatten zu Straßen- und Wasserbau (24x30 cm)	[201]		5
EL 68 VI	Landesvermessungsamt Baden-Württemberg: Flurkarten der Württemberg. und Hohenz. Landesvermessung (Digitalisate)	[202]	1818-1863	16,672
EL 68 IX	Landesvermessungsamt Baden-Württemberg: Landesbefliegung Baden-Württemberg 1968 - Luftbilder und digitales Orthophoto	[203]	1968	19,568	statewide aerial photography
EL 75 I	Landesamt für Straßenwesen Baden-Württemberg	[204]	1934-2002	1,726
EL 75 VI a	Landesamt für Straßenwesen: Bildersammlung zum Autobahnbau (Papierabzüge auf Karteikarten)	[205]	1930-2000	7,499
EL 75 VI b	Landesamt für Straßenwesen: Bildersammlung zum Autobahnbau (Historische Glasplatten)	[206]		285
EL 75 VI c	Landesamt für Straßenwesen: Bildersammlung zum Autobahnbau (Kleinbild-Dias I)	[207]		676
EL 75 VI d	Landesamt für Straßenwesen: Bildersammlung zum Autobahnbau (Kleinbild-Dias II)	[208]	1982-1999	3,854
EL 221/8	Württembergisches Staatstheater Stuttgart: Dekorationsmappen	[209]	1954-1989	1633
EL 228 a I	Landesdenkmalamt Baden-Württemberg: Fotosammlung, Glasplatten Mittelformate	[210]		4,910
EL 228 a II	Landesdenkmalamt Baden-Württemberg: Fotosammlung, Glasplatten Großformate	[211]		1,193
EL 228 a III	Landesdenkmalamt Baden-Württemberg: Fotosammlung, Glasplatten Klein- und Mittelformate	[212]		4,294
EL 228 a IV	Landesdenkmalamt Baden-Württemberg: Fotosammlung, Glasplatten Kleinformate	[213]		2,199
F 234 VI	Staatliche Heilanstalt Weinsberg: Fotosammlung Kemmler	[214]	1903-1915	1,032
FL 10/5	Polizeidirektion Heidenheim	[215]	World War II	44	flyers dropped by allies encouraging resistence againest Nazi Germany
FL 45/1	Wasserwirtschaftsamt Besigheim	[216]	1751-1995	1,341
FL 410/8 II	Staatliches Hochbauamt Stuttgart I: Pläne	[217]	1896-1992	4,242	building plans
FL 410/12	Staatliches Hochbauamt Ulm II (Bund) – Pläne	[218]	1801-1998	136
FL 420/1	Wilhelma, Zoologisch-botanischer Garten Stuttgart	[219]	1927-2016	2,362	animal photos
K 412 IV	Reichs-/Bundesbahndirektion Stuttgart: Hochbaupläne	[220]	ca. 1840 - ca. 1990	13,421
K 414 I	Reichs-/Bundesbahndirektion Stuttgart: Fotografien	[221]	ca. 1930-1990	6,954
K 414 II	Reichs-/Bundesbahndirektion Stuttgart: Fotografien	[222]	Ca. 1900-ca. 1950	536
K 415	Reichs-/Bundesbahnbetriebsamt Aalen	[223]	1859-1977	479
K 422 II a	Wasser- und Schifffahrtsdirektion Stuttgart: Fotosammlung, Kleinbilddias	[224]	Ca. 1960-ca. 1974	1,224
K 422 II b	Wasser- und Schifffahrtsdirektion Stuttgart: Fotosammlung, Glasplatten 6x9	[225]	1935-1952	120
K 422 II c	Wasser- und Schifffahrtsdirektion Stuttgart: Fotosammlung, Glasplatten 8,5x10	[226]	1928-1957	185
K 422 II d	Wasser- und Schifffahrtsdirektion Stuttgart: Fotosammlung, Glasplatten 9x12	[227]		33
K 422 II e	Wasser- und Schifffahrtsdirektion Stuttgart: Fotosammlung, Glasplatten 13x18	[228]	1,937	4
K 422 II f	Wasser- und Schifffahrtsdirektion Stuttgart: Fotosammlung, Glasplatten 18x24	[229]		33
K 422 II g	Wasser- und Schifffahrtsdirektion Stuttgart: Fotosammlung, Glasplatten 24x30, 30x40	[230]	1930s	238
K 422 II i	Wasser- und Schifffahrtsdirektion Stuttgart: Fotosammlung, Negative 11,5x16,5	[231]	1930-1950	3
K 422 II j	Wasser- und Schifffahrtsdirektion Stuttgart: Fotosammlung, Negative 17x23	[232]		10
K 423	Wasser- und Schiffahrtsamt Heilbronn	[233]		1,095
{PL 4/81}	Unifranck Lebensmittelwerke GmbH Ludwigsburg, Werbemittelarchiv	[234]		103
PL 723	Nachlass Hans Noller: Sammlung zum Eisenbahnwesen in Württemberg	[235]	1844-2011	15,381
PL 734	Fotosammlung Harald Knauer zum Eisenbahnwesen in Württemberg	[236]	1872-1889, 1905-2022	8,825
				TOTAL 131,266

Hohenlohe-Zentralarchiv Neuenstein / Hohenlohe-Central Archive Neuenstein

Series ID	Series Name	Link	Date	Digitized files	Notes on digitized files
GA 100	Handgezeichnete Karten	[237]		865
GA 105	Gedruckte Karten	[238]		30
GA 115	Plansammlungen	[239]		150	building plans
KrA SF 2/1	Fotosammlung: Ehemals eigenständige Gemeinden Adolzfurt – Lassbach	[240]	1860 – 2011	3,983
KrA SF 2/2	Fotosammlung: Ehemals eigenständige Gemeinden Mangoldsall – Zweiflingen	[241]	1860 - 2011	3,691
				TOTAL 8,719

Staatsarchiv Sigmaringen / Sigmaringen State Archive

Series ID	Series Name	Link	Date	Digitized files	Notes on digitized files
FAS H 1/1 T 1	Nachlass Albert Waldenspul: Glasplatten, Diapositive und Alben	[242]		831
FAS H 1/1 T 3	Nachlass Albert Waldenspul: Fotosammlung	[243]		422
FAS Sa	Sammlungen und Nachlässe	[244]	1918-1928	1,188
FAS Sa A 7 T 1	Nachlass Xaver Henselmann	[245]	1899-1918, 1932-1933	203
FAS K	Karten	[246]		199
FAS PA	Pläne, Karten, Zeichnungen	[247]		95
FAS PA St	Stammtafeln	[248]	16.- 20. Jh.	42
Wü 160 T 5	Forstdirektion Tübingen: Luftbilddokumentation	[249]	1952-2008	13,639
Dep. 44 T 2	Sammlung Botho Walldorf, Fotograf, Heimatkundler (geb. 1945)	[250]	1717-2007	1,041
N 1/68	Fotoatelier Kugler in Sigmaringen: Glasplattennegative	[251]	1916-1938	2,215
N 1/78 T 1	Nachlass Robert Arnaud, Kaufmann (1885-1945)	[252]	1871-2005	458	postcards
N 1/85 T 1	Nachlass Heinz Braun, Techniker (geb. 1927)	[253]		181
N 1/89 T 1	Nachlass Werner Rees (geb. 1947)	[254]		85
N 1/96 T 1	Luftbildarchiv Erich Merkler (geb. 1947): Negative	[255]		1,407
N 1/96 T 2	Luftbildarchiv Erich Merkler (geb. 1947): Diapositive	[256]		35
K	Karten und Pläne	[257]		236
				TOTAL 22,277

Hauptstaatsarchiv Stuttgart / Main State Archive Stuttgart

Series ID	Series Name	Link	Date	Digitized files
H 107	Kieser-Ortsansichten	[258]	1680-ca. 1690	974
J 151	Sammlung von Maueranschlägen	[259]	1900-1948	2,736
J 312	Fotoarchiv Blumenthal / von Schoenebeck, Bad Wildbad	[260]	ca. 1896-1965	4,367
J 170	Berichte von Gemeinden über die Kriegsereignisse 1945 und das Ausmaß der Zerstörungen im Zweiten Weltkrieg	[261]	1948-1952, 1955, 1960-1962	9,799
J 311	Luftbildpläne der Jahre 1933-1936	[262]	1933-1936	57
J 313	Landschafts-, Kunst-, Weltkriegs- und Architekturfotografien	[263]	ca. 1910 und ca. 1935	372
J 314	Aufnahmen aus Südwestdeutschland	[264]	1935 - 1939	106
J 317	Fotografien aus dem Nachlass von Christoph Siegfried Langbein (1880-1921)	[265]	1895 - 1905	111
J 319	Fotografien aus dem Nachlass von Edmund Müller, Grafiker und Fotograf, *1900 +1945	[266]		1,434
J 320	Sammlung von Fotographien "Aus dem Leben der Herzogin Wera (*1854, +1912)"	[267]		723
M 700/1	Sammlung von Ortsfotografien	[268]	World War I	1,059
M 700/2	Sammlung von Landschaftsfotografien	[269]	World War I	177
M 700/3	Sammlung von Karten und Planfotografien	[270]	World War I	88
M 700/4	Sammlung von Truppenfotografien	[271]	World War I	132
M 700/5	Sammlung von Personenfotografien	[272]	World War I	65
M 703	Militärhistorische Bildersammlung	[273]	1800s	5,381
M 707	Bildersammlung I	[274]	World War I	7,736
M 708	Bildersammlung II	[275]	World War I	10,657
M 709	Bildersammlung III	[276]	World War I	6,729
N 1	Land- und Flurkarten betreffend Altwürttemberg	[277]		150
N 3	Forstkarten betreffend Altwürttemberg	[278]		766
N 5	Karten des Herzoglich Württembergischen Corps des Guides	[279]		56
N 7	Nachlass Johann Majer, Pfarrer und Kartograph	[280]		123
N 11	Land- und Flurkarten betreffend Neuwürttemberg	[281]		399
N 13	Forstkarten betreffend Neuwürttemberg	[282]		19
N 26	Karten des Benediktinerklosters Ochsenhausen und des Fürstentums Ochsenhausen der Fürsten von Metternich-Winneburg	[283]		81
N 28	Karten des Prämonstratenserklosters Rot an der Rot und der Herrschaften Wartenberg-Rot und Erbach-Wartenberg-Rot	[284]		16
N 30	Karten des Prämonstratenserklosters Schussenried	[285]		158
N 34	Karten des Benediktinerklosters Weingarten	[286]		127
N 36	Karten und Pläne des Prämonstratenserklosters Weißenau	[287]		37
N 40	Karten des Benediktinerklosters Zwiefalten	[288]		115
N 60	Land- und Flurkarten betreffend Gebiete des Kurfürstentums bzw. Königreichs Württemberg	[289]		109
N 70	Forstkarten betreffend Gebiete des Königreiches Württemberg	[290]		550
N 100	Ältere gedruckte Karten	[291]		2,676
N 200	Pläne und Zeichnungen betreffend Altwürttemberg aus der Zeit bis 1806	[292]		1,238
N 201	Pläne und Zeichnungen betreffend Neuwürttemberg und Gebiete außerhalb von Württemberg aus der Zeit bis 1806	[293]		45
N 205	Pläne und Zeichnungen betreffend Württemberg ab 1806 und Württemberg-Baden bzw. Baden-Württemberg ab 1945	[294]		38
				TOTAL 82,681

Staatsarchiv Wertheim / Wertheim State Archive

Series ID	Series Name	Link	Date	Digitized files
S-N 70	Fotosammlung Wehnert	[295]	1882-2013	3,741
K-LRA 91	Kreisbildstelle / Kreismedienzentrum Main-Tauber-Kreis	[296]	1935-2004	3,095
				6,836

TOTAL 649,253

License

All content is either public domain (age/created by government), and if not, everything is licenced under CC BY 3.0. (see terms of use: Some of the digitized archive records in the finding aid system of the Landesarchiv Baden-Württemberg are still subject to copyright protection, some are exempt from copyright protection as official works, and some are in the public domain. Where they are protected by copyright, the State Archives hold the corresponding exploitation rights and grant a Creative Commons CC-BY license.)
The source "Landesarchiv Baden-Württemberg" and the archive reference ID of the image needs to be mentioned when using the image - this information is also always present on the image when downloading fron the website.

Description

Do the media URLs follow a pattern?

Yes, from what I can see: Each image can be viewed in an image viewer. The image viewer can be accessed from the archive series page (example)
Those image viewer links look like this: https://www2.landesarchiv-bw.de/ofs21/bild_zoom/zoom.php?bestand=SERIES_ID&id=IMAGE_ID&gewaehlteSeite=FILE_NAME (for example https://www2.landesarchiv-bw.de/ofs21/bild_zoom/zoom.php?bestand=21715&id=2836819&gewaehlteSeite=02_0001128649_0001_2-1128649-1.png)
The image file can be downloaded by using https://www2.landesarchiv-bw.de/ofs21/bild_zoom/download.php?&id=IMAGE_ID&bilddatei=FILE_NAME (example https://www2.landesarchiv-bw.de/ofs21/bild_zoom/download.php?&id=2836819&bilddatei=02_0001128649_0001_2-1128649-1.png)

Does the site have an API?

No.

Is there a template that could be used on the file description pages, or should one be created?

Since there is a huge amount of files, creating one might by a good idea.

TheImaCow (talk) 09:28, 19 May 2024 (UTC)[reply]

I started scraping and it is going well. Do you know of files that fall under CC BY 3.0 ? -- DaxServer (talk) 07:55, 13 August 2024 (UTC)[reply]

All 25,700,000 digitized objects do, per the terms of use.

There is already {{Landesarchiv-bw-image}}, which references

- the archive location (collapsed sections above, see the "Table of identification codes" on the template documentation)

- the archive signature of the object (such as "W 145/4 Nr. 0091")

- the permanent link ID to the digitized object such as "5-790216-1" (http://www.landesarchiv-bw.de/plink/?f=5-790216-1)

This can be used in an information template "accession_number" field.

As for the |source= field, there is clear guidance: We cite the archive signature, if known the author and the permalink, such as

"Landesarchiv Baden-Württemberg, Staatsarchiv Freiburg W 145/4 Nr. 0091 / Fotograf: Leopold Adler"

Example without known/specified author:

"Landesarchiv Baden-Württemberg, Staatsarchiv Ludwigsburg K 414 I Nr 273"

As for the licencing template, I think we should use a custom version of the {{Cc-by-3.0}} template, I created {{LABW}} (for parameter 1, use the same value as for "source" described above)

What data exactly is there to scrape? Is there a way to tell if an object is a photograph or something else (map, drawing, text)? (to decide weather to use {{Photograph}}) ~TheImaCow (talk) 11:03, 13 August 2024 (UTC)[reply]

I'm scraping the metadata and the images associated with an object. An object can contain one or more [digitalized] images (of course, only those Findbücher that are digitalized).

The metadata can be accessed by clicking the Details in the Strukturansicht view. URL format - https://www2.landesarchiv-bw.de/ofs21/olf/druckansicht.php?id_titlaufn=IMAGE_ID&bestand=SERIES_ID In the example above from permalink http://www.landesarchiv-bw.de/plink/?f=5-790216-1 - click on the Details, it will open a small popup with info. The direct URL would be https://www2.landesarchiv-bw.de/ofs21/olf/druckansicht.php?id_titlaufn=3092053&bestand=23318
- From this view, this particular object has good metadata to determine what it is, for example:
  - Art der Infomation = Bild
  - Art der Vorlage = Glasplatte
  - Format = 16 x 11 cm
- Not all objects have such good info.
The info on the images can be obtained from the thumbnail view - that's the magnifying glass icon in the Strukturansicht view. URL format - https://www2.landesarchiv-bw.de/ofs21/bild_zoom/thumbnails.php?bestand=SERIES_ID&id=IMAGE_ID An example would be
- https://www2.landesarchiv-bw.de/ofs21/bild_zoom/thumbnails.php?bestand=7564&id=7278223 for multiple images all of which have the same Bestellsignatur
- https://www2.landesarchiv-bw.de/ofs21/bild_zoom/thumbnails.php?bestand=23318&id=3092053 for single image

I haven't yet explored the data that I've gathered so far, so I've limited info atm. -- DaxServer (talk) 12:13, 13 August 2024 (UTC)[reply]

Things I noted regarding file names - some images have no title (eg "Keine Angabe"), then we should only use the archive ID (PL 723 DK 94-2), and if the images have title, we should probably start the file name with the archive ID, because they are sometimes sorted by date or relation etc, and this preserves the original file order in the categories, e.g. "PL 723 DK 55-20 - Bf. Kupfer", where the "20" is a running number.

I removed some series from the above table...

- T 1 (Zugang 2008/0013)/SERIES ID 22257 only text documents

- T 1 (Zugang 2008/0032)/22664 only low quality scans of personal photos, probably out of scope

- T 1 (Zugang 1983/0018-01)/10461 only text documents

- J 153/political party advertising often recently collected by the archive, unlikely if actually free

- Q 2/50 - 17,000 negative photo bags containing ~520,000 individual photographs, unfortunaly low scan quality, so extracted images are of very poor quality

All objects seem to have their "category tree" as part of the description, such as

Staatsarchiv Ludwigsburg

\/

Deposita, nichtstaatliche Archive und Nachlässe / 1335-1997

\/

Nachlässe (ohne Deposita)

\/

PL 723 Nachlass Hans Noller: Sammlung zum Eisenbahnwesen in Württemberg / Ca. 1844-2011

\/

2. KB-Dias

\/

Stuttgart - Horb ("Gäubahn")

\/

Diakasten 18: Stuttgart - Horb I

at this object: http://www.landesarchiv-bw.de/plink/?f=2-5340602-1

I think it would be a good idea to automatically copy these category trees as commons categories, as this would make the manual categorisation needed for the images significantly easier. Such a category tree has already been created for the US National Archives upload, see Category:US National Archives Record Groups. ~TheImaCow (talk) 12:42, 22 August 2024 (UTC)[reply]

Opinions

Assigned to	Progress	Bot name	Category

2023 Army-Navy Games photos from Naval Academy flickr album

Source to upload from

https://www.flickr.com/photos/west_point/albums/72177720313329606/

License

Description

All of these photographs are of the 2023 Army-Navy Game. Therefore, they should be uploaded and placed in Category:2023 Army-Navy Game

While all images are Army photographs taken by Army photographers in the course of their duty (and therefore are available for use under the license {{PD-USGov-Army}}), the Army did not list them under a public domain license when they uploaded them to Flickr, making it impossible for me to upload them on the Flickr2commons tool or at the UploadWizzard. Furthermore, the album has more than 500 photographs, making it impossible to download the album in its entirety as a zip file (Flickr only allows you to do that in albums that are at max 500 photographs).

If there was a tool made available to me that would allow uploading similar to Flickr2commons in instances when PD images were added to flickr without being identified as PD that'd be greatly appreciated. Otherwise, if someone with access to a tool of that kind can do that for me, that'd also be great.

These images capture a notable annual sporting event, feature notable military figures in attendance, capture many traditions of American football, and also capture notable collegiate football players. Therefore, they would be valuable to the project. Once these are uploaded I plan to sift through them and identify individuals of note, and sub-categorize some of the images as well. So please let me know when this task is accomplished.

The file description would ideally copy what each file is captioned as on Flickr. If not, something like "2024 Army-Navy Game" might suffice. Ideally, each photo would be linked under "source" to the photo's individual Flickr hyperlink. If not, they could simply be linked to the album's hyperlink.

Ideally, each photograph should have a title that identifies which number photo it is on Flickr.

SecretName101 (talk) 07:37, 7 May 2024 (UTC)[reply]

Similarly, I'd like to have the files on https://www.flickr.com/photos/west_point/albums/72177720312669739/with/53329769305 (same circumstances, Army-West Point flickr account album with more than 500 photos taken by Army photographers) be uploaded and placed in the category Category:Holy Cross Crusaders at Army Black Knights football (November 11, 2023) SecretName101 (talk) 21:03, 7 May 2024 (UTC)[reply]

Similarly:

Files in https://www.flickr.com/photos/west_point/albums/72177720304371730/ should be uploaded to Category:2022 Army-Navy Game
Files in https://www.flickr.com/photos/west_point/albums/72177720303889429/ should be uploaded to Category:Connecticut Huskies at Army Black Knights football (November 19, 2022)
Files in https://www.flickr.com/photos/west_point/albums/72177720311110044/ should uploaded to Category:Delaware State Hornets at Army Black Knights football (September 9, 2023)
Files in https://www.flickr.com/photos/west_point/albums/72177720302193078/ should be uploaded to Category:Villanova Wildcats at Army Black Knights football (September 17, 2022)
Files in https://www.flickr.com/photos/west_point/albums/72177720302947119/ should be uploaded to Category:Colgate Raiders at Army Black Knights football (October 15, 2022)
Files in https://www.flickr.com/photos/west_point/albums/72177720302041772/ should be uploaded to Category:UTSA Roadrunners at Army Black Knights football (September 10, 2022)

Also, Flickr is having trouble zipping the album https://www.flickr.com/photos/west_point/albums/72177720313311296/ , the contents of which should be uploaded and added to Category:2023 Army–Navy Gala SecretName101 (talk) 21:08, 17 May 2024 (UTC)[reply]

Opinions

Assigned to	Progress	Bot name	Category

Deepin icons

Deepin's icons.

Source to upload from

https://github.com/linuxdeepin/deepin-icon-theme

License

This work is free software; you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation; either version 3 of the License, or any later version. This work is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See version 3 of the GNU General Public License for more details.

Description

Do the media URLs follow a pattern?

Sure do. Example: https://github.com/linuxdeepin/deepin-icon-theme/blob/master/Sea/apps/scalable/accessories-text-editor.svg

Does the site have an API?

Yes.

What else could ease uploading?

Not sure.

Did you contact the site owner?

Nope.

Is there a template that could be used on the file description pages, or should one be created?

User:Psiĥedelisto/Deepin icons

Psiĥedelisto (talk • contribs) ^{please always ping!} 18:51, 3 July 2023 (UTC)[reply]

Opinions

Assigned to	Progress	Bot name	Category
	Half done		Category:Deepin Icon Theme

@Psiĥedelisto: Hi! I looked deeper into this and uploaded the majority of the icons. Unfortunately, some icons are covered by copyright, as they are derivative work. --PantheraLeo1359531 😺 (talk) 19:17, 20 January 2024 (UTC)[reply]

IBM Research on YouTube

Source to upload from

https://www.youtube.com/@ibmresearch/videos

License

Virtually all uploads to the IBM Research channel are licensed under the Creative Commons Attribution 3.0 Unported license, per the License tag in the description of each video.

Description

784 videos (and counting, as of the time I'm writing this) of pure IBM and technology-related gold. Lots of great photography and headshots to extract from these. Some of the content therein may contain non-free elements over the de minimis threshold, but from what I've watched so far those are few and far in between. Would be trivial to download all videos using youtube-dl; reencoding each video to fit within the 100 MB upload limit is a different story however.

DigitalIceAge (talk) 04:52, 17 November 2023 (UTC)[reply]

Opinions

Assigned to	Progress	Bot name	Category
			Videos by IBM Research

Try to upload files from time to time --PantheraLeo1359531 😺 (talk) 18:16, 22 August 2024 (UTC)[reply]

Newspapers by Feureau in Internet Archive

Source to upload from

https://archive.org/details/%40feureau

License

Public domain

Description

This user has uploaded more than a thousand old newspapers from Indonesia, as well as Dutch magazine Tong Tong.

Please help importing the newspaper here, they would make a great addition to Category:Newspapers_of_Indonesia

Bennylin (yes?) 18:29, 18 February 2023 (UTC)[reply]

Opinions

Assigned to	Progress	Bot name	Category

Babad Diponegoro from Internet Archive

Source to upload from

Volume 1 https://archive.org/details/eap-1268-babad-diponegoro-v-1-0001/Babad%20Diponegoro%20Jilid%201/EAP1268_Babad_Diponegoro_V1_0006.jpg 15+ GB of jpgs
Volume 2 https://archive.org/details/eap-1268-babad-diponegoro-v-2-0001 ? GB of jpgs

License

Public Domain

Description

The IA didn't provide DJVU nor PDF format, only zipped JPGS (1429 files and 1303 files) Bennylin (yes?) 06:57, 15 February 2023 (UTC)[reply]

I found another file, same, no pdf/djvu

https://archive.org/details/supratman

Opinions

Assigned to	Progress	Bot name	Category

VK Icons

Source to upload from https://github.com/VKCOM/icons/tree/master/src/svg

License MIT

Description Examples of uploaded files Category:VK Icons

Артём 13327 (talk) 17:44, 21 October 2022 (UTC)[reply]

Opinions

Assigned to	Progress	Bot name	Category

fluentui-emoji

Source to upload from https://github.com/microsoft/fluentui-emoji/tree/main/assets

License MIT

Description

Артём 13327 (talk) 17:41, 21 October 2022 (UTC)[reply]

Opinions

Assigned to	Progress	Bot name	Category

CC0 ant micro-CTs

X-ray microtomograms of ants

Source to upload from

Dryad Subject Area: cybertype - Blacklight Search Results

License

Creative Commons CC0 License (Q6938433) (rationale)

Description

what makes it valuable to Wikimedia Commons?
- expands the selection of biology-focussed STLs in the repository.
Do the media URLs follow a pattern?
- https://datadryad.org/stash/downloads/file_stream/[five-digit number]
Does the site have an API?
- According to https://datadryad.org/stash/our_platform#architecture-and-implementation ...maybe?
What else could ease uploading?
- If the files are categorised per 'stash' (dataset/DOI) with informational text or template, then a subset of files can be uploaded with the others easy to find and also upload later, similarly to the PLoS import
- See also https://antwiki.org/wiki/index.php?title=Special:CargoQuery&limit=500&offset=100&tables=Economolab3D&fields=_pageName%3DPage%2CName%3DName%2CGenus%3DGenus%2CCaste%3DCaste%2CView%3DView%2CLink%3DLink%2CSpecimenIdentifier%3DSpecimenIdentifier%2CInstitution%3DInstitution%2CNotes%3DNotes&max+display+chars=300, derived from the source above
Did you contact the site owner?
- no, not needed for CC0 license
Is there a template that could be used on the file description pages, or should one be created?
- {{Sketchfab}}

Arlo James Barnes 23:01, 10 June 2022 (UTC)[reply]

Opinions

Assigned to	Progress	Bot name	Category

Denkmalatlas Niedersachsen

Images of cultural monuments in Lower-Saxony, Germany.

Source to upload from

https://denkmalatlas.niedersachsen.de/viewer/

License

CC BY-SA 4.0

Description

Images of cultural monuments in Lower-Saxony, Germany, from the "Denkmalatlas Niedersachsen" project of the Lower Saxony State Office for Heritage Conservation. The project offers exterior shots of the monuments. Photos shot from public space are permitted in accordance with the freedom of panorama in Germany. For published photos shot on private property, the State Office has the consent of the property owner. In the "Denkmalatlas", all photos are published with the license CC BY-SA 4.0.

Timk70 (talk) 16:00, 10 June 2022 (UTC)[reply]

Opinions

Assigned to	Progress	Bot name	Category

Bull of Heaven

Source to upload from

https://archive.org/details/BullOfHeaven

License

Most of these are in the public domain, but a few are non-commercial.

Description

The majority of these files are audio files to be uploaded to Category:Audio files of music by Bull of Heaven or its subcategory Category:Roman Numeral series, and the ones with three-digit numbers in front of them will have their titles formatted like the examples already in that category. However, it should be noted that Bull of Heaven is a very avant-garde band, and so, not all of their releases will have a single OGG file that contains all of the music for that release. In that case, I'll be skipping it. Let me know if you have any questions.

Lizardcreator (talk) 21:23, 27 May 2022 (UTC)[reply]

Opinions

Assigned to	Progress	Bot name	Category

Editora Fi

Source to upload from

https://www.editorafi.org/catalogo

License

CC BY-SA 4.0

Description

Open Access/CC books perfect for Wikisource. I believe that it is everything on Google Drive. It needs an specific template. Erick Soares3 (talk) 12:36, 9 March 2022 (UTC)[reply]

Opinions

Assigned to	Progress	Bot name	Category

SciELO Books

Source to upload from

http://books.scielo.org/; https://archive.org/details/scielobooks; https://archive.org/details/@scielo_books
License

Several types of Creative Commons (including non-commercial) and Public Domain.

Description

SciELO Books, part of Scielo Brazil (also an amazing source for Wikisource), have a partnership with several academic publishers to release or re-release their works on Open Access, be CC or Public Domain.

Since it is clearly legal, it should be an amazing resource for Wikisource and the Wikimedia in general.

The bot should be able to read the archive and select the ones with Wiki Commons friendly licenses. Internet Archive some works released as CC BY-4.0 are registered as non-commercial (example). A similar thing also happens on the main website: 1 and 2. Would be nice if the bot could compare the main website and the Internet Archive collection for missing files and check at least once a month for new works released into Wiki friendly licenses.

It is necessary an official template. Thanks, Erick Soares3 (talk) 20:38, 6 March 2022 (UTC)[reply]

Opinions

Assigned to	Progress	Bot name	Category

Official Journal of the European Union

Source to upload from

Eurlex: https://eur-lex.europa.eu/oj/direct-access.html

License

{{PD-EUGov}} for the EU itself and {{PD-EdictGov}} for the US side. Admittedly, I'm not sure how this will work for issues before certain dates (eg. before the act of 2011 mentioned in {{European Union Government}}) or prior to the EU's existence (i.e. during the time of the European Coal and Steel Community). However, due to the content being official legislation and communication I think it should be OK.

Description

Describe the content to be uploaded in detail (audio files, images by …), and what makes it valuable to Wikimedia Commons.

The files will be PDF copies of all issues of the Official Journal of the European Union (OJEU), the official gazette of the EU. Thus, it would be a useful resource for EU legislation and communications.

Do the media URLs follow a pattern? Yes: https://eur-lex.europa.eu/legal-content/[lang-code(EN, FR etc)]/TXT/PDF[or other format, eg. TXT, HTML]/?uri=OJ:[type, eg. C for Communications (Information and Notices), L for Legislation]:[year, 1952-2022]:[issue]:FULL, for example, the latest C-type issue in PDF format as of Feb. 8, 2022 is at https://eur-lex.europa.eu/legal-content/EN/TXT/PDF/?uri=OJ:C:2022:056:FULL and the earliest issue in German is at https://eur-lex.europa.eu/legal-content/DE/TXT/PDF/?uri=OJ:A:1952:001:FULL.
Does the site have an API? Somewhat. The official API can't be used for document downloads. There are bulk downloads available for issues since 2004 on the EU Open Data Portal. (per https://eur-lex.europa.eu/content/welcome/data-reuse.html) However, there is a third party API (http://api.epdb.eu/) for downloading EU legislation.
What else could ease uploading? Not sure.
Did you contact the site owner? No, the copyright templates should cover at least post-2011 documents, but I think as the OJEU is legislation and official communications it should be fine as previously mentioned.
Is there a template that could be used on the file description pages, or should one be created? One should probably be created.

MSG17 (talk) 14:47, 8 February 2022 (UTC)[reply]

Opinions

Assigned to	Progress	Bot name	Category

Biblioteca Digital Hispánica

Source to upload from: Photography collection from the Biblioteca Digital Hispánica: search query
- Do the media URLs follow a pattern? metadata permalink, viewer permalink, JPEG deep link
- Does the site have an API? No
- What else could ease uploading? (is the site valid XHTML, do they use a WCM…?) The HTML is quite well-formed and follows an homogeneous structure, although metadata tabulation is a bit weird.
- Did you contact the site owner? No

Describe the works to be uploaded in detail (audio files, images by …): This request is for a subset of this Digital Library covering photographies and engravings. Note that the JPEG deep link provided above is valid only to fetch the first page of the document. For this collection, most (all?) works are a single page.

Which license tag(s) should be applied? It depends on the work. I think it should generally be PD-old-assumed, and in some cases PD-old-70 and PD-old-100.

Is there a template that could be used on the file description pages? Do you think a special template should be created? I created a manual sample here: File:Retrato de Mariano Ballestero (1869).jpg

I already have a scraper and (work in progress) page generator for this collection. So I can help to provide everything in the required format. Anyway, I think the bulk of pending work is probably identifying author and the right license tag for each work.

MarioGom (talk) 21:15, 12 October 2020 (UTC)[reply]

Opinions

Assigned to	Progress	Bot name	Category

Perry–Castañeda Library Map Collection

Source to upload from: http://legacy.lib.utexas.edu/maps/ams/
- Do the media URLs follow a pattern?
  The urls themselves, so far as I can work out, don't, but in the same way as in Adobe Acrobat Pro you can set it to go down a list of web links to generate a single pdf, a bot may be able to too
- Does the site have an API?
  Bit technical, but I dont' think so
- What else could ease uploading? (is the site valid XHTML, do they use a WCM…?)
  I don't know
- Did you contact the site owner?
  No
Describe the works to be uploaded in detail (audio files, images by …): vast series of maps generated by the US Army Map Service (i.e., PD-USGov-Military) in the Perry–Castañeda Library Map Collection, The University of Texas at Austin
Which license tag(s) should be applied? PD-USGov-Military
Is there a template that could be used on the file description pages? Do you think a special template should be created? In terms of the file naming convention, this could follow that of the site, i.e, the top of each page has the series, a credit to the US AMS, and the date, then each map file has the name of the map, the sheet number (for the index pages, cross-references from adjoining maps etc), and the scale

NB there are already some files at Category:India maps by U.S. Army Map Service (plus various other individual uploads etc within Category:Maps by the United States Army Map Service), and it looks from below on this page and eg this commons image that "Slick-o-bot" may have been used in 2012 to upload some or all of these (I'm most keen on the various Japan-related maps (especially the 3x Honshu 1:50,000 series) but imagine every region would benefit).
This would be a mind-bogglingly great addition, thank you, Maculosae tegmine lyncis (talk) 14:08, 13 August 2020 (UTC)[reply]

PS, these are much more detailed than google maps - and the labelling is in English (with some Japanese too), Maculosae tegmine lyncis (talk) 19:27, 21 August 2020 (UTC)[reply]

Opinions

Assigned to	Progress	Bot name	Category

Claremont Colleges Digital Library

Source to upload from: https://ccdl.libraries.claremont.edu/digital/collection/bce
- Do the media URLs follow a pattern? Yes
- Does the site have an API? Not sure
- What else could ease uploading? (is the site valid XHTML, do they use a WCM…?) Not sure
- Did you contact the site owner? No

Describe the works to be uploaded in detail (audio files, images by …):

All photos in the Boynton Collection of Early Claremont, all of which are dated prior to 1925. If it's not too much trouble, it would also be very nice to have all photos in the Claremont Colleges Photo Archive and City of Claremont History Collection dated prior to 1925.

Which license tag(s) should be applied? {{PD-US-expired}}

Is there a template that could be used on the file description pages? Do you think a special template should be created? Not sure

Sdkb (talk) 07:42, 8 August 2020 (UTC)[reply]

Opinions

Were these photos published prior to 1925, or merely taken prior to then? Publication needs to be pre-1925 for {{PD-US-expired}} to be allowed. Pi.1415926535 (talk) 08:25, 8 August 2020 (UTC)[reply]

@Pi.1415926535: The about page states The collection ... is believed to have come to Pomona College included with the papers of Charles Luther Boynton, a Pomona College alumnus and missionary to China. Boynton himself graduated from Pomona around 1900. I can't say for sure the year his papers came into possession of the college, though (which I assume would be the date of publication?). The library would probably tell us if we asked, though. Sdkb (talk) 05:47, 10 August 2020 (UTC)[reply]

Acquisition by the college would not be considered publication for the purposes of copyright. Only use in a publicly released printed material, or on a webpage, is considered publication. Pi.1415926535 (talk) 06:48, 10 August 2020 (UTC)[reply]

@Pi.1415926535: does being added to a library not count as publication? The collection has presumably been housed in the special collections department and publicly available to anyone who requested access since it was obtained. Sdkb (talk) 20:55, 10 August 2020 (UTC)[reply]

A collection merely being in a library does not constitute publication, by my reading. Under copyright law, publication is the distribution of copies or phonorecords of a work to the public by sale or other transfer of ownership or by rental, lease, or lending. Offering to distribute copies or phonorecords to a group of people for purposes of further distribution, public performance, or public display also constitutes publication. (From here.) Is the death date of Boynton known? If it was before 1950, then {{PD-old-70}} applies. Pi.1415926535 (talk) 23:14, 10 August 2020 (UTC)[reply]

@Pi.1415926535: According to here, Boynton died in 1961, so not quite. The above would seem to me to indicate being in a library counts, though, because of lending, which is what a library does. Sdkb (talk) 19:11, 11 August 2020 (UTC)[reply]

A collection in the library would be the originals (not copies) and is likely for use only in the library (not lending). I understand that you wish to have this collection available on Commons, but from the available evidence I do not believe the images are public domain. Pi.1415926535 (talk) 20:58, 11 August 2020 (UTC)[reply]

Assigned to	Progress	Bot name	Category

Balinese Lontar from Internet Archive

Source to upload from: http://archive.org/details/Bali
- Do the media URLs follow a pattern? yes
- Does the site have an API? yes
- What else could ease uploading? (is the site valid XHTML, do they use a WCM…?) N/A
- Did you contact the site owner? yes

Describe the works to be uploaded in detail (audio files, images by …):
- Balinese Lontar (palm-leaf manuscripts) from the Internet Archive's Bali collection
- Each manuscript is a PDF containing photographs of the originals
- This batch upload is in connection with an active project grant.

Which license tag(s) should be applied?

{{PD-scan}}, following the behavior of the ia-upload tool.

Is there a template that could be used on the file description pages? Do you think a special template should be created?

Yes. I will follow the ia-upload template closely when doing the batch upload. I will use a short python script that aggregates info from the Internet Archive API and sends each upload request via pywikibot. If necessary I will create a bot account for this purpose. There are approximately 2700 items to upload.

Lautgesetz (talk) 01:03, 4 July 2020 (UTC)[reply]

Opinions

Assigned to	Progress	Bot name	Category

Catalog of Copyright Entries

Source to upload from: https://archive.org/details/copyrightrecords?&sort=-date
- Do the media URLs follow a pattern? Unsure.
- Does the site have an API? Unusre, but there seems to be an RSS feed - Not sure if it contains all entries.
- What else could ease uploading? (is the site valid XHTML, do they use a WCM…?)

Commons has tools for upload transfer from IA.

- Did you contact the site owner?

No.

Describe the works to be uploaded in detail (audio files, images by …):

Scanned volumes (647) consisting of the Catlog of Copyright Entries volumes for the United States for the period 1891-1977/8)

Which license tag(s) should be applied?

Is there a template that could be used on the file description pages? Do you think a special template should be created?

No new templates are required, additional fields could be added in {{Book}} or {{Information}}

ShakespeareFan00 (talk) 07:37, 3 June 2020 (UTC)[reply]

Opinions

Assigned to	Progress	Bot name	Category
Fæ	Completed	Fæ	Category:Catalogs of Copyright Entries

Commons:Batch uploading/Modern Sketch

Source to upload from: This Complete Gallery
- Do the media URLs follow a pattern? There are 39 links. Inside of each link there are all the pages of every issue, in order.
- Does the site have an API? I don't know
- What else could ease uploading? (is the site valid XHTML, do they use a WCM…?)
- Did you contact the site owner? It's Public Domain

Describe the works to be uploaded in detail (audio files, images by …):

Each one of the 39 issues of Chinese magazine "Modern Sketch". They are in public domain for the reasons given in the following parametre. All the pages can be uploaded.

Which license tag(s) should be applied?

PD-China and PD-1996

Is there a template that could be used on the file description pages? Do you think a special template should be created?

No Special Template: PD-China and PD-1996 as license and Category:Modern Sketch as Category. TaronjaSatsuma (talk) 10:29, 18 February 2020 (UTC)[reply]

Opinions

Assigned to	Progress	Bot name	Category Modern Sketch

Japanese Homes and their surroundings

Source to upload from: List of files, List of illustrations with names assigned to each number. It would be really nice if the figures contained their original names in teh uploaded filenames.
- Do the media URLs follow a pattern?:
  - Yes. https://www.gutenberg.org/files/52868/52868-h/images/fig001.jpg - https://www.gutenberg.org/files/52868/52868-h/images/fig307.jpg
  - Note that combined figures fig114_117.jpg and fig188_192.jpg do not follow this pattern; titldeco.jpg is a lower-res red version of one of the other illustrations, used as a frontispiece, and can be omitted.
- Does the site have an API?:
  - I assume that Gutenberg has an API. If someone can point me at instructions on how to use it with Commons, I might be able to do this myself; I assume this is a beaten path...
- What else could ease uploading? (is the site valid XHTML, do they use a WCM…?):
- Did you contact the site owner?:
  - No, for Gutenberg this seems redundant.
  - I uploaded some manually already, with permission, from another site (names files with pattern https://www.kellscraft.com/JapaneseHomes/JapanHomes001.jpg, to JapanHomes301.jpg, 129-130 are duplicates, figure numbers do not align with file names, so combined illustrations cause no disruption to sequential numbering). The Gutenberg images are in better in many, but not all, cases (higher-res, better scan).
  - The same book is also at [297], but the images seem to be worse.

Describe the works to be uploaded in detail (audio files, images by …):
- All of the illustrations from a PD book, jpgs, architectural line drawings by Creator:Edward S. Morse.

Which license tag(s) should be applied?:
- {{PD-old-70-1923}}
- note: five years from PD-100

Is there a template that could be used on the file description pages? Do you think a special template should be created?
- {{Creator:Edward S. Morse}} If uploaded to Category:Japanese Homes and Their Surroundings (1885 book), I will manually categorize them and add descriptions. A special template seems redundant.

Thank you! HLHJ (talk) 04:17, 4 February 2020 (UTC)[reply]

Opinions

Assigned to	Progress	Bot name	Category

Baseball Hall of Fame

The National Baseball Hall of Fame and Museum is releasing a larger portion of their collection online lately, many are in the public domain. See for example this collection on Honus Wagner https://collection.baseballhall.org/PASTIME/wagner-honus-0?page=7&fbclid=IwAR2cYYBeEMTsN_PGJFCXB5qqYoTtcBBCPkGwFGh3NqUbNtYYww7OWHizdvA

Is there a practical way to batch extract and upload files that are tagged with "http://rightsstatements.org/vocab/NoC-US/1.0/" under the "Copyright note" section? They basically confirm which files are in the public domain. Or they will sometimes post in that same section "The National Baseball Hall of Fame and Museum is not aware of any U.S. copyright or any other restrictions in the documents."

Oaktree b (talk) 02:16, 23 November 2019 (UTC)[reply]

Opinions

Assigned to	Progress	Bot name	Category

OpenUp RBINS Beetles collection

Source to upload from: http://projects.biodiversity.be/openuprbins/
- Do the media URLs follow a pattern? Yes: http://projects.biodiversity.be/openup/rbins/pictures_only/<PICTURE_ID>.jpg
- Does the site have an API? No
- What else could ease uploading? (is the site valid XHTML, do they use a WCM…?) Since I helped build the website, I have a CSV file containing metadata for each picture: scientific name, family, location where the beetles was collected, photographer name, ...
- Did you contact the site owner? Yes. They approve the upload of medium resolution images (such as on the existing website), and may approve later higher resolution versions of those.

Describe the works to be uploaded in detail (audio files, images by …): 4,074 detailed pictures of 1,926 different beetles species. See content on http://projects.biodiversity.be/openuprbins/
Which license tag(s) should be applied? {{CC-BY-SA-4.0}}
Is there a template that could be used on the file description pages? Do you think a special template should be created?

Niconoe (talk) 09:12, 26 June 2019 (UTC)[reply]

Opinions

Assigned to	Progress	Bot name	Category

GeoDIL

There are 3096 pictures of rocks and minerals.

Source to upload from: https://geodil.dperkins.org/
- Do the media URLs follow a pattern? The images themselves are /i/NUMBER.jpg. The pages for the images are /h/NUMBER.html. Numbers range from 1-3144 with some gaps.
- Does the site have an API? No.
- What else could ease uploading? (is the site valid XHTML, do they use a WCM…?) The site owner uses a script to generate the HTML and the sitemap, /sitemap.xml. That data could be modified if it would make uploading significantly easier. On the back end, information is stored in a CSV, /db/details.csv, should that be useful.
- Did you contact the site owner? Site owner: Douglas Perkins.

Describe the works to be uploaded in detail (audio files, images by …): JPGs of rocks and minerals. Most of these were taken by people working on the GeoDIL project at the University of North Dakota, 2001-2002.

Which license tag(s) should be applied? 2,711 are CC Zero, and 14 are government works and PD. The remainder are not freely licensed. All licensed images are noted as such on their HTML pages, and it's also in the sitemap.

Is there a template that could be used on the file description pages? Do you think a special template should be created?

Douglas Perkins (talk) 01:14, 10 March 2019 (UTC)[reply]

Opinions

Assigned to	Progress	Bot name	Category

NPGallery

Source to upload from: https://npgallery.nps.gov/
- Do the media URLs follow a pattern? https://npgallery.nps.gov/AssetDetail/<GUID>
- Does the site have an API? Unknown
- What else could ease uploading? (is the site valid XHTML, do they use a WCM…?) Unknown
- Did you contact the site owner? No

Describe the works to be uploaded in detail (audio files, images by …):

"NPGallery supports a wide array of digital asset file types (images, MS office formats, adobe pdfs, audio files, videos)." We would, I think, be primarily interested in their photographs of national parks.

Which license tag(s) should be applied?

{{PD-USgov}} may apply to many images, but they need to be checked individually. This could probably be automated to some degree.

Is there a template that could be used on the file description pages? Do you think a special template should be created?

Standard templates such as {{Photograph}} should be acceptable.

This was spotted by Animalparty on COM:VP. BMacZero (talk) 00:12, 22 January 2019 (UTC)[reply]

Opinions

Comments by Animalparty.

{{PD-USGov}} would be the most inclusive template, but is rather vague. More specific templates include {{PD-USGov-NPS}} and {{PD-USGov-Interior}}. Any Photographer field that says "NPS Staff" or "NPS Photo" (e.g. [298]) should automatically get PD-USGov-NPS.
I think {{Photograph}} or {{Information}} are fine, ideally with detailed semi custom fields for keywords, collection, location, etc., as seen in the Library of Congress images uploaded by User:Fæ (example).
The more pre- or auto-categorization, or at least clearly noting collection, yeear/decade, geographic unit, etc., the better, else we dump thousands of unsorted of images into already cluttered categories like Yosemite National Park.
There may be overlap with some material on Archives.gov , individual National Park Flickr feeds/websites, and such material already uploaded. But I think the value of the images uploaded at their largest file size and with curated metadata outweigh the inconvenience of some duplication.
Many files have geographical coordinates, but I suspect that many are generic coordinates of the center of the National Park or Monument, rather than being unique to the photograph.
Thanks for initiating this, sorry if these comments are basic/obvious to experienced mass uploaders. --Animalparty (talk) 01:29, 22 January 2019 (UTC)[reply]

On some more inspection, certain images may be a bit problematic in terms of copyright, namely works of art (e.g. paintings and sculptures) not explicitly credited to NPS employees, but that are nonetheless labeled "Public domain:Full Granting Rights". Some of these appear to be created by Artist-in-Residence programs (e.g. this gallery and this one), and from browsing elsewhere it appears that different parks may have different rules regarding copyrights. Rocky Mountain National Park states "Artists are also required to provide the copyright for this artwork to the National Park Service. The National Park Service will not allow the commercial use of any donated artwork once it is selected and accessioned into the Park's permanent museum collection", which is a restriction against public domain. Perhaps no art from Rocky Mountain was transferred to NPGallery? These 2 images from the U.S.S. Arizona memorial are labeled PD on NPGallery, yet on a different NPS page their status is ambiguous, with the included usage disclaimer "Multimedia credited with a copyright symbol (indicating that the creator may maintain rights to the work) or credited to any entity other than NPS must not be presumed to be public domain; contact the host park or program to ascertain who owns the material" (emphasis added).

Side note: I think every photograph I've viewed on NPGallery has the Copyright disclaimer "Permission must be secured from the individual copyright owners to reproduce any copyrighted materials contained within this website. Digital assets without any copyright restrictions are public domain.", but every file is also labeled Public domain in the Constraints Information.

Another snag I've noticed, just from browsing the term "Artist", are that some images are scans/photographs from newspapers that were most likely not originally created by Federal employees (although the derivative scans/photos are): for instance Louis Grell illustration album, with cartoons by Louis Grell published in World War I.[299] These are still PD via pre-1924 publication (and possibly by {{PD-USGov-Military}}), but it hinders accurate bot-designation of PD template.

And public domain rationale is ambiguous on this vido, with Copyright" "Photo courtesy of Betty Maya Foott, Colorado Plateau Dark Sky Cooperative" (so, probably not a federal employee), yet is nonetheless labeled "Public domain:Full Granting Rights". I may have just found a relative handful of exceptions. But there are also probably a good deal of historical photographs that are PD-1923 or PD-no-notice yet not US Government works. Perhaps a generic umbrella template similar to {{Flickr-no known copyright restrictions}} could be used to encapsulate different possibilities, like {{PD-NPGallery}}.

I think it would be a good idea to contact someone at NPGallery to double check that all media labeled public domain is in fact public domain, for some reason, especially when rationale is ambiguous or lacking. We also might want to consider not transfering the somewhat intimidating, potentially misleading Copyright message "Permission must be secured from the individual copyright owners to reproduce any copyrighted materials contained within this website. Digital assets without any copyright restrictions are public domain." This may be a liability disclaimer on NPGallery's end, but ideally, everything we transfer to Commons would be in the public domain, and so no permission need be secured. --Animalparty (talk) 11:45, 25 January 2019 (UTC)[reply]

Working on adapting my bot to handle this. I'll contact them, and also start with only things that are obviously PD. BMacZero (talk) 17:50, 9 February 2019 (UTC)[reply]

I e-mailed NPGallery a while back about the public domain statuses of images and neglected to share here. Unfortunately got a not-too-helpful response essentially saying that the licenses and attributions are not "consistent" and "there is not a good way to assure an asset id is truly in the public domain, or not". We'll have to figure out what types of signals we can rely on to decide whether {{PD-USGov-NPS}} or other templates apply. Of course, publication pre-1924 will be a good one to start. BMacZero (talk) 04:30, 11 April 2019 (UTC)[reply]

I'm currently harvesting a list of all the images. It's going a bit slow but it should only a take a few days. After that I'll start downloading the metadata, which may take several days. BMacZero (talk) 04:45, 12 April 2019 (UTC)[reply]

Ah, a shame about the inconsistent licensing criteria. I guess pre-1924 and files credited to "NPS staff" or similar can be prioritized for now. --Animalparty (talk) 19:13, 12 April 2019 (UTC)[reply]

Started downloading the item metadata. You can check on the progress on this fun page I made. BMacZero (talk) 15:49, 13 April 2019 (UTC)[reply]

BRFA filed (Commons:Bots/Requests/BMacZeroBot 6). BMacZero (talk) 05:35, 10 May 2019 (UTC)[reply]

Started uploading last night, will probably be ongoing for quite a while. See Category:Images from NPGallery to check to help with validation and categorization! – BMacZero (🗩) 16:35, 29 June 2019 (UTC)[reply]

Assigned to	Progress	Bot name	Category
User:BMacZero	In progress	User:BMacZeroBot	Category:Images from NPGallery to check

See Also

APPLAUSE

Source to upload from: https://www.plate-archive.org/applause/
- Do the media URLs follow a pattern? https://www.plate-archive.org/objects/dr.3/ + plates or logbooks or notes or envelopes + /101_xxxx/ (x is a variable number)

Does the site have an API? Yes: 101_xxxx (x is a variable number)
What else could ease uploading? (is the site valid XHTML, do they use a WCM…?) https://www.plate-archive.org/query/
Did you contact the site owner? No

Describe the works to be uploaded in detail (audio files, images by …): Historical astronomical plates, logbooks, envelopes or notes https://www.plate-archive.org/applause/info/gallery/ (we don't need to upload all, but I think the plates would be insteresting.
Which license tag(s) should be applied?

Plates: {{CC-0}} for example https://www.plate-archive.org/objects/dr.3/plates/101_3309/
Others: {{CC-BY-4.0}} for example https://www.plate-archive.org/objects/dr.3/logbooks/101_53/

The database is licensed under CC-0 (https://www.plate-archive.org/applause/project/disclaimer/)

Is there a template that could be used on the file description pages? Do you think a special template should be created? Yes, I think a template should be created.

Habitator terrae 🌍 16:37, 27 October 2018 (UTC)[reply]

Opinions

Assigned to	Progress	Bot name	Category

PauloGuedes

Source to upload from: Institution:Arquivo Municipal de Lisboa
- Do the media URLs follow a pattern? yes:

This url generates 94 results pages, each linking to 10 individual image pages. Each image page url is

http://arquivomunicipal2.cm-lisboa.pt/X-arqWeb/ContentPage.aspx?ID=code&Pos=1&Tipo=PCD

while the image in it is at

http://arquivomunicipal2.cm-lisboa.pt/X-arqWeb/ContentDisplay.aspx?ID=code&Pos=1&Tipo=PCD&Thb=0

with code being a 20-digit lower-case hex number — which has no bearing with the official identification references (cota — see below).

Does the site have an API? dunno

- What else could ease uploading? (is the site valid XHTML, do they use a WCM…?) consistent, machine-generated HTML (parsable, even if not necessarily valid)
- Did you contact the site owner? No
Describe the works to be uploaded in detail (audio files, images by …): Smallish batch (711, according to inventory, or 933, according with the database search report) of scanned b/w photos in various hardcopy formats.

Which license tag(s) should be applied? {{PD-old}} Creator:Paulo Guedes

Is there a template that could be used on the file description pages? Do you think a special template should be created? {{AMLx}}; it needs to be fed at least {{{cota}}} (given also as código de referência), a slashed crumbthread-like alphanumeric string of variable length; other values to be (trivially) extracted from each image page are:

Título
Assunto
Data(s)
Dimensão e suporte
Nota(s)
Cotas antigas or Cotas or Cota(s)

The filenames can be constructed from Título (possibly trimmed) and the two last crumbs of {{{cota}}}, in parenthesis, devoided of the slash (which is one of the Cotas)

-- Tuválkin ✉ ✇ 16:54, 30 June 2018 (UTC)[reply]

Never mind. The bunch of imcompetents at CMLarq changed their software and “of course” old urls wont work. As they are also copyfraud goons, the new search functionality throws us back to the 1970s and it’s even less usable. Better visit their facilities in Lisbon (now rehoused in a modern neighbourhood becuae their historic HQ had to be converted into a tourist trap) and fiddle around with a microfilm viewer or some such nonsense. -- Tuválkin ✉ ✇ 22:03, 5 August 2024 (UTC)[reply]

Opinions

Assigned to	Progress	Bot name	Category

VOA News files

Source to upload from: https://web.archive.org/web/*/https://www.voanews.com/mp3/voa/english/nnow/NNOW_HEADLINES.mp3
- Do the media URLs follow a pattern? They all have the same name. The date when archived is given in 14 digits, with the first eight digits being the year, month, and day respectively, with the remaining digits being the time of day archived, in UTC.
- Does the site have an API? Don't know.
- What else could ease uploading? (is the site valid XHTML, do they use a WCM…?) Don't know.
- Did you contact the site owner? No need to, since U.S. government works so public domain.

Describe the works to be uploaded in detail (audio files, images by …): VOA world news headline newscast audio files for (almost) every day spanning from 5 May 2009 to 6 July 2019.

Which license tag(s) should be applied? Template:PD-USGov-VOA

Is there a template that could be used on the file description pages? Do you think a special template should be created? Just use the standard one. Upload as "VOA News Headlines (MONTH DAY, YEAR)". If possible, upload them in FLAC, WAV, and OGG.

– Illegitimate Barrister (talk • contribs), 13:07, 26 May 2019 (UTC)[reply]

Opinions

Assigned to	Progress	Bot name	Category

HiRISE

Source to upload from: https://www.uahirise.org/catalog/
- Do the media URLs follow a pattern? Yes! (Based on the catalog ID)
- Does the site have an API? No!
- What else could ease uploading? (is the site valid XHTML, do they use a WCM…?)
  Full index tree of the images on the site is accessible via:
  https://hirise-pds.lpl.arizona.edu/PDS/RDR/ESP/
  
  With extra files in:
  https://hirise-pds.lpl.arizona.edu/PDS/EXTRAS/RDR/ESP/
  
  Each image file *.JP2 (sample [Big file]) accompanies the additional information in a separate label file *.LBL in PDS format (sample)
- Did you contact the site owner? Nope!

Describe the works to be uploaded in detail (audio files, images by …):
Images by HiRISE (High Resolution Imaging Science Experiment)

Which license tag(s) should be applied?

As explained in each image's description page for example: "All of the images produced by HiRISE and accessible on this site are within the public domain: there are no restrictions on their usage by anyone in the public, including news or science organizations. We do ask for a credit line where possible: NASA/JPL/University of Arizona"
PD-USGov-NASA or a variation of it to include JPL and University of Arizona must be used.

Is there a template that could be used on the file description pages? Do you think a special template should be created?

There is no template yet. It must be created to include all the relevant data e.g. Acquisition date, Latitude , Longitude , etc. from the label files.

Note: Due to JPEG2000 not being currently supported on Wikimedia Commons, a conversion to PNG is also needed. File sizes may be large!

Meisam (talk) 21:58, 20 June 2018 (UTC)[reply]

Opinions

Support Seems like an interesting project --Kristbaum (talk) 16:00, 8 May 2019 (UTC)[reply]
Info Template:PD-NASA-HiRISE has been created for these images! -- Meisam (talk) 17:31, 11 May 2019 (UTC)[reply]
@Meisam: - I am interested in pursuing this. I think it would be a logical extension of my work with uploading from ESRS. Do you have any suggestion as to how we most efficiently store the PDS data with each image? Askeuhd (talk) 08:28, 6 June 2022 (UTC)[reply]
@Askeuhd: I don’t have any good solutions. I suppose we can store them as tEXt chunks in the PNG image and also add them in a table (using wiki templates) to the image description page. -- Meisam (talk) 11:30, 6 June 2022 (UTC)[reply]
@Meisam: - I would personally much prefer the latter option, I fear that the former option may not be very user friendly. We will also have to parse as much of the data as possible to SDC. I will try to think of a suitable paradigm. Askeuhd (talk) 11:40, 6 June 2022 (UTC)[reply]

Assigned to	Progress	Bot name	Category

PDS data import proposal

Proposal for the import of PDS data for each image, to ensure as much as possible is added to SDC and necessary data for the user is displayed prominently in the wikitext.

I propose that the entire LBL file is imported as a collapsible text field in the template for each file, preserving all formatting and indentation, so that researches or other users familiar with the PDS format may be able to utilize plain text search for the values we will not be able to add to SDC, similar to this:

Raw Planetary Data System data
`PDS Content of LBL file`

In addition to this, I have broken down the example file, to try and maximize possible SDC data migration, as well as adding some of the data to a custom wikidata template for this particular import. I am highly interested in any suggestions. I take the libery of pinging @Multichill: as you have previously been very helpful in a similar endeavour with the ISS photos. I hope you would be interested in adding your valuable input here as well.

I will make a couple of example files in a few days or so to test the SDC structure and a potential template, before starting any peliminary coding, so the concepts can be tested out.

Reference to be set as stated in (P248) --> "Planetary Data System" for all SDC values imported from PDS.

All PDS identifiers can be looked up here for clarification. The LBL example file also contains some in-line comments.

Paradigm for importing PDS data - based on example ESP_053850_2170_RED.LBL
PDS Identifier	PDS Value in example file	Commons/SDC identifier	Commons/SDC value
PDS_VERSION_ID	PDS3	N/A	N/A
NOT_APPLICABLE_CONSTANT	-9998	N/A	N/A
DATA_SET_ID	"MRO-M-HIRISE-3-RDR-V1.1"	N/A	N/A
DATA_SET_NAME	"MRO MARS HIGH RESOLUTION IMAGING SCIENCE EXPERIMENT RDR V1.1"	part of the series (P179)	Appropriate wikidata-entity for this property (to be created)
PRODUCER_INSTITUTION_NAME	"UNIVERSITY OF ARIZONA"	affiliation (P1416) as a qualifier to creator (P170)	University of Arizona (Q503419)
PRODUCER_ID	"UA"	N/A	N/A
PRODUCER_FULL_NAME	"ALFRED MCEWEN"	creator (P170)	"some value" --> "Alfred McEwen"
OBSERVATION_ID	"ESP_053850_2170"	catalog code (P528) and {{NASA-image}}	"some value" --> "ESP_053850_2170", qualified with catalog (P972) and an appropriate wikidata-entity for this property (to be created).
PRODUCT_ID	"ESP_053850_2170_RED"	N/A	N/A
PRODUCT_VERSION_ID	"1.0"	N/A	N/A
INSTRUMENT_HOST_NAME	"MARS RECONNAISSANCE ORBITER"	location of creation (P1071) & location of the point of view (P7108)	Mars Reconnaissance Orbiter (Q183160)
INSTRUMENT_HOST_ID	"MRO"	N/A	N/A
INSTRUMENT_NAME	"HIGH RESOLUTION IMAGING SCIENCE EXPERIMENT"	captured with (P4082)	HiRISE (Q1036092)
INSTRUMENT_ID	"HIRISE"	N/A	N/A
TARGET_NAME	"MARS"	depicts (P180)	Mars (Q111)
MISSION_PHASE_NAME	"EXTENDED SCIENCE PHASE"	significant event (P793)	Appropriate wikidata-entity for this property (to be created)
ORBIT_NUMBER	53850	orbits completed (P1418)	value: 53850 possibly qualified with type of orbit (P522) --> areocentric orbit (Q3884965)
SOURCE_PRODUCT_ID	(ESP_053850_2170_RED0_0, ESP_053850_2170_RED0_1, ESP_053850_2170_RED1_0, ESP_053850_2170_RED1_1, ESP_053850_2170_RED2_0, ESP_053850_2170_RED2_1, ESP_053850_2170_RED3_0, ESP_053850_2170_RED3_1, ESP_053850_2170_RED4_0, ESP_053850_2170_RED4_1, ESP_053850_2170_RED5_0, ESP_053850_2170_RED5_1, ESP_053850_2170_RED6_0, ESP_053850_2170_RED6_1, ESP_053850_2170_RED7_0, ESP_053850_2170_RED7_1, ESP_053850_2170_RED8_0, ESP_053850_2170_RED8_1)	N/A	N/A
RATIONALE_DESC	"Monitoring new impact site"	to be added to {{En}} in main template - We might also go over all LBL files to search for obvious depicts (P180) statements	"PDS description: Monitoring new impact site"
SOFTWARE_NAME	"PDS_to_JP2 v3.19 (1.53 2012/01/24 03:07:27)"	I was unable to find an appropriate wikidata property here, but I feel like there should be one	?
OBJECT = IMAGE_MAP_PROJECTION
DATA_SET_MAP_PROJECTION	"DSMAP.CAT"	N/A	N/A
MAP_PROJECTION_TYPE	"EQUIRECTANGULAR"	I was unable to find the appropriate wikidata property for "projection", I might be looking in the wrong place. spatial reference system (P3037) was the closest I got	equidistant cylindrical projection (Q1326965)
PROJECTION_LATITUDE_TYPE	PLANETOCENTRIC	N/A (coordinates given by globe planetocentric Martian coordinates (Q106948918) are planetocentric)	N/A
A_AXIS_RADIUS	3389.5743490888 <KM>	N/A (simply the mean readius of Mars)	N/A
B_AXIS_RADIUS	3389.5743490888 <KM>	N/A (simply the mean readius of Mars)	N/A
C_AXIS_RADIUS	3389.5743490888 <KM>	N/A (simply the mean readius of Mars)	N/A
COORDINATE_SYSTEM_NAME	PLANETOCENTRIC	N/A (coordinates given by globe planetocentric Martian coordinates (Q106948918) are planetocentric)	N/A
POSITIVE_LONGITUDE_DIRECTION	EAST	N/A (coordinates given by globe planetocentric Martian coordinates (Q106948918) are east-positive)	N/A
KEYWORD_LATITUDE_TYPE	PLANETOCENTRIC	N/A (coordinates given by globe planetocentric Martian coordinates (Q106948918) are planetocentric)	N/A
POSITIVE_LONGITUDE_DIRECTION	EAST	N/A (coordinates given by globe planetocentric Martian coordinates (Q106948918) are east-positive)	N/A
KEYWORD_LATITUDE_TYPE	PLANETOCENTRIC	N/A (coordinates given by globe planetocentric Martian coordinates (Q106948918) are planetocentric)	N/A
CENTER_LATITUDE	35.000 <DEG>	See below	See below
CENTER_LONGITUDE	180.000 <DEG>	coordinates of depicted place (P9149) - not completely sure though, as it the example specifically comments that the location is the center of the projection not necessarily the center of the image. So it may not be so helpful to import this value as the coordinates of depicted place (P9149) - see bounding values below	`{ "latitude": 35, "longitude": 180, "precision": 0.001, "globe": "http://www.wikidata.org/entity/Q106948918" }`
LINE_FIRST_PIXEL	1	N/A	N/A
LINE_LAST_PIXEL	32134	N/A	N/A
SAMPLE_FIRST_PIXEL	1	N/A	N/A
SAMPLE_LAST_PIXEL	25483	N/A	N/A
MAP_PROJECTION_ROTATION	0.0 <DEG>	N/A	N/A
MAP_RESOLUTION	236636.93053097 <PIX/DEG>	angular resolution (P3439)	converted to milliarcseconds/pixel (1/236636.93053097*3600000) value: 15.21317907531, unit: milliarcsecond (Q21500224)
MAP_SCALE	0.25 <METERS/PIXEL>	I was unable to find an appropriate wikidata property here, something like "ground sample distance" or similar - I think it should be included as custom field in the wikitext template for each image, as it is a very commonly needed figure	N/A
MAXIMUM_LATITUDE	36.973920949851 <DEG>	coordinates of northernmost point (P1332)	`{ "latitude": 36.973920949851, "longitude": 148.23651113052, "precision": 0.000000000001, "globe": "http://www.wikidata.org/entity/Q106948918" }` <-- longitude set to westermost longitude clarified by syntax clarification (P2916) qualifier.
MINIMUM_LATITUDE	36.838131126084 <DEG>	coordinates of southernmost point (P1333)	`{ "latitude": 36.838131126084, "longitude": 148.36797304112, "precision": 0.000000000001, "globe": "http://www.wikidata.org/entity/Q106948918" }` <-- longitude set to easternmost longitude clarified by syntax clarification (P2916) qualifier.
LINE_PROJECTION_OFFSET	8749396.5 <PIXEL>	N/A	N/A
SAMPLE_PROJECTION_OFFSET	6157087.5 <PIXEL>	N/A	N/A
EASTERNMOST_LONGITUDE	148.36797304112 <DEG>	coordinates of easternmost point (P1334)	`{ "latitude": 36.973920949851, "longitude": 148.36797304112, "precision": 0.000000000001, "globe": "http://www.wikidata.org/entity/Q106948918" }` <-- latitude set to maximum latitude clarified by syntax clarification (P2916) qualifier.
WESTERNMOST_LONGITUDE	148.23651113052 <DEG>	coordinates of westernmost point (P1335)	`{ "latitude": 36.838131126084, "longitude": 148.23651113052, "precision": 0.000000000001, "globe": "http://www.wikidata.org/entity/Q106948918" }` <-- latitude set to minimum latitude clarified by syntax clarification (P2916) qualifier.
GROUP = TIME_PARAMETERS
MRO:OBSERVATION_START_TIME	2018-01-21T12:51:50.434	N/A	N/A
START_TIME	2018-01-21T12:51:50.582	N/A	N/A
SPACECRAFT_CLOCK_START_COUNT	"1201006358:10651"	N/A	N/A
STOP_TIME	2018-01-21T12:51:53.012	inception (P571) and date field in template	date used as value for wikidata property, full time string parsed to data field for date field in wikitext template
SPACECRAFT_CLOCK_STOP_COUNT	"1201006360:38785"	N/A	N/A
PRODUCT_CREATION_TIME	2018-01-25T05:01:36	publication date (P577) but I am not completely sure here	date used as value for wikidata property.
GROUP = INSTRUMENT_SETTING_PARAMETERS
MRO:CCD_FLAG	(ON, ON, ON, ON, ON, ON, ON, ON, ON, OFF, ON, ON, ON, ON)	N/A	N/A
MRO:BINNING	(1, 1, 1, 1, 1, 1, 1, 1, 1, -9998, -9998, -9998, -9998, -9998)	N/A	N/A
MRO:TDI	(128, 128, 128, 128, 128, 128, 128, 128, 128, -9998, -9998, -9998, -9998, -9998)	N/A	N/A
MRO:SPECIAL_PROCESSING_FLAG	(NOMINAL, NOMINAL, NOMINAL, NOMINAL, NOMINAL, NOMINAL, NOMINAL, NOMINAL, NOMINAL, "NULL", "NULL", "NULL", "NULL", "NULL")	N/A	N/A
GROUP = VIEWING_PARAMETERS
INCIDENCE_ANGLE	42.714413 <DEG>	N/A	N/A
EMISSION_ANGLE	0.434473 <DEG>	tilt (P8208)	value: 0.434473 --> unit degree (Q28390)
PHASE_ANGLE	42.502572 <DEG>	N/A	N/A
LOCAL_TIME	15.10520 <LOCALDAY/24>	N/A	N/A
SOLAR_LONGITUDE	118.215906 <DEG>	N/A	N/A
SUB_SOLAR_AZIMUTH	173.163664 <DEG>	N/A	N/A
NORTH_AZIMUTH	270.000000 <DEG>	N/A	N/A
OBJECT = COMPRESSED_FILE
FILE_NAME	"ESP_053850_2170_RED.JP2"	N/A	N/A
RECORD_TYPE	UNDEFINED	N/A	N/A
ENCODING_TYPE	"JP2"	N/A	N/A
ENCODING_TYPE_VERSION_NAME	"ISO/IEC15444-1:2004"	N/A	N/A
INTERCHANGE_FORMAT	BINARY	N/A	N/A
UNCOMPRESSED_FILE_NAME	"ESP_053850_2170_RED.IMG"	N/A	N/A
REQUIRED_STORAGE_BYTES	1637741444 <BYTES>	N/A	N/A
DESCRIPTION	"JP2INFO.TXT"	N/A	N/A
INTERCHANGE_FORMAT	BINARY	N/A	N/A
OBJECT = UNCOMPRESSED_FILE
FILE_NAME	"ESP_053850_2170_RED.IMG"	N/A	N/A
RECORD_TYPE	FIXED_LENGTH	N/A	N/A
RECORD_BYTES	50966 <BYTES>	N/A	N/A
FILE_RECORDS	32134	N/A	N/A
IMAGE	"ESP_053850_2170_RED.IMG"	N/A	N/A
DESCRIPTION	"HiRISE projected and mosaicked product"	Could potentially be added to {{En}} in description	N/A
LINES	32134	N/A	N/A
LINE_SAMPLES	25483	N/A	N/A
BANDS	1	N/A	N/A
SAMPLE_TYPE	MSB_UNSIGNED_INTEGER	N/A	N/A
SAMPLE_BITS	16	N/A	N/A
SAMPLE_BIT_MASK	2#0000001111111111#	N/A	N/A
SCALING_FACTOR	1.41615214363203e-04	N/A	N/A
BANDS	1	N/A	N/A
OFFSET	0.060336154982679	N/A	N/A
BAND_STORAGE_TYPE	BAND_SEQUENTIAL	N/A	N/A
CORE_NULL	0	N/A	N/A
CORE_LOW_REPR_SATURATION	1	N/A	N/A
CORE_LOW_INSTR_SATURATION	2	N/A	N/A
CORE_HIGH_REPR_SATURATION	1023	N/A	N/A
CORE_HIGH_INSTR_SATURATION	1022	N/A	N/A
CENTER_FILTER_WAVELENGTH	700 <NM>	Should be added to wikitext template along with FILTER_NAME as the images will be uploaded as PNG 16 bit grayscale	N/A
MRO:MINIMUM_STRETCH	3	N/A	N/A
MRO:MAXIMUM_STRETCH	1021	N/A	N/A
FILTER_NAME	"RED"	N/A	N/A

Additionally I propose the following the properties

media type (P1163) --> "image/png"
source of file (P7482) --> file available on the internet (Q74228490) --> ((described at URL (P973) --> value: url to LBL file) and (full work available at URL (P953) --> value: direct URL to JP2 file) and (operator (P137) --> University of Arizona (Q503419)) and perhaps (file format (P2701) --> JP2 (Q27979401)))
copyright status (P6216) --> public domain (Q19652) --> determination method or standard (P459) --> work of the federal government of the United States (Q60671452)
instance of (P31) --> photograph (Q125191)

@Meisam: --Askeuhd (talk) 16:08, 7 June 2022 (UTC)[reply]

freepd.com

Site contains production music tracks, in various genres, mp 3 format.

Source to upload from:

http://freepd.com/

- Do the media URLs follow a pattern?

None found. Tracks seem to be in sub-directories related to nominal genre, MP3 files are named for the track title apparently.

- Does the site have an API?

Unknown.

- What else could ease uploading? (is the site valid XHTML, do they use a WCM…?)

Unknown.

- Did you contact the site owner?

Site owner not contacted.

Describe the works to be uploaded in detail (audio files, images by …):

"Production music", in various genres., in MP3 format.

Which license tag(s) should be applied?

Site claims tracks are in the public domain:- http://freepd.com/faq.html ; However some of these tracks were previously under CC-BY on the site owners other site at incompetech.

Is there a template that could be used on the file description pages? Do you think a special template should be created?

{{Information}} with additional field as was done on the previous batch upload for incompetech.

ShakespeareFan00 (talk) 10:20, 18 December 2017 (UTC)[reply]

Opinions

Assigned to	Progress	Bot name	Category

Commons:Batch uploading/timbeek.com/

Source to upload from:

http://timbeek.com/ in particular music tracks listed in http://timbeek.com/royalty-free-music/isrc/

- Do the media URLs follow a pattern?

No general pattern, but there's a master list (not sure if it's complete) of track pages here - http://timbeek.com/royalty-free-music/isrc/, Donwload links in the UI seem to link to numbered subdirectories, but general pattern undetermined or not obvious.

- Does the site have an API?

Unknown.

- What else could ease uploading? (is the site valid XHTML, do they use a WCM…?)

Unknown

- Did you contact the site owner?

Site owner not contacted.

Describe the works to be uploaded in detail (audio files, images by …):

A Small set of 'production music' tracks, in assorted genres.

Which license tag(s) should be applied?

See: http://timbeek.com/royalty-free-music/license/ , assuming attribution requirments are met the music appears to be under CC-BY 4.0. (see also: http://timbeek.com/royalty-free-music/faq/ and http://timbeek.com/royalty-free-music/copyright/)

Is there a template that could be used on the file description pages? Do you think a special template should be created?

{{Information}} with additional fields as was previously implemented for the incomptech.com batch upload(this site seems to use a simmilar approach).

ShakespeareFan00 (talk) 19:05, 15 December 2017 (UTC)[reply]

Opinions

Assigned to	Progress	Bot name	Category

Images of listed buildings by Stephen Richards on Geograph.org.uk

Source to upload from: http://www.geograph.org.uk
- Do the media URLs follow a pattern? Yes: http://www.geograph.org.uk/photo/[ID]
- Does the site have an API? Yes: http://www.geograph.org.uk/help/api
- What else could ease uploading? (is the site valid XHTML, do they use a WCM…?) Don't know
- Did you contact the site owner? No need
Describe the works to be uploaded in detail (audio files, images by …):

All photographs of listed buildings by this user are of high quality and are tagged [listed building]. They would be very useful to have on Commons as every listed building has an item on Wikidata. I'd like them to be uploaded en masse and given the categories Category:Listed buildings in [county or London borough] and Category:Images by Stephen Richards. I could then further refine the listed building categories manually. However, the terms "Grade I", "Grade II*" and "Grade II" (the three listing grades for buildings in England and Wales) appear in the image descriptions, so is there a way that these could be picked out and used to categorise the images on Commons?

Which license tag(s) should be applied?

Is there a template that could be used on the file description pages? Do you think a special template should be created?

Ham II (talk) 19:50, 16 November 2017 (UTC)[reply]

Opinions

@Ham II: first time I notice this. The GeographBot is uploading again (for quite some time already). It started at one many years ago and it's now at 3645078 which was contributed 9 September, 2013. It's slowly catching up and at some point all the files you were looking for will also be uploaded. Multichill (talk) 09:31, 17 July 2022 (UTC)[reply]

Assigned to	Progress	Bot name	Category

USDA NRCS Plants Database

Source to upload from: http://plants.usda.gov/
- Do the media URLs follow a pattern? Yes.
- Does the site have an API? No.
- What else could ease uploading? (is the site valid XHTML, do they use a WCM…?) valid XHTML
- Did you contact the site owner? No.

Describe the works to be uploaded in detail (audio files, images by …): Public domain: 10771 photos and 7064 line drawings, with species information for categorization. There are other copyrighted images as well, some of which may be freely licensed.

Which license tag(s) should be applied?

Is there a template that could be used on the file description pages? Do you think a special template should be created?

Opinions

@Guanaco: There is a lot of copyrighted material within these images, e.g. [300] [301]. (Just because this is a U.S. government web site this does not mean all the material is U.S. government material and by this means freely usable!) Actually I have not found too many images that really can be used (e.g. [302]). You should at least provide a procedure how to distinguish between copyrighted and free material. --Reinhard Kraasch (talk) 11:02, 9 July 2017 (UTC)[reply]

@Reinhard Kraasch: The gallery search function [303] has a filter by copyright status. [304]

I've found that the URLs linked by the thumbnails provide species information within <title>: https://plants.usda.gov/core/profile?symbol=HACA2&photoID=haca2_003_ahp.jpg#

and correspond to the URLs of the actual files: https://plants.usda.gov/gallery/pubs/haca2_003_php.jpg
as well as the URL with copyright status and recommended attribution info: https://plants.usda.gov/java/usageGuidelines?imageID=haca2_003_ahp.jpg

The search is navigable with &page=2, 3, 4, etc.

I'm actually interested in scripting this myself now, though it would be my first batch upload task. Guanaco (talk) 14:23, 9 July 2017 (UTC)[reply]

@Guanaco: Well, just go on... On the other hand it always is a good idea to have a second opinion with such a batch upload - especially for the non-technical aspects. --Reinhard Kraasch (talk) 20:52, 10 July 2017 (UTC)[reply]

Assigned to	Progress	Bot name	Category

US National Archives

I am hoping to begin a bulk upload of media from the US National Archives in the next few weeks. This will be a very different approach from the first upload, which was based on uploading files from an offline drive and scraping HTML for the metadata. This time around, NARA has an API for our online catalog, and so I am building a bot, using mwclient, to upload using the live metadata and files from the API. Some details:

Dataset

The dataset includes all PD materials at https://catalog.archives.gov (API: https://catalog.archives.gov/api/v1). I plan to begin with a series of ~100,000 WWI-era photos. Technically, there are over 15 million files (and counting) in this dataset.

File names

The script is currently configured to name files with the formula: For single-page items:

"File:[TITLE] - NARA - [NAID].ext"
Where "[TITLE]" is the catalog record's title field, and "[NAID]" is the National Archives Identifier. If this is over the character limit, "[TITLE]" is automatically truncated, with "(...)" appended.

For multi-page items (since the above formula would give all files belonging to one catalog record the same title):

"File:[TITLE] - NARA - [NAID] (page X).ext"

Metadata

We are developing a custom metadata mapping, since NARA does not adhere to a metadata standard. You can see the metadata template we use here: {{NARA-image-full}}. Some notes:

While all the records in this catalog come from NARA or partner institutions, there are many different facility locations, and some NARA facilities have their own institutions templates already (e.g. US presidential libraries). Therefore, I am creating institution templates to go along with all NARA locations, and the script will insert the correct institution template based on a mapping.

NARA's authority file is not yet mapped to Wikidata, however that is definitely something that would be useful in the future. For now, we will upload files with NARA's creator and author names and their NAIDs and links back to the catalog authority record. However, including the NAIDs in a Commons template field means that in the future, Wikidata could be used to make creator templates appear instead. Any help with this would be appreciated.

Licenses

Because NARA records are nearly all (>99%) derived from the records of US federal agencies, these uploads will use {{PD-USGov}} or its subtemplates. Most NARA records are in one of about 600 record groups based on their creating agency, so I am using a mapping of NARA record groups to Commons PD-USGov templates so that the bot can apply the more specific agency templates in most cases. Help filling out this mapping would be appreciated.

Nearly all holdings of the US National Archives are in the public domain as a work of the federal government (or, otherwise, due to age). This is marked in the "use restriction" field in the catalog, with a value of "Unrestricted" indicating public domain determination by the archivists. Therefore, the script will be configured to skip over any records in which the use restriction is anything other than "unrestricted" (even "possibly" ones, which could ultimately be PD, but need a human determination).

Categories

All uploads will be automatically categorized by the metadata template into Category:Media contributed by the National Archives and Records Administration and a category for the series they belong to (such as Category:US National Archives series: DOCUMERICA: The Environmental Protection Agency's Program to Photographically Document Subjects of Environmental Concern, compiled 1972 - 1977). Eventually, the script will be designed to create the series category if a file is uploaded for a series which does not yet have one.

When it comes to topical categories, past NARA uploads utilized the {{Uncategorized}} tag to encourage the community to add topical tags. However, since this creates work for the community, I am planning this time around to run uploads a small batch (hundreds to a few thousand) at a time, so I can upload them with one or more topical categories that apply to all records in the batch, rather than uncategorized.

Code

You can find the upload bot's code at https://github.com/usnationalarchives/wikimedia-upload. This project is being developed in public on NARA's official GitHub account. I would welcome collaboration (pull requests or otherwise) there. In addition, the Commons community is welcome to file issue reports on that repo.

Examples

The most recent test uploads can be viewed in Category:US National Archives series: American Unofficial Collection of World War I Photographs. I am still polishing the upload script, but these examples essentially represent what should be expected from the bot once it gets started.

Opinions

The bot account is technically already flagged from the last bulk upload a couple of years ago, however I would like to submit the current plan to community review before restarting uploads. If there are any opinions on the bot's design or the format of uploads or other issues, I am happy to hear them. We'd also like to know whether to limit what is uploaded in any way—as in, would Commons actually be interested in 15 million files, or might some of these, like the millions of census cards, not be of interest. Also, if anyone is interested in helping out with the coding or other tasks, please feel free to let me know. This is a big undertaking. Thanks! Dominic (talk) 17:25, 31 May 2017 (UTC)[reply]

Assigned to	Progress	Bot name	Category
User:Dominic	Coding	User:US National Archives bot	Category:Media contributed by the National Archives and Records Administration

ESA-Rosetta-NAVCAM

Source to upload from: http://imagearchives.esac.esa.int/index.php?/recent_pics
- Did you observe an URL pattern? See http://imagearchives.esac.esa.int/index.php?/page/rosetta_navcam
- Do you know whether the site has an API
- What else can ease uploading (is the site valid XHTML, WCM they use…)?
- Did you contact the site owner? No.

Describe the works to be uploaded in detail (audio files, images by …):

Images the comet 67P/CHURYUMOV-GERASIMENKO by the NAVCAM on the Rosetta spacecraft.

Which license tag(s) should be applied? ESA/Rosetta/NAVCAM – CC BY-SA IGO 3.0 (see {{ESA-ROSETTA-NAVCAM}} for the specific license template.)

Is there a template that could be used on the file description pages? Do you think a special template should be created?

Yann (talk) 14:32, 6 June 2015 (UTC)[reply]

Opinions

Assigned to	Progress	Bot name	Category

USC Cinema

Source to upload from

https://archive.org/details/usc-sound-effect-archive

License

The files in this collection claim to be licensed CC BY 4.0. This is not true--all this archive is was someone collating the files and uploading them there. These files were all uploaded by Craig Smith on freesound.com under a CC0. The Gold and Red files are a valid {{CC0}} as Craig Smith works for USC. The Sunset Editorial files are all either {{PD-US-defective notice}} or {{PD-US-defective notice-1978-89}}. The notices are defective because according to the linked blog, all SSE ever got was a credit line. The company was no longer active by 1989, and I checked and there are no copyright registrations under SSE's name. The publication years of the sound effects however, are unknown, so I plan on tagging everything with PD-US-defective notice-1978-89.

Description

This is a set of audio files by the University of Southern California and Sunset Editorial consisting of the original recordings of sound effects used in movies from the 60s to 80s; a few of these sound effects are very famous (like the Wilhelm Scream). This file conveniently maps all the sound effects with a metadata .csv file with descriptions and upload dates and everything, so setting up a batch upload isn't too difficult. I'm prepared to do this upload myself.

Snowmanonahoe (talk) 02:21, 27 May 2023 (UTC)[reply]

Opinions

Assigned to	Progress	Bot name	Category

Old requests (before 2020-01-01)

Batch uploads in progress

Batch uploads on hold

Done (to be moved to past batch uploads)

Failed

Scripters

Multichill (talk · contribs)
Jarekt (talk · contribs)
Slick (talk · contribs) - no audio/video
Fæ (talk · contribs) - see project list
Husky (talk · contribs)
DaxServer (talk · contribs)

Currently inactive

TheDJ (talk · contribs)
Duesentrieb (talk · contribs)
Aude (talk · contribs) - including batch audio & video uploads
Basvb (talk · contribs)

Tools

See Commons:Upload tools. The Python Wikipedia Bot framework supports image uploads and is particularly versatile.
Commonist - free Java program to upload large numbers of files to Commons
d:Help:QuickStatements - tool for batch upload of metadata to Wikidata, which can be than accessed by {{Artwork}} and other templates.
Flickrripper allows batch uploading from a set, group or a user id on flickr.

Scripts, Examples and Information

the scripts I using on jobs here and here
a bash script to extract the VRINs on (U.S. military) pictures on commons, can very usefull to find duplicate before upload
Details about 'Zoomify' images and how to get it (in German)
Howto import images from news.kremlin.ru: import news.kremlin.ru news gallery.sh & import news.kremlin.ru photo gallery.sh
Another option so to download the images to your local machine, then upload with Pattypan.