Commons talk:Structured data/Archive 2024

Category:Commons talk archives#Structured%20data

Aerial photographs

How do you identify whether an aerial photo was taken from an airplane, a balloon or a drone? XRay 💬 19:20, 1 February 2024 (UTC)

By "identify" I assume you mean "indicate (if known)", not how you work it out from looking at the photo (which often you can't). - Jmabel ! talk 20:28, 1 February 2024 (UTC)
You're right, indicate. --XRay 💬 20:38, 1 February 2024 (UTC)
I'm a little surprised we don't have a property corresponding to photographic technique (Q1439691). Then you could set the value of that to drone photography (Q12033087) (for example). Can anyone see a way to do an an equivalent of that? - Jmabel ! talk 20:36, 1 February 2024 (UTC)
Please have a look to Category:Aerial photographs by camera vehicle. How to model camera vehicle? --XRay 💬 20:42, 1 February 2024 (UTC)
Slightly off topic, but shouldn't Category:Aerial photographs by Smart Drone‎ be a subcat of Category:Unmanned aerial vehicle aerial photography? - Jmabel ! talk 01:56, 2 February 2024 (UTC)
@XRay: It has been suggested on Wikidata that we could use genre (P136) with drone photography (Q12033087). - Jmabel ! talk 16:26, 4 February 2024 (UTC)
This is logical and understandable. However, another value that is often used is more general: aerial photography (Q191839). You can't tell from the value whether it was a drone, an airplane, a hot air balloon or something else. --XRay 💬 18:14, 4 February 2024 (UTC)
captured with (P4082) exists for drones and the like. Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 22:34, 4 February 2024 (UTC)

How to depict flexed muscles with expression, gesture or body pose (P6022)?

How do you recommend modelling flexed muscles with expression, gesture or body pose (P6022) for photos like this one or other photos in Category:Biceps brachii? ? Lovelano (talk) 03:12, 5 March 2024 (UTC)

instead of p6022, try
depicts (P180)
Preferred rank Erin Cummings
shown with features (P1354) supination of the forearm
1 reference


add value

perhaps? Arlo James Barnes 08:14, 8 March 2024 (UTC)

Some more examples might be helpful.

I've been adding SDC, but I'm really not sure if I'm doing it right. Could I have some feedback, and is there a place for more examples? (I looked at Commons:Structured data/Modeling/Depiction.)

See File:Cape Air ticket counter at MWA airport.jpg; I've described it as a ticket counter, which is part of the specific airport. Is that the best way to express this?

I've added a captured with statement for the drone model in File:Caspersen Beach aerial view.jpg, but this is showing a warning. An I doing it wrong, or should this be automatically populated from EXIF or categories like Category:Taken with DJI FC220 anyway? (Is that happening in some cases but not others?)

For construction equipment like File:Caterpillar backhoe loader at construction site in Sunnyvale, back view.jpg, I get a warning when I say it depicts the generic type of equipment, and specify the make as the manufacturing company. I'm also not sure how to specify the model number. See also File:Google self-driving Lexus RX450h rear view.gk.jpg; specifying the make and model of a car seems reasonable, but I get a warning with the make and didn't know how to specify the model.

Thanks in advance to anyone who can help. I'm intrigued by the tools, but I feel like I'm using them... not quite right. grendel|khan 11:11, 15 March 2024 (UTC)

what property to mark a file that contains use of german sign language?

How do I add info about use of german sign language to a foto or to a video? C.Suthorn (@Life_is@no-pony.farm - p7.ee/p) (talk) 17:18, 25 December 2023 (UTC)

1. Click "Add statement".
2. Add property language of work or name (P407).
3. Add item German Sign Language (Q33282).
Lovelano (talk) 21:25, 4 March 2024 (UTC)
done, https://commons.wikimedia.org/w/index.php?title=File:Diversity_2023_25_(video_capture_001).png&diff=prev&oldid=861811635 ff. C.Suthorn (@Life_is@no-pony.farm - p7.ee/p) (talk) 13:59, 19 March 2024 (UTC)

Garbage "tags" still being added in bulk

So this is what the universe looks like. And NASA.

Well-intentioned users are still being encouraged by apps to stuff the depicts statement with garbage "tag" values.

For instance this recent edit tagged the above image of part of the surface of the Earth from the ISS as depicting the ISS, "Expedition 22", "outer space", NASA, and the universe. The edit summary includes #suggestededit-add-tag 1.0" and the edit is tagged "Mobile edit Mobile app edit Android app edit Suggested Edits edit".

How do we get this stopped? Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 17:21, 10 March 2024 (UTC)

Seconding this request. I find myself clearing crap like this off of the file pages of my pictures every day or two. How much of this must be accumulating on file pages that no one is watching? - Jmabel ! talk 18:33, 10 March 2024 (UTC)
Strange, I never had crap tags added to the files I uploaded (not to say it is impossible, but it never happened to my pictures). Ymblanter (talk) 18:35, 10 March 2024 (UTC)
Better instructions in the apps or a visual example? I'm speculating here but if people see a blank page they have no idea how to use correct properties so something to steer people towards right approach would be useful? Let's say people want to add "creation date" but the property they need is called "inception" - something to steer people towards right property might be useful. Another thing entirely is how well-made the translations for different languages are.. Ipr1 (talk) 21:29, 21 March 2024 (UTC)
Imagine being a new user and you see an empty box on the page, so maybe you would assume that is where you put things and magic happens. Adding a statement? A claim? A qualifier? It takes a more experienced users to know what those are and how they relate to values they see on the page. Ipr1 (talk) 21:40, 21 March 2024 (UTC)
I frequently have crap tags added to my pictures, which I remove.
It isn't a matter of better teaching clueless newbies what is a good value to add as a "depicts" on someone else's picture. It's a matter of not encouraging clueless newbies to add these. - Jmabel ! talk 07:01, 22 March 2024 (UTC)

Subtitles hardcoded into video track?

Secretary Janet Yellen on the Child Tax Credit

maybe there should be some properties about subtitles embedded in the video track? in contrast to subs from timed text files? RZuo (talk) 19:31, 23 March 2024 (UTC)

What properties to use to state conservation status and types of damage of old photographs

Hello! I'm trying to add structured data to several old photographs of my city we've uploaded to Commons, like this one: https://commons.wikimedia.org/wiki/File:Plaza_Constituci%C3%B3n_-_0001FMHC.tif I have a big dataset several datapoints available, including "conservation status" (good, bad or regular) and "types of damage" (like silver mirroring Q112581162, mutilation, scratchs, inscriptions...). Apparently all these data follows the General International Standard Archival Description conventions, but I still can figure out which Wikidata properties should I use. Any ideas? Thanks! Piracalamina (talk) 15:58, 4 April 2024 (UTC)

Tool to copy sdc from file to file

is there one for problems like https://commons.wikimedia.org/w/index.php?title=Commons_talk:CropTool&oldid=867301107#Structured_Data ? we need a tool like this. RZuo (talk) 07:31, 12 April 2024 (UTC)

Maybe d:MediaWiki:Gadget-moveClaim.js could be adjusted to do this, @Matěj Suchánek, Melderick? --Marsupium (talk) 16:22, 15 April 2024 (UTC)

Aerial point of view

What would be the appropriate way to depict with structured data that a photograph was taken from an aerial point of view? Sdkbtalk 15:54, 16 April 2024 (UTC)

Commons_talk:Structured_data/Modeling/Location#Aerial_photographs have some dicussion related to this. -- Zache (talk) 17:10, 16 April 2024 (UTC)
Thanks for the link! Let's centralize discussion there. Sdkbtalk 18:38, 16 April 2024 (UTC)

Maintenance category broken?

Quality images missing SDC Commons quality assessment currently has over 14,000 files in it. But from a random sample of 15 files in this category, I found 9 to actually already include the statement. Clearing the category's cache didn't help. This makes that category not really helpful any more. Any ideas, what could cause this problem?

@Mike Peel, @Dschwen: could QICbot possibly set Commons quality assessment (P6731) Wikimedia Commons quality image (Q63348069) upon promotion and make this maintenance category obsolete? MB-one (talk) 10:34, 11 March 2024 (UTC)

This was being done by @Multichill's bot, I'm not sure why that stopped? I haven't figured out how to set SDC values using pywikibot/the API yet, it's on my to-do list, but I'm not sure when I'll get to it. Thanks. Mike Peel (talk) 11:27, 11 March 2024 (UTC)
@Mike Peel Maybe that helps you to get started: User:SchlurcherBot/commonsapiaddclaimsoauth Let me know anything else you need. --Schlurcher (talk) 06:55, 23 May 2024 (UTC)
Indeed https://commons.wikimedia.org/w/index.php?sort=last_edit_desc&search=-haswbstatement%3AP6731+hastemplate%3AQualityImage+filetype%3Abitmap&title=Special%3ASearch&profile=advanced&fulltext=1&advancedSearch-current=%7B%22fields%22%3A%7B%22hastemplate%22%3A%5B%22QualityImage%22%5D%2C%22filetype%22%3A%22bitmap%22%7D%7D&ns6=1 shows far fewer results and no false positives so far. MB-one (talk) 18:46, 11 March 2024 (UTC)
@MB-one and Mike Peel: Jobs User:BotMultichillT used to run have been disabled by the Wikimedia Foundation, see phab:T319912. Didn't really have the motivation to fix it yet. Multichill (talk) 17:40, 15 March 2024 (UTC)
@Multichill: so do I understand correctly that you decided that with this change of technical environment, you are abandoning all your bot tools and have no interest in migrating them? Are you willing to work at all with some other user who might be interested in migrating them? I suspect there are a fair number of these someone would like not to lose. - Jmabel ! talk 18:40, 15 March 2024 (UTC)
I've had to convert several of my bot scripts over to the new system (e.g., setup script for the venv, then new job submission script, with the run script loading the venv), would be happy to help here if you need. Although, this is why I run most of my scripts on an environment I have full control over... Thanks. Mike Peel (talk) 18:59, 15 March 2024 (UTC)
Update: Multichill's bot BotMultichillT has started thankfully started to tag images in Quality images. But Quality images missing SDC Commons quality assessment still lists about 5,000 files at any given time, although the actual number of files missing this tag hasn't exceeded around 150 in a month now. I wonder why.--MB-one (talk) 13:37, 22 May 2024 (UTC)
@MB-one: There seems to be something amiss due to Template:Quality image and its categorisation and the data it pulls from the WD ether. The metadata may need to be refreshed, however, it is so not visible and is beyond the scope of normal people to push purge, view, manipulate.  billinghurst sDrewth 02:04, 23 May 2024 (UTC)
Based on my experience you have to perform a null edit as a purge is not sufficient. I can do this ealy next week for the category. --Schlurcher (talk) 06:51, 23 May 2024 (UTC)
Yeah, I'm slowly recovering some of the bots. Category:Quality images missing SDC Commons quality assessment is empty now and looks like all quality images have the statement. Multichill (talk) 11:42, 24 May 2024 (UTC)

Proposal to add perceptual hashes to SDC

Just FYI, I made a Village pump proposal for adding pHash checksum (P9310) and Imagehash difference hash (P12563) values to commons images. Currently, FinnaUploadBot have been adding these only to images uploaded from Finna.fi --Zache (talk) 17:00, 17 May 2024 (UTC)

So this is essentially a front-end to Wikidata?

Do I understand this correctly: every file on Commons is also a node (item?) on Wikidata, in the sense that it is assigned a QID, and thus you can start making statements about it — the structured data tab is essentially a front-end to the Wikidata database? The main difference to editing statements on Wikidata appears to be that this interface does not show the subject QID anywhere! Or are you using other URIs as resource identifiers? (Being explicit with such technical details would have helped me understanding what this was all about.)

Do you have any examples of statements relating one file to another? (I didn't encounter anything like that, browsing a few of the examples in the list of properties.) Practical application: relating different BSicons to each other. 88.129.117.158 14:59, 7 July 2024 (UTC)

No, each image does not have a QID. Each media item instead has an identifier consisting of "M" followed by its page ID, but as you say the interface mostly hides this. I don't think there's any way to relate files to one another directly; I've certainly never seen one. Notably, when Wikidata wants to refer to Commons files, it does it by name and not by using the "M" IDs. --bjh21 (talk) 15:31, 7 July 2024 (UTC)

Bot to remove duplicate statements

Hi folks, sometimes we end up with duplicate statements on files. This often happens when bots edit the same file twice. I set up bot to clean that up about once a month (code). It goes through the latest dump at https://dumps.wikimedia.org/commonswiki/entities/ (the copy on Toolforge) and removes any duplicate statements. It hashes the statement for quick processing so it doesn't catch cases (yet) like this one where the qualifiers are in different order. Multichill (talk) 20:29, 7 July 2024 (UTC)

List of bots involved

i think it'd be good for information, reference and documentation purposes to have a page that lists all bots and how they are involved in editing sdc. then we will know, whether a specific task in mind is already taken charge of, or if any bot malfunctions which tasks are not carried out, etc. RZuo (talk) 10:41, 6 August 2024 (UTC)

for example, i want to know now whether some bot is already in charge of migrating accidental addition of coordinate location (P625) to location of creation (P1071). RZuo (talk) 10:43, 6 August 2024 (UTC)
See Commons:Structured_data/Modeling#Bots --XRay 💬 10:55, 6 August 2024 (UTC)
thx a lot.
also i made a mistake above. should be moving p625 to coordinates of the point of view (P1259) instead of p1071. RZuo (talk) 11:57, 6 August 2024 (UTC)

Cannot find haswbstatement:P1259

https://commons.wikimedia.org/w/index.php?search=haswbstatement:P1259 shows nothing despite File:Martin-Luther-Kirche Behringen 2024-04-09 03.jpg for example. why? this shouldnt be expected, right? RZuo (talk) 11:58, 6 August 2024 (UTC)

@RZuo: Ah yes, I remember looking at this before. I think CirrusSearch can only see properties whose values are text or Wikidata items. If you use the API to get cirrusbuilddoc for that page (API sandbox link) you'll find that statement_keywords contains source of file (P7482), copyright license (P275), copyright status (P6216), captured with (P4082), media type (P1163), and checksum (P4092), but not creator (P170), inception (P571), data size (P3575), or coordinates of the point of view (P1259). bjh21 (talk) 17:37, 6 August 2024 (UTC)
thx a lot for the tips! RZuo (talk) 19:36, 6 August 2024 (UTC)
You can have a look to the cirrusdump: --XRay 💬 20:04, 6 August 2024 (UTC)

Inception for files where only Upload date is available

Is there a preferred way to add SDC inception (P571) claims for files with unknown creation date, but known {{Upload date}}? As far as I see, there's no "upload date" qualifier on Wikidata. Is it ok to simply use the upload date for inception (P571)? Fl.schmitt (talk) 20:30, 22 August 2024 (UTC)

@Fl.schmitt: how about "unknown value" with qualifier latest date (P1326) and the upload date. - Jmabel ! talk 04:30, 23 August 2024 (UTC)
@Jmabel - that's a good solution! I've used it already at Karte Bodensee Birnau.png - and {{Information}} is able to use latest date (P1326) as default if there's no date parameter - great! I wasn't able to set "unknown date" manually, but pywikibot was able to create such a claim with snaktype "somevalue". Fl.schmitt (talk) 07:16, 23 August 2024 (UTC)

VIRIN property proposal

See d:Wikidata:Property proposal/VIRIN. Multichill (talk) 13:27, 24 August 2024 (UTC)

Test environment for SDC?

Is there a installation for testing SDC? I found https://test.wikipedia.org but that doesn't have SDC. https://test-commons.wikimedia.org seems to be closed.
 ∞∞ Enhancing999 (talk) 19:06, 2 September 2024 (UTC)

On https://commons.wikimedia.beta.wmflabs.org many testing should be possible but you might need to request rights on Beta Wikidata to create properties. GPSLeo (talk) 19:31, 2 September 2024 (UTC)

YouTube SDC

Please see Commons:Bots/Work requests#Add P1651 YouTube video ID structured data from "source" attribute of Filedesc template for the sample schema, which is more or less identical to Commons:Structured data/Modeling#Upload from a platform like Panoramio, Geograph or Flickr -- DaxServer (talk) 10:32, 13 September 2024 (UTC)

SDC OpenStreetMap way ID (P10689) / OpenStreetMap relation ID (P402) / OpenStreetMap node ID (P11693)?

It seems that there's no consensus yet on using OpenStreetMap way ID (P10689) (or OpenStreetMap relation ID (P402) and OpenStreetMap node ID (P11693), respectively) in SDC. Using those properties would allow using SPARQL queries based on OSM IDs, like https://w.wiki/B3sq or https://w.wiki/B3su - with the advantage that such a query could return multiple views of an OSM map feature (similar to Google Maps Images for a map object). What's your opinion on this proposal? Fl.schmitt (talk) 06:19, 30 August 2024 (UTC)

Better example: https://w.wiki/B3t6 ("Big Ben", London) Fl.schmitt (talk) 06:33, 30 August 2024 (UTC)
Query for current usage of P10689, grouped by OSM ID: https://w.wiki/B3tC Fl.schmitt (talk) 06:37, 30 August 2024 (UTC)
If i remember correctly, the opposition to using OSM identifiers in Wikidata was that the OSM identifiers weren't stable. The proposed method then was to add Wikidata items to OSM and link from OSM to Wikidata. This would also create a permanent identifier to OSM for an entity. However, the problem with this approach was that it is impossible to know in the wiki Lua/template code if there was anything on the OSM side, which is a problem when the template creates links to OSM or uses OSM location in maps. Soving this for inwiki use would require adding it to software (Lua/Wikidata) and afaik only workaround is to add these as properties. In SPARQL however images can be queried using federated queries ( https://w.wiki/B3tZ or https://w.wiki/B3tt), but with performance penalty. --Zache (talk) 07:05, 30 August 2024 (UTC)
Photos don’t have OSM way/relation/node IDs, only the depicted places have them. Therefore I don’t think these properties should be used directly in SDC: add them to the appropriate Wikidata item and link to that using depicts (P180). Those can also be queried, using federation:
#defaultView:ImageGrid
select ?place ?placeLabel ?thumb with {
  select * {
    service <https://query.wikidata.org/sparql> {
      bind('54486345' as ?way_id).
      ?place wdt:P10689 ?way_id;
             rdfs:label ?placeLabel.
      filter (lang(?placeLabel) = "en").
    }
  } 
} as %places where {
  include %places. 
  ?image wdt:P180 ?place;
         schema:url ?thumb.
}
Try it! (By the way, as you can see from these results, your example of finding the Big Ben / Elizabeth Tower by the ID https://www.openstreetmap.org/way/54486345 is wrong: it’s the nearby St Margaret’s Church. This doesn’t invalidate your example, but it does mean that the files your query finds have incorrect SDC and should be fixed.) —Tacsipacsi (talk) 13:35, 1 September 2024 (UTC)
@Tacsipacsi: Thanks a lot - that's in fact interesting, I didn't check the OSM ID for the Big Ben example. Good hint, I will look at it. Regarding "Photos don’t have OSM way/relation/node IDs, only the depicted places have them": Here I disagree - that's exactly the point of my question. Of course, there's no such thing like a 1:1 relation between a photo and it's object (insofar I agree). But there are 429,892 OSM tags which have Commons Files or Categories "attached". So while OSM entities have Commons content (and Categories/Files can reference OSM objects using {{On OSM}} or {{OSMLink}}), there seems to be a practical need for such relations. Usage Bot has collected more than 200,000 commons files used on OSM. I admit that OSM identifiers may not be stable, but is this a practical problem? This looks like a standard maintenance task to be done by a bot - periodically check such ID references for validity.
Regarding depicts (P180): Using that property in SDC works only in cases if there's a wikidata item as target. But on OSM, the wikimedia_commons or image attributes are used on objects like hiking sign posts, wayside shrines or other "non-notable" things. Especially for hiking sign posts (but also for fountains, sculptures and other 3d objects), it would be very useful to have multiple images available, showing different perspectives of that object. On OSM, there's currently no "recommended" way to reference multiple images, leading to an incoherent use of the respective tags. Assigning an OSM node ID to a Commons file would allow a 1:n relation between OSM objects and Commons files. Fl.schmitt (talk) 17:34, 1 September 2024 (UTC)
To add to previous comments: OpenStreetMap way ID (P10689), OpenStreetMap relation ID (P402) & OpenStreetMap node ID (P11693) shouldn't be used here at all. It's just data pollution. Multichill (talk) 20:05, 30 September 2024 (UTC)

sdc experts? how to query based on camera data?

@XRay asked in Commons:Categories_for_discussion/2024/09/Photographs_by_technical_parameters#c-XRay-20240925100400-Voting, how to search for files based on iso values, etc.

is it currently possible? any sdc experts? RZuo (talk) 16:43, 30 September 2024 (UTC)

Sorry, it looks like I have no idea. It's a bit out of context. For example, you can query with haswbstatement or with SPARQL. The first way is rather inflexible, the second is not quite as simple. IMO the ISO value is not searchable with haswbstatement. --XRay 💬 17:48, 30 September 2024 (UTC)
By the way, you can find out what you can query with ?action=cirrusdump. However, this also shows how little can actually be queried. --XRay 💬 17:50, 30 September 2024 (UTC)
Here is a little bit documentation: --XRay 💬 17:52, 30 September 2024 (UTC)
i saw your question over there and thought that it should be raised to more users' attention. if no one has any solutions then a feature request should be filed on phab. RZuo (talk) 18:38, 30 September 2024 (UTC)
You're right about that. The query options are relatively inflexible. SPARQL requires a lot of knowledge. I mentioned things because I don't think the extensive removal of categories makes sense. I query the source code a lot with regular expressions, so deleting would be counterproductive. I think the SDC options for flexible searches with the normal search function are important, but unfortunately there is far too little available. --XRay 💬 04:44, 1 October 2024 (UTC)
@RZuo: Basically query can be done like this, but there are couple of reasons why SDC is not currently substitute for categories. One is that there is that to be useful values are used in combination with other categories. Easiest way afaik to do this is using Petscan which can filter articles based on multiple categories. This is accessible for normal users. In theory some of use cases can be currently done also using SPARQL (example below), there is performance limitations and not all things can be done using SPARQL because information needed for combined queries are not accessible from SPARQL.
:# P6789 = ISO speed 
:# P2151 = Focal length
:# P6790 = f-number
:# P6757 = exposure time
:SELECT * WHERE {
:  ?file wdt:P6790  ?aperture .
:  ?file schema:url ?image.
:  FILTER (?aperture > 14) 
:}
:LIMIT 10
:
Try it!
--Zache (talk) 08:36, 1 October 2024 (UTC)

Emails

Why am I now receiving unsolicited emails about this? I complained back in 2021 about the intrusive notifications, and now someone has the lack of awareness to start emailing me to do an interview. Who in Wikimedia authorised this? STOP. Cnbrb (talk) 09:19, 19 October 2024 (UTC)

Why would a person doing research interviews have any idea that you asked someone else not to contact you? - Jmabel ! talk 14:01, 19 October 2024 (UTC)
If you read what I wrote, you might get an inkling that being pestered about this project is not entirely welcome. And that's not how privacy works. Why would a person doing research interviews assume that I want to receive emails about something I have already expressed disinterest in? Sounds like lax use of personal data to me. Cnbrb (talk) 15:48, 19 October 2024 (UTC)
On your user page is the invitation "Email this user". If you do not wish to receive unsolicited emails you can turn that off in your account preferences. Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 12:29, 20 October 2024 (UTC)

"Publish changes" does nothing

I have changed the geocoordinates of File:Kreuzung L489 - panoramio.jpg in wikitext, because I know better than Panoramio where this picture has been taken. However, now I'm getting the Commons:Structured_data/Reconciliation warning. Switching to the SDC tab, I can add the new coordinates and remove the old ones, but the "Publish changes" button does nothing. No HTTP request. There is an error in the developer console saying Error: View mediainfoview does not exist, but this error already appears when loading the page. "Remove all" removes the coordinates from the list, but no corresponding HTTP request happens, and when reloading the page, the coordinates are back to the way they were, including the discrepancy warning. Scytale (talk) 22:54, 28 October 2024 (UTC)

Derivatives

I have been wondering for some time how the information on derivatives is correctly provided. The starting point is the crop tool, which basically provokes violations of the license conditions. The information is usually not customized. In addition, a bot like User:BotMultichillT comes very quickly and transfers the inadequate information directly into the structured data. Sources, information on the persons who created the derivation and other information relating to the license are missing. I have tried to correct the information for one derivation (File:2019 BMW i3 (Giga Turbine style 429 wheel).jpg) that I know of. But I don't know whether this is correct. I even doubt it, because I'm not really familiar with the structures from Wikidata. Can someone perhaps give me a hint? --XRay 💬 11:38, 30 November 2024 (UTC)

Important to distinguish crops from derivatives in general. Both may affect "depicts," but crops never change where and when the image was created, and with what equipment, and rarely (if ever) create new intellectual property rights, but may require indicating who did the crop where the original license says that derivatives must not be made to appear to be entirely the work of the original author. But the latter does not always apply: for example CC-0, PD, or just cropping out a border. - Jmabel ! talk 17:50, 30 November 2024 (UTC)
Basically, I think it is important that all information is as correct as possible. As a photographer, I decide what a photo should look like. Derivations of any kind are permitted and possible under license. In my opinion, however, it is not acceptable if the person who created the derivative is not named - in other words, if the terms of the license are not adhered to. The person is responsible for the derivative. This is important to me and can also have legal consequences, because not every excerpt is legally permitted. (See for example File:Louvre at night centered.jpg.) However, the question remains as to what the structured data should look like. (The background: I want to use the structured data of the derivatives of my photos in a meaningful way.) --XRay 💬 18:11, 30 November 2024 (UTC)

To be a little more specific: I am concerned with the following properties:

But: How to set this properties? How to fix the crop tool? How to fix the bots like User:BotMultichillT or User:SchlurcherBot?

--XRay 💬 06:59, 1 December 2024 (UTC)

@XRay: I have similar considerations. On my user page, I keep a log of categories, where I feel the SDCs are not sufficient as of now. These are: Other versions (like SVG), retouched versions and other derivatives like crops. I'm happy to update my bot as soon we have something like an aligned modeling specification for this. Here the problem starts for me already is why can extracted from (P7009) not be added to files manually? Why is it greyed out if added by bots? Can we fix this first and then use it? --Schlurcher (talk) 22:44, 1 December 2024 (UTC)
There seem to be a special Wikimedia type of commonsMedia: https://www.wikidata.org/w/index.php?title=Special:ListProperties/commonsMedia&limit=500&offset=50. These directly link to Commons files (but however cannot be added by the structured data UI). extracted from (P7009) is of type commonsMedia, but based on (P144) is not. Should we use for the latter based on media (P12346) instead, which is again of type commonsMedia? --Schlurcher (talk) 08:03, 9 December 2024 (UTC)

OpenRefine - Commons upload validations

As you may know, OpenRefine lets users upload media files to Commons in batch. Because some of the uploads done in this way add too little metadata to the uploaded files, we are considering introducing more pre-upload checks to prevent that. We need your help to determine which metadata fields should be required for any file uploaded via OpenRefine. Are these guidelines still up to date and accurate? Based on this information, we would require the users to provide:

We would not require copyright license (P275) as this statement is not required for works in the public domain, and we don't anticipate being able to be able to express this conditional dependency.

We also looked into adding constraints on the wikitext associated to the media files but this is likely too complicated to implement reliably, as some required parts could be added via different sorts of templates, which OpenRefine isn't able to expand before upload.

What do you think of this plan? Can you think of any case where it would be fine to upload a file without one of the 5 fields mentioned above? Do you think OpenRefine should only warn the user about those missing fields, or even prevent the upload entirely if those fields are not provided? SunilNOpenRefine (talk) 04:41, 11 December 2024 (UTC)

Deleted Wikidata entities used in SDC

There are a number of deleted Wikidata items that are still in use in SDC; see d:Wikidata:Database reports/Deleted Wikidata entities used in SDC. One item is used by 33357 files! Seventy-eight are used on over 50 files. I've opened a discussion at d:Wikidata:Project chat#Deleted items still in use elsewhere. Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 20:18, 28 December 2024 (UTC)

And absolutely no way to know what they were. Probably we need someone from Wikidata to take a look at this to see if some of these should be undeleted because of use here, especially the heavily used ones. - Jmabel ! talk 23:56, 28 December 2024 (UTC)
Most of them are spam, but some indeed might need to be restored. Ymblanter (talk) 08:25, 29 December 2024 (UTC)
As I demonstrate in the linked discussion, we know what some of them were via the Wayback Machine. Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 13:01, 29 December 2024 (UTC)
Category:Commons talk archives