TimedText:Introduction to QuickStatements (Wikidata Leveling Up Days 2024).webm.en.srt

1 00:00:01,920 --> 00:00:03,250 (interviewer) Hello, everyone.

2 00:00:03,250 --> 00:00:04,590 We are here with Sannita,

3 00:00:04,590 --> 00:00:06,580 and we're going to talk about QuickStatements.

4 00:00:07,735 --> 00:00:09,655 So tell us, do you remember the first time

5 00:00:09,655 --> 00:00:11,095 you started editing Wikidata,

6 00:00:11,095 --> 00:00:12,795 and what got you into it?

7 00:00:13,755 --> 00:00:17,180 Well, what got me into Wikidata was,

8 00:00:17,740 --> 00:00:22,000 the presentation at Wikimedia-- sorry, at the Wikimania 2012.

9 00:00:22,220 --> 00:00:24,080 I was there in Washington, DC.

10 00:00:25,225 --> 00:00:27,590 I was at the presentation,

11 00:00:27,590 --> 00:00:29,955 and I immediately fell in love with the project.

12 00:00:30,655 --> 00:00:36,860 So much so that I started editing the beta version of Wikidata,

13 00:00:36,860 --> 00:00:41,480 trying out all the all the things, all the links, and all of that,

14 00:00:42,035 --> 00:00:44,763 and managing to convince, finally,

15 00:00:45,115 --> 00:00:50,262 the team to actually use Italian Wikipedia as a test bed

16 00:00:50,782 --> 00:00:51,980 for the links.

17 00:00:52,610 --> 00:00:55,430 So I have quite a history with Wikidata.

18 00:00:56,110 --> 00:00:58,470 My first edits, though, are about,

19 00:00:59,330 --> 00:01:00,712 time zones,

20 00:01:01,432 --> 00:01:02,515 for some reasons.

21 00:01:02,515 --> 00:01:04,355 I really don't know why

22 00:01:04,355 --> 00:01:06,455 I decided to do time zones,

23 00:01:07,235 --> 00:01:12,260 and then biographies of journalists

24 00:01:12,260 --> 00:01:16,360 and newspaper editors that won a particular award.

25 00:01:17,255 --> 00:01:18,355 I was doing...

26 00:01:19,775 --> 00:01:23,165 a cleanup of those things on Italian Wikipedia.

27 00:01:23,385 --> 00:01:26,370 They said, "Okay. Let's do it first on Wikidata.

28 00:01:26,570 --> 00:01:30,190 Just let's collect all the things that we can do there,

29 00:01:30,330 --> 00:01:32,490 and then move it from there."

30 00:01:32,490 --> 00:01:33,900 And I still haven't finished,

31 00:01:33,900 --> 00:01:35,882 but that's another story for another day.

32 00:01:36,370 --> 00:01:38,505 (interviewer) Nice. Impressive.

33 00:01:38,505 --> 00:01:41,475 Okay. So tell us a little bit about QuickStatements.

34 00:01:41,960 --> 00:01:45,450 What is that tool and why is it particularly useful

35 00:01:45,450 --> 00:01:47,980 for people to enhance their editing experience?

36 00:01:49,235 --> 00:01:50,755 So QuickStatements

37 00:01:50,755 --> 00:01:54,685 it's yet another tool made by Magnus Manske,

38 00:01:54,915 --> 00:01:56,642 and it's particularly useful

39 00:01:56,642 --> 00:02:00,180 when you have to upload a huge batch of data.

40 00:02:00,180 --> 00:02:03,050 So, basically, we all know Wikidata,

41 00:02:03,050 --> 00:02:06,550 and we know that Wikidata hosts a lot of data.

42 00:02:06,775 --> 00:02:12,395 And sometimes you have a huge pack of data that you want to import.

43 00:02:13,255 --> 00:02:16,087 Unfortunately, if you go with the user interface,

44 00:02:16,540 --> 00:02:20,380 you would do it one line at a time,

45 00:02:20,380 --> 00:02:25,520 which means every single subject of that data set

46 00:02:25,520 --> 00:02:26,965 that you are going to import

47 00:02:26,965 --> 00:02:28,535 has to be treated individually.

48 00:02:28,905 --> 00:02:30,345 With QuickStatements,

49 00:02:30,345 --> 00:02:35,440 you can actually do this in a more, faster way.

50 00:02:35,820 --> 00:02:39,732 In this way, you can create a syntax

51 00:02:39,732 --> 00:02:42,905 that is basically creating a sentence,

52 00:02:43,205 --> 00:02:46,905 which is, subject verb object,

53 00:02:47,365 --> 00:02:49,152 which is the number of the item,

54 00:02:49,152 --> 00:02:51,760 the property, and the value of the property.

55 00:02:52,300 --> 00:02:54,240 And in this way, you can upload

56 00:02:54,780 --> 00:03:01,475 a lot of data very easily, very fast, and you don't go one item per time.

57 00:03:02,495 --> 00:03:04,385 (interviewer) Okay. Sounds amazing.

58 00:03:04,385 --> 00:03:06,275 Can you show us what it looks like?

59 00:03:07,030 --> 00:03:09,710 Okay. So I'm going to show something

60 00:03:12,000 --> 00:03:14,010 on my computer first.

61 00:03:14,415 --> 00:03:17,395 I'm going to show you first something here,

62 00:03:18,015 --> 00:03:19,545 which is my CSV.

63 00:03:19,545 --> 00:03:23,315 This is an old project of mine about the quarters of Bologna.

64 00:03:24,200 --> 00:03:26,140 So I already imported that.

65 00:03:26,520 --> 00:03:30,150 What I did was to create a CSV file

66 00:03:30,527 --> 00:03:33,415 with all the things that I need

67 00:03:33,925 --> 00:03:38,335 in terms of item that needs to be edited

68 00:03:38,635 --> 00:03:40,126 and all the values

69 00:03:40,507 --> 00:03:44,580 that needed to be imported in that particular item.

70 00:03:45,570 --> 00:03:49,600 As you can see in this way, I already prepared...

71 00:03:50,455 --> 00:03:53,765 I already prepared all the data

72 00:03:54,025 --> 00:03:56,555 as already should be on QuickStatements,

73 00:03:57,015 --> 00:04:02,050 which means this is p1448, which is official name.

74 00:04:02,350 --> 00:04:07,570 As you can see, you format it as language, column,

75 00:04:08,070 --> 00:04:11,245 bracket, text bracket

76 00:04:11,245 --> 00:04:13,685 or the data or the date. Sorry.

77 00:04:14,330 --> 00:04:17,470 Which is 571 is the inception,

78 00:04:17,610 --> 00:04:20,330 576 is the date of the solve--

79 00:04:20,730 --> 00:04:23,150 of the the solve of the item.

80 00:04:23,465 --> 00:04:29,105 And, also, this is prepared in the way QuickStatements asks to do it.

81 00:04:29,385 --> 00:04:31,805 Of course, if you don't know how to do this,

82 00:04:32,100 --> 00:04:36,598 there's actually a help page on Wikidata that tells you

83 00:04:38,180 --> 00:04:39,807 how to create all this data,

84 00:04:39,807 --> 00:04:44,035 how to model all of this data with the precision,

85 00:04:44,035 --> 00:04:46,835 with the correct brackets, and everything.

86 00:04:46,835 --> 00:04:48,215 So if you need help,

87 00:04:48,480 --> 00:04:52,310 you just type "help colon QuickStatements"

88 00:04:52,310 --> 00:04:54,480 on the Wikidata search bar,

89 00:04:54,480 --> 00:04:56,402 and you'll go directly to the page,

90 00:04:56,402 --> 00:04:58,325 and you will have the user manual there.

91 00:04:59,035 --> 00:05:00,718 (interviewer) Alright. So this is the way

92 00:05:00,718 --> 00:05:03,941 you prepare your data, basically, in a spreadsheet

93 00:05:04,265 --> 00:05:08,410 - before, putting it into QuickStatements. - Exactly.

94 00:05:09,370 --> 00:05:15,330 Right. Then you export it into a CSV form.

95 00:05:15,865 --> 00:05:18,285 And there's another tool that I wanted to share,

96 00:05:20,665 --> 00:05:22,162 which is...

97 00:05:23,562 --> 00:05:25,780 this one, CSV to QuickStatements.

98 00:05:26,480 --> 00:05:28,973 So you export your data

99 00:05:30,050 --> 00:05:33,895 already prepared in a spreadsheet to a CSV file.

100 00:05:34,675 --> 00:05:38,135 You upload the data here.

101 00:05:41,310 --> 00:05:46,230 And as you can see, you already have the data immediately here.

102 00:05:46,590 --> 00:05:52,755 I'll just make it a little bit bigger.

103 00:05:53,455 --> 00:05:54,992 So as you can see,

104 00:05:54,992 --> 00:06:00,481 you have the subject verb object phrase

105 00:06:00,481 --> 00:06:02,090 that I told you about.

106 00:06:02,630 --> 00:06:05,925 So what happens is that QuickStatements will add,

107 00:06:06,465 --> 00:06:11,356 step-by-step, all of this data all in one go.

108 00:06:12,320 --> 00:06:16,580 Which is basically add to item

109 00:06:17,550 --> 00:06:21,105 Q110825732,

110 00:06:21,265 --> 00:06:23,615 add the property 1448

111 00:06:23,615 --> 00:06:26,705 and with the value "Andrea Costa - Saragozza."

112 00:06:28,375 --> 00:06:30,510 Then the property 571

113 00:06:30,510 --> 00:06:34,100 with the value "1962... etc. etc."

114 00:06:34,670 --> 00:06:39,805 "5761985... etc. etc.," and so on and so forth.

115 00:06:40,185 --> 00:06:43,005 All of this is already prepared for QuickStatements.

116 00:06:43,385 --> 00:06:46,205 So you just click on copy to QuickStatements,

117 00:06:47,060 --> 00:06:52,680 and it opens automatically the QuickStatements interface.

118 00:06:53,595 --> 00:06:58,315 And as you can see, here, you have the command ADD

119 00:06:58,315 --> 00:07:00,575 which means you can add this particular,

120 00:07:01,710 --> 00:07:02,770 data here,

121 00:07:02,990 --> 00:07:05,327 and it reads automatically

122 00:07:05,675 --> 00:07:08,712 which item or which property you are adding

123 00:07:08,712 --> 00:07:11,055 or which other item you're adding as value.

124 00:07:11,855 --> 00:07:13,662 But first, remember

125 00:07:13,978 --> 00:07:19,481 that you have to log in inside the tool.

126 00:07:20,385 --> 00:07:22,260 So you click on log in.

127 00:07:22,880 --> 00:07:25,940 It will open this dialogue box here.

128 00:07:26,210 --> 00:07:28,650 In order to complete your request needs permission,

129 00:07:28,915 --> 00:07:30,542 you click on Allow

130 00:07:31,885 --> 00:07:35,593 and then you can reload the data

131 00:07:36,245 --> 00:07:37,462 and there you go.

132 00:07:37,872 --> 00:07:40,190 You can already import the data.

133 00:07:40,410 --> 00:07:41,950 You just click on Run

134 00:07:42,650 --> 00:07:43,947 and then you're done.

135 00:07:45,057 --> 00:07:46,625 You have imported the data.

136 00:07:46,625 --> 00:07:48,925 Of course, I'm not going to import this data

137 00:07:48,925 --> 00:07:50,785 because I already did it in the past,

138 00:07:50,985 --> 00:07:52,535 and it would be a repetition.

139 00:07:52,840 --> 00:07:58,060 But with these quick steps,

140 00:07:58,280 --> 00:08:00,220 you can do it on your own.

141 00:08:00,915 --> 00:08:02,215 If you don't want to use

142 00:08:03,395 --> 00:08:05,565 CSV to QuickStatements,

143 00:08:06,015 --> 00:08:09,895 you can create your own command from the batch.

144 00:08:10,160 --> 00:08:11,820 You just go to QuickStatements.

145 00:08:11,820 --> 00:08:14,240 Again, remember to log in.

146 00:08:14,240 --> 00:08:16,690 Always need to check if you're logged in here.

147 00:08:17,239 --> 00:08:19,140 Then you click on New batch.

148 00:08:19,775 --> 00:08:22,615 This will open this window,

149 00:08:22,615 --> 00:08:24,415 this dialogue window,

150 00:08:24,415 --> 00:08:30,230 in which you can copy paste the commands.

151 00:08:30,930 --> 00:08:34,430 It must be on a tab separated value,

152 00:08:34,595 --> 00:08:37,175 which means every line has to have

153 00:08:37,795 --> 00:08:39,555 the item that you're going to edit,

154 00:08:39,555 --> 00:08:41,375 the property that you're going to add,

155 00:08:41,375 --> 00:08:43,735 and the value that you're going to add. Okay?

156 00:08:44,310 --> 00:08:46,770 This means that every single line

157 00:08:47,330 --> 00:08:51,110 must have only one value, remember that.

158 00:08:51,570 --> 00:08:54,475 No commas, only tabs. Okay?

159 00:08:55,015 --> 00:08:59,515 And then you click on Import V1 commands here.

160 00:09:01,470 --> 00:09:02,770 And then you have,

161 00:09:03,310 --> 00:09:04,590 as you can see,

162 00:09:04,590 --> 00:09:08,110 again, this window

163 00:09:08,110 --> 00:09:12,365 where you have the item that you're going to edit,

164 00:09:12,745 --> 00:09:16,255 then the action that you're going to do, add the statements.

165 00:09:16,510 --> 00:09:19,250 You can also remove statements, but that's for later.

166 00:09:19,870 --> 00:09:22,162 And then the property that you're going to add

167 00:09:22,162 --> 00:09:23,815 and the value that you're going to add.

168 00:09:24,135 --> 00:09:26,255 Another thing that I didn't show you before

169 00:09:26,255 --> 00:09:28,505 is that you can also check

170 00:09:28,505 --> 00:09:31,195 how many edits you want to show,

171 00:09:31,550 --> 00:09:35,410 like, 25, 50, 100 or 500.

172 00:09:36,240 --> 00:09:38,050 This is really important.

173 00:09:39,175 --> 00:09:42,485 When it shows you the window,

174 00:09:42,935 --> 00:09:46,075 please double check what you're uploading.

175 00:09:46,750 --> 00:09:48,740 Once you upload the data,

176 00:09:48,740 --> 00:09:50,440 you cannot stop.

177 00:09:50,580 --> 00:09:52,500 You can stop the--

178 00:09:53,545 --> 00:09:56,685 you can stop at any time the upload of the data.

179 00:09:57,145 --> 00:09:59,405 But if you do a mistake,

180 00:09:59,705 --> 00:10:03,400 then you'll have to fix it manually.

181 00:10:03,780 --> 00:10:07,570 So it is extremely important for you

182 00:10:07,570 --> 00:10:10,200 to double check what data you're importing.

183 00:10:10,595 --> 00:10:13,745 And if I can give you a suggestion,

184 00:10:14,395 --> 00:10:17,870 import small chunks of data every single time.

185 00:10:18,190 --> 00:10:22,600 So focus on 50 to 100 edits,

186 00:10:22,600 --> 00:10:24,775 and then you move on to the next batch.

187 00:10:26,565 --> 00:10:31,545 (interviewer) Alright. So the data is imported to Wikidata

188 00:10:31,545 --> 00:10:33,785 directly as soon as you click on Run.

189 00:10:34,560 --> 00:10:39,608 How can you check if your data was correctly imported to Wikidata?

190 00:10:40,960 --> 00:10:43,120 You can check on your contributions.

191 00:10:43,765 --> 00:10:46,245 You should see at some point,

192 00:10:46,245 --> 00:10:50,165 you can see this is the Bologna project that I was talking about.

193 00:10:50,165 --> 00:10:53,065 In fact, I already imported lots of data.

194 00:10:53,480 --> 00:10:54,900 You can check because--

195 00:10:54,900 --> 00:10:58,690 you can check on your contributions here.

196 00:10:59,240 --> 00:11:04,115 And then you see that every line is Created claim: property, value

197 00:11:04,445 --> 00:11:07,485 and then it has the tag #quickstatements,

198 00:11:09,145 --> 00:11:10,870 temporary batch, blah blah blah.

199 00:11:11,110 --> 00:11:14,840 Of course, you can also give a name to your batch

200 00:11:14,840 --> 00:11:17,530 but that's another story for another day.

201 00:11:17,710 --> 00:11:22,595 So, you can check your data directly in your contributions,

202 00:11:22,915 --> 00:11:25,200 and you can quickly sort them

203 00:11:25,200 --> 00:11:28,645 if you check, your contributions

204 00:11:28,645 --> 00:11:32,080 with the tag quickstatements 2.0.

205 00:11:32,440 --> 00:11:34,540 (interviewer) Right, but what if I realize

206 00:11:34,540 --> 00:11:37,100 a little bit too late that I made a mistake

207 00:11:37,100 --> 00:11:39,625 and I imported wrong data into Wikidata?

208 00:11:39,625 --> 00:11:41,035 Is there a way I can fix it?

209 00:11:42,135 --> 00:11:46,025 So you imported the wrong data. This happened to me too.

210 00:11:46,370 --> 00:11:47,550 So don't worry.

211 00:11:47,550 --> 00:11:48,830 First of all, don't worry.

212 00:11:48,830 --> 00:11:50,290 You're not going to be killed,

213 00:11:50,290 --> 00:11:52,937 and you're not going to be blocked because of it.

214 00:11:53,310 --> 00:11:55,390 Every everybody does mistakes,

215 00:11:55,390 --> 00:11:56,595 so can you.

216 00:11:57,185 --> 00:11:58,472 That's the first part.

217 00:11:59,075 --> 00:12:01,615 The second part is that, unfortunately,

218 00:12:01,615 --> 00:12:03,535 you have to fix it manually.

219 00:12:04,030 --> 00:12:05,890 You go directly to the item,

220 00:12:06,510 --> 00:12:09,330 and you fix manually the value.

221 00:12:09,790 --> 00:12:12,150 Of course, if you do a lot--

222 00:12:12,150 --> 00:12:17,655 if you notice that you have lots of mistakes in your dataset,

223 00:12:18,315 --> 00:12:22,335 stop immediately the import.

224 00:12:23,070 --> 00:12:25,410 Where after you click on Run,

225 00:12:25,870 --> 00:12:30,330 the command will change into a big red button called Stop.

226 00:12:30,785 --> 00:12:32,545 You use it immediately.

227 00:12:32,545 --> 00:12:35,885 You fix the data, first on Wikidata,

228 00:12:36,225 --> 00:12:39,230 then in your in your dataset,

229 00:12:39,770 --> 00:12:43,630 and then you restart again your import.

230 00:12:43,900 --> 00:12:45,310 (interviewer) Alright. Thank you.

231 00:12:45,605 --> 00:12:47,875 Can you give us a few examples

232 00:12:47,875 --> 00:12:50,875 of projects where you can use QuickStatements,

233 00:12:50,875 --> 00:12:53,305 like cases where it's particularly useful?

234 00:12:53,595 --> 00:12:57,820 Yes. I have an an example of a project that I can show you,

235 00:12:57,820 --> 00:13:00,640 which is exactly the quarters of Bologna project

236 00:13:00,640 --> 00:13:02,370 that that I still have on my computer.

237 00:13:02,735 --> 00:13:06,561 But for this, I'm going to show you also another tool

238 00:13:06,561 --> 00:13:10,615 that probably will be a focus for another video,

239 00:13:10,615 --> 00:13:14,603 or you can ask around about it, which is OpenRefine.

240 00:13:15,180 --> 00:13:19,955 As you can see, I prepared here all the data very neatly

241 00:13:19,955 --> 00:13:23,415 about all the things that I wanted to import.

242 00:13:23,955 --> 00:13:28,160 And I, created that, in a way that would be

243 00:13:29,390 --> 00:13:34,810 already organized to export to Wikidata.

244 00:13:35,605 --> 00:13:39,065 So, once you're done with OpenRefine, you can do this.

245 00:13:39,205 --> 00:13:41,830 You can click on Export to QuickStatements

246 00:13:42,325 --> 00:13:44,340 and it will automatically,

247 00:13:44,340 --> 00:13:48,320 if you create a model for this data, you can create automatically,

248 00:13:49,540 --> 00:13:52,840 an export that is ready for QuickStatements.

249 00:13:53,605 --> 00:13:55,615 What is important to know

250 00:13:55,615 --> 00:13:59,755 is that you can prepare with another--

251 00:14:01,285 --> 00:14:04,740 with another software, which is Calc, maybe,

252 00:14:04,740 --> 00:14:06,620 from the OpenRefine Calc

253 00:14:06,840 --> 00:14:08,812 on Excel or OpenRefine.

254 00:14:08,812 --> 00:14:11,545 You can prepare your dataset

255 00:14:11,545 --> 00:14:16,715 in a way that is going to be easy to export for QuickStatements.

256 00:14:17,045 --> 00:14:20,455 And once you're done, you can import the data there.

257 00:14:21,000 --> 00:14:25,630 It's very important because, of course, it's going to be easier.

258 00:14:25,630 --> 00:14:29,845 It's going to be faster than doing it item per item.

259 00:14:30,315 --> 00:14:35,855 And you can have also control on all the data that you can import.

260 00:14:36,680 --> 00:14:41,456 The most important thing is that with QuickStatements,

261 00:14:42,360 --> 00:14:47,002 you have the possibility of editing more items,

262 00:14:47,002 --> 00:14:49,785 just in one fell swoop.

263 00:14:50,645 --> 00:14:53,225 So it's easier, it's faster,

264 00:14:54,090 --> 00:14:56,900 and it's also going to be--

265 00:14:58,510 --> 00:15:02,940 it's also easier for you to double check if you're missing some data.

266 00:15:03,625 --> 00:15:06,445 Because, of course, if you edit item per item

267 00:15:06,745 --> 00:15:09,465 and you have to remember

268 00:15:09,465 --> 00:15:12,925 each time, which property you're going to add.

269 00:15:13,310 --> 00:15:14,830 This is going to be tedious,

270 00:15:14,830 --> 00:15:17,770 and sometimes you can just jump one property.

271 00:15:17,770 --> 00:15:20,790 You can just skip a property without noticing.

272 00:15:20,790 --> 00:15:22,045 With QuickStatements,

273 00:15:22,045 --> 00:15:24,795 you just don't, because you just prepare the data,

274 00:15:24,935 --> 00:15:26,095 and then you're done.

275 00:15:26,095 --> 00:15:28,925 And then you just upload it in one fell swoop.

276 00:15:30,190 --> 00:15:31,930 (interviewer) Awesome! Thank you so much.

277 00:15:33,070 --> 00:15:37,390 And let's say I have some questions about Wikidata or I need some help.

278 00:15:37,390 --> 00:15:39,010 Where can I go and ask?

279 00:15:40,095 --> 00:15:44,985 So first of all, if you have doubts about how to model data,

280 00:15:45,215 --> 00:15:50,640 again, "help: QuickStatements" in the search bar on Wikidata,

281 00:15:50,940 --> 00:15:54,960 you will go directly to the user manual of QuickStatements.

282 00:15:55,435 --> 00:15:57,535 If you have problems with it,

283 00:15:57,675 --> 00:16:00,585 you can ask in the Wikidata Telegram channel

284 00:16:00,585 --> 00:16:02,095 or at the project chat.

285 00:16:02,630 --> 00:16:05,350 Or you can contact me, Sannita,

286 00:16:05,350 --> 00:16:09,850 in my talk page, and I will gladly help you in finding a solution.

287 00:16:10,175 --> 00:16:13,149 But, anyway, anytime that you need help,

288 00:16:13,615 --> 00:16:15,492 just go out and ask.

289 00:16:15,492 --> 00:16:19,610 Don't be afraid to ask because this is what we are here for,

290 00:16:19,610 --> 00:16:20,810 to help each other.

291 00:16:21,510 --> 00:16:22,625 (interviewer) Awesome.

292 00:16:22,625 --> 00:16:25,310 Thank you, Sannita, and have fun with QuickStatements.