Wikisource:Scriptorium
From Wikisource
| ← Community pages | Scriptorium | Archives→ |
| The Scriptorium is Wikisource's community discussion page. Feel free to ask questions or leave comments. You may join any current discussion or start a new one. Project members can often be found in the #wikisource IRC channel. For discussion related to the entire project (not just the English chapter), please discuss at the multilingual Wikisource. |
[edit] Announcements
[edit] November is Validation Month
At PotM we had a discussion and thought that it would be good to get some of our Proofread works up to the Validated. To add a bit of variety, we have eight texts that will loop through. If (as?) they progress to completed, we can add and subtract to the rotation. Hope that you will join us in getting a range of works completed. Cheers. -- billinghurst (talk) 22:31, 31 October 2009 (UTC)
- Found a tool to generate a link to a random page in the Category:Proofread. Use it for your morning heart-starter.billinghurst (talk) 01:58, 21 November 2009 (UTC)
[edit] Validation Count Hits 10,000 Pages
At some point during this afternoon, we passed the 10,000 page mark in Category:Validated! Many thanks to all who have taken part in this month’s special project. Onwards to 20,000! :-) Tarmstro99 (talk) 01:46, 11 November 2009 (UTC)
[edit] WikiProjects replaces Song of the Day
To let the community know that in the {{Welcome}} banner we have replaced the quiescent {{Song of the day}} with {{Active projects}} (AP). We have identified five active WikiProjects, and each will feature (weekly rotation).
Background: Recently at {{Welcome}} talk page there was a discussion about revamping what was on offer to new visitors. Hopefully this will give a nice blend.
- For discussion about {{welcome}} please use Template talk:Welcome
- For information about Wikisource:WikiProjects, and if you believe that you have an active wikiproject that should join the rotation please start discussion at Template talk:Active projects
-- billinghurst (talk) 12:54, 7 November 2009 (UTC)
[edit] Match and Split tool
User:ThomasV has graced us with a new tool in the never-ending journey of migrating to Page: images and transcluded text in place of the main namespace text alone. The tool runs using his bot and it undertakes a Match and Split of pages where a user identifies the co-existing pages in the main and Page: namespaces. I am calling for some trial works on which we can do testing, so if anyone has candidates, please identify them. Some draft instructions are at Match and Split.
To utilise this tool, you will need to turn it on via your Gadgets. billinghurst (talk) 13:20, 14 November 2009 (UTC)
- Found a work. See Special:Contributions/ThomasBot billinghurst (talk) 13:57, 14 November 2009 (UTC)
[edit] Fix of Index page status issue
ThomasV has had the fix of the Index: page status reset problem implemented into production. Woohoo Thomasv!
There is a little work that editors will need to undertake. Where the status on an index page is out of kilter with the respective Page: page status of the works, then each individual Page: page status need to be renewed by performing a null edit such pages, then undertaking a purge on the Index: page to renew its status. At some point we will look to get a bot to run through the pages, however, we cannot guarantee that we will get all pages, nor can we guarantee that it will be done soon.
[Easy purge -> click the clock top right if you have that gadget functioning] billinghurst (talk) 23:22, 16 November 2009 (UTC)
[edit] Update of Gadget Popups
With some hand-holding from Pathoschild, I have updated to the newer version of the gadget Popups. The advantage I have found with this version is that it allows one to copy and paste the title from the popup box. For those who wish to customise, especially admins, have a peek at the customisation available for your monobook.css file. billinghurst (talk) 05:40, 28 November 2009 (UTC)
- Link needs correcting please, the wikipedia page was deleted. Cygnis insignis (talk) 09:56, 28 November 2009 (UTC)
[edit] Proposals
[edit] Move PD-manifesto works to Canadian Wikilivres?
After reading the Copyright Act of Canada more closely, I found that possibly copyrighted public speeches may have Canadian legal permission to post per Wikilivres:Template:Manifesto that I moved from Wikilivres:Template:PD-manifesto and rewrote, after discussing with Yann at Wikilivres:Wikilivres:Community_Portal/en#Canadian_copyright_protection_of_lectures. I would like to ask users here, especially my dear fellow administrators, if we should move {{PD-manifesto}} works to Canadian Wikilivres. I propose this as Wikisource:Possible copyright violations talks about 2009 Alaskan Governor Resignation Speech. Meanwhile, I also ask Chinese Wikisource users about a similar page move, as I consider allowing PD-manifesto on Wikisource possibly encouraging more abuses and endless arguments, therefore contrary to wmf:Resolution:Licensing policy about Free Content License.--Jusjih (talk) 04:05, 15 October 2009 (UTC)
- Nice find, that would free up just about all speeches, wouldn't it? I can get cracking on recent Canadian works...Sherurcij Collaboration of the Week: Author:David Livingstone. 04:40, 15 October 2009 (UTC)
- Great news! I'm very glad if these are indeed acceptable at Wikilivres under Canadian law; it frees up the concerns that we have here of PD-Manifesto simply being applied willy-nilly to texts that are (usually clearly) not intended to be freely licensed, simply because we would like the text. Indeed, if we do recommend these texts go to Wikilivres, I think we should go through the non-speeches and assign them a license per-text, and remove {{PD-Manifesto}} completely. Jude (talk) 07:39, 15 October 2009 (UTC)
- Perhaps any Wikilivres administrator including myself can start importing PD-manifesto works to Canadian Wikilivres, then delete them here, possibly without further discussion, though courtesy messages to their contributors will be better, as IP edits on Wikilivres are severely restricted after excessive vandalism.--Jusjih (talk) 20:46, 15 October 2009 (UTC)
- Nah, only dead fish ”go with the flow” ... Cygnis insignis (talk) with apologies to Sarah Palin, please don't sue me!
- No. Who put that notice at the top of the page? I think we should argue each case individually on its merits, rather than prejudging them all. For example we would have to give up the Dalai Lama's manifesto for one thing. ResScholar (talk) 04:42, 16 October 2009 (UTC)
- I believe it would specifically cover speeches such as 2009 Alaskan Governor Resignation Speech that have been tagged as PD-Manifesto without any noticeable evidence of their being intended to be released in such a manner. The other non-speech works would (and shouldn't) be removed en masse: they need to individually be judged. Jude (talk) 04:54, 16 October 2009 (UTC)
- I posted on the top due to very serious copyright concern of the topic. I never intend to prejudge them all. Some speeches are not made so "publicly", so they do not even fit PD-manifesto right here. I agree Jude to individually judge relevant works. Only those clearly compatible with cc-by-sa-3.0 and GFDL should stay here, otherwise, Canadian Wikilivres if fitting Wikilivres:Template:Manifesto, or delete without exporting. These three choices are being applied on Chinese Wikisource as well. As the only administrator of both Chinese and English Wikisources, I now see PD-manifesto misleading too many contributors on both subdomains who could post clearly acceptable works.--Jusjih (talk) 22:45, 16 October 2009 (UTC)
- I agree with the copyright concerns, and had only recently decided to start going through the category and highlight texts which don't seem to be free. There are a lot of speeches on Wikisource that are of dubious freedom, and they all need to be investigated sooner, rather than later. Jude (talk) 23:45, 16 October 2009 (UTC)
- I posted on the top due to very serious copyright concern of the topic. I never intend to prejudge them all. Some speeches are not made so "publicly", so they do not even fit PD-manifesto right here. I agree Jude to individually judge relevant works. Only those clearly compatible with cc-by-sa-3.0 and GFDL should stay here, otherwise, Canadian Wikilivres if fitting Wikilivres:Template:Manifesto, or delete without exporting. These three choices are being applied on Chinese Wikisource as well. As the only administrator of both Chinese and English Wikisources, I now see PD-manifesto misleading too many contributors on both subdomains who could post clearly acceptable works.--Jusjih (talk) 22:45, 16 October 2009 (UTC)
- I believe it would specifically cover speeches such as 2009 Alaskan Governor Resignation Speech that have been tagged as PD-Manifesto without any noticeable evidence of their being intended to be released in such a manner. The other non-speech works would (and shouldn't) be removed en masse: they need to individually be judged. Jude (talk) 04:54, 16 October 2009 (UTC)
Comment. FWIW... The Palin Resignation Speech was published on the official Alaska state .GOV site at some point prior to or soon after the event taking place, so I can't understand why it would not be covered as any other publication is covered that can typically be found for public inspection on any of the 50 state's official sites. George Orwell III (talk) 00:22, 17 October 2009 (UTC)
- There is currently a discussion regarding its copyright state at WS:COPYVIO, which would be a better place for your comments. Regardless, as pointed out by Prosfilaes, states can and frequently do claim copyright on works. Just because something is freely available on their website does not make it free and useable on Wikisource. Jude (talk) 00:34, 17 October 2009 (UTC)
- Sure each state will look at things differently, and I believe that it was not that long ago that Oregon gave up enforcing protection of this sort.Eclecticology - the offended (talk) 07:59, 20 October 2009 (UTC)
- Comment. Not be a lawyer I say lets Keep It Simple if a Copyright is automatically granted upon creation unless disclaimer by the Author or the law make it public domain then {{PD-manifesto}} is no good because copyright is assumed. If a work is under copyright it under copyright no matter how many people say it is in Public Domain. There are maters internal to wikisource like Notability then there are maters external to wikisource like copyright. I say let play safe and do not have {{PD-manifesto}} or better yet let a Lawyer do what Lawyers good for keep Wikisource out HOT WATER with to external world, By giving us good info and sometimes making the call.--Lookatthis (talk) 05:04, 19 October 2009 (UTC)
- I don't think there's any danger of you being mistaken for a lawyer. ResScholar (talk) 07:51, 19 October 2009 (UTC)
- While copyright is indeed a "Matter external", WMF sites have typically taken a more restrictive interpretation of copyright law than would be demanded by the law alone. If determining whether something is under copyright or in the public domain were so straightforward we would not be having this discussion. There is a very wide gap between what is clearly in one camp or the other. No lawyer can give a definitive answer that will safely apply in all circumstances. The effective ones are able to argue either side of a case, and win. "Playing safe" is too often a loser's strategy.Eclecticology - the offended (talk) 19:48, 19 October 2009 (UTC)
- WMF can "have more restrictive interpretation of copyright law than would be demanded by the law alone" determining the copyright status is not straightforward That is why I now think we should let someone like an administrator who read lot more copyright law me make the call.--Lookatthis (talk) 23:55, 19 October 2009 (UTC)
- The connection between being an administrator and experience in reading copyright law is a non sequitur. Anyone who has been here for several years is familiar with these laws, and a multiplicity of opinions persist. The call should certainly not be in the hands of a single administrator who has his own strong POV about copyright law. That would be an abandonment of community responsibility. Eclecticology - the offended (talk) 07:59, 20 October 2009 (UTC)
- The question is whether playing it safe is a good strategy depends on how important pushing the limits is. Furthermore, we're making this available as Free Content, for people who may be forced to fold when a serious copyright complaint comes up, which is a good reason to play it safe.--Prosfilaes (talk) 00:08, 20 October 2009 (UTC)
-
- There is nothing wrong with folding when a serious copyright complaint comes up, and the situation is re-valued on its own merits, but neither is there anything wrong with defending against that complaint if that same re-valuation shows that defence is warranted. To make playing safe depend on the importance of pushing limits makes no sense at all. Nobody is proposing pushing any limits on this site. Playing safe is a kind of negative limit. Between the safe negative limit and the foolhardy positive limit there is a very wide range of options. Rather than retreating behind a doctrinnaire interpretation of Free Content, we would be further ahead by not being so risk averse. Eclecticology - the offended (talk) 07:59, 20 October 2009 (UTC)
- Anyone can "pushing the limits" on their own webpage I am not recommending that, Wikisource needs clear limits.--Lookatthis (talk) 00:53, 20 October 2009 (UTC)
-
- WMF can "have more restrictive interpretation of copyright law than would be demanded by the law alone" determining the copyright status is not straightforward That is why I now think we should let someone like an administrator who read lot more copyright law me make the call.--Lookatthis (talk) 23:55, 19 October 2009 (UTC)
I don't read Section 32.2 of the Canadian Copyright Act the way most seem to be doing. It protects "reports" ("comptes rendus" in the French version of the law). How does a report equate to a complete text? I still think that both projects should take a liberal view toward manifesti, but 32.2 alone is not a reliable authority for such action. Eclecticology - the offended (talk) 19:48, 19 October 2009 (UTC)
- Wikilivres would be free to claim under Sections 29-29.2 which suggest that the purposes of "research or private study" allow full reproduction under Fair Dealing. Sherurcij Collaboration of the Week: Author:Khwaja Kamal-ud-Din. 20:03, 19 October 2009 (UTC)
- Although there are some notable differences Canadian fair dealing and US fair use are remarkably convergent. If the lecture provision is not applicable, and we must depend on fair dealing, we are no further ahead than if we relied on fair use. I am concerned that this proposal may be nothing more than passing the buck to the Canadian legal system. Eclecticology - the offended (talk) 07:59, 20 October 2009 (UTC)
- This is a misconception: fair dealing does not allow full reproduction, and is in fact explicitly prohibited on Wikisource. (Even if this were not a problem, fair dealing would not confer any of the other rights required by Wikisource.) --Piet Delport (talk) 09:35, 20 October 2009 (UTC)
- Actual manifestos, perhaps. Tagging a speech as a manifesto "just because I want it on Wikisource" is the sort of thing that will cause legal issues for the foundation. We cannot take a liberal view towards manifestos, and we shouldn't take a liberal view to manifestos because the {{PD-Manifesto}} template, as far as I understand, has absolutely no legal backing whatsoever. Jude (talk) 22:32, 19 October 2009 (UTC)
- I think that is right--Lookatthis (talk) 00:07, 20 October 2009 (UTC)
-
- The view that "manifesti" will cause legal issues for the foundation is speculation and urban legend as credible as the belief that the sewers of New York City are full of alligators. I can't say that I like the term "manifesto", and I do have concerns about its imprecision and lack of clear legal backing. Maybe the place to start is with a clear definition of what we mean with that word. This is not a simple question of "wanting it on Wikisource." Copyright law does not exist in a vacuum, and needs to be balanced by other considerations, such as the public interest. Copyright in common law countries is historically an economic right; this is also reflected in the fair use factor about the effect on a work's market. Works that could receive the "manifesto" label are largely political in nature — a politician of the day speaking on a current issue. Is it in the public interest to have a politician's words so tied in copyright knots that the transparency required for accountability is thereby so severely hampered. There is an element of irony in the notion that Sarah Palin would need to depend on a freer Canadian law to have her words properly reported. Eclecticology - the offended (talk) 07:59, 20 October 2009 (UTC)
-
-
- I don't appreciate my concerns being compared to urban legends. If we're going to go down the route of name calling and rude comments, then I think this discussion is a moot. Jude (talk) 10:04, 20 October 2009 (UTC)
- I don't see how you can confuse a statement that something is an urban legend with a comparison. There was in fact neither name calling nor rude comments. Lighten up! Eclecticology - the offended (talk) 04:13, 22 October 2009 (UTC)
- The loopholes for "public interest" in copyright law are called fair use. If Palin challenged the use of her speech as part of a manifesto from an extreme right-wing group on copyright grounds, she'd probably win; thus the speech isn't public domain.--Prosfilaes (talk) 23:14, 4 November 2009 (UTC)
- This has nothing to do with "loopholes", nor is the discussion advanced by attempting to conflate public interest with fair use. Neither does the applicability of fair use or public interest imply public domain. Who's arguing that? A document whose publication is in the public interest may or may not pass the fair use tests. Public interest in the present context is about the right of the public to be correctly informed about the pronouncements of major political figures, free of political spin. This allows the public to make informed decisions. It is conceivable that this could apply to some other speakers, but I'm not yet ready to make a case for them. Eclecticology - the offended (talk) 08:28, 6 November 2009 (UTC)
- In the US copyright law, public interest reuse falls under fair use. There's no right to quote in full the text of major political figures; if we want information about The Audacity of Hope, we have to provide a non-derivative summary and explanation, not the text of the book.--Prosfilaes (talk) 01:50, 7 November 2009 (UTC)
- Where in the Copyright Act does it mention Public interest? But if people are going to fall for your conflation there's not much that can be done about it. And why bring up The Audacity of Hope as a straw man? We were talking about speeches; books were never a part of the discussion before this. If there is no right to accurately report political speeches how can anyone be sure that "a non-derivative summary and explanation" is accurate? Eclecticology - the offended (talk) 08:23, 7 November 2009 (UTC)
- If it doesn't mention public interest in the copyright act, where are you getting your legal justification to do this? Why is reproducing polished bits of propaganda exactly more important than reproducing books and reports? There's a lot of things you have no right to exactly copy; we know that non-derivative summaries and explanations are accurate because of our trust in the source, and if we have concerns, we compare several sources. And again; Free content, not we can post it. There's lots of folding, spindling and mutilating that Free content demands that aren't in the public interest.--Prosfilaes (talk) 13:40, 7 November 2009 (UTC)
- Where in the Copyright Act does it mention Public interest? But if people are going to fall for your conflation there's not much that can be done about it. And why bring up The Audacity of Hope as a straw man? We were talking about speeches; books were never a part of the discussion before this. If there is no right to accurately report political speeches how can anyone be sure that "a non-derivative summary and explanation" is accurate? Eclecticology - the offended (talk) 08:23, 7 November 2009 (UTC)
- In the US copyright law, public interest reuse falls under fair use. There's no right to quote in full the text of major political figures; if we want information about The Audacity of Hope, we have to provide a non-derivative summary and explanation, not the text of the book.--Prosfilaes (talk) 01:50, 7 November 2009 (UTC)
- This has nothing to do with "loopholes", nor is the discussion advanced by attempting to conflate public interest with fair use. Neither does the applicability of fair use or public interest imply public domain. Who's arguing that? A document whose publication is in the public interest may or may not pass the fair use tests. Public interest in the present context is about the right of the public to be correctly informed about the pronouncements of major political figures, free of political spin. This allows the public to make informed decisions. It is conceivable that this could apply to some other speakers, but I'm not yet ready to make a case for them. Eclecticology - the offended (talk) 08:28, 6 November 2009 (UTC)
- I don't appreciate my concerns being compared to urban legends. If we're going to go down the route of name calling and rude comments, then I think this discussion is a moot. Jude (talk) 10:04, 20 October 2009 (UTC)
-
- Definitely no go
- Eclecticology is right: what 32.2 allows is to "make or publish, for the purposes of news reporting or news summary, a report of a lecture given in public, or an address of a political nature given at a public meeting", which
- does not equate the complete text, and
- does not permit the other rights also required by Wikisource's free content definition, beyond just publication (that is, modification and exploitation by anyone, in any form, and for any purpose)
- It certainly does not make the work public domain, as the use of a PD-* template would suggest. --Piet Delport (talk) 09:15, 20 October 2009 (UTC)
-
- In which case, we need to consider each and every work that is currently tagged PD-Manifesto. I have already said before that I believe each work with this tag should have a detailed rationale. Those for which a detailed rationale cannot be provided should be deleted. Jude (talk) 10:10, 20 October 2009 (UTC)
- Wikisource requires cc-by-sa-3.0 and GFDL, so reproducing the entire copyright-restricted work to claim "fair" use is impractical and thus forbidden, but Wikilivres just requires any permission to post in Canada, even if non-commercial and non-derivative. Most speeches and manifestos are unlikely getting Wikipedia articles, so perhaps making or publishing for the purposes of news reporting or news summary is possible with a short introduction in the Wikilivres header, maybe combined with Canadian fair dealing while the activities of Wikilivres are strictly non-commercial while not promising that downstream users may commercially copy everything. US fair use and Canadian fair dealing have no clear boundaries with copyright infringement regarding how much may be legally quoted. To clear this matter, we should review the Copyright Act of Canada carefully. Even if moving PD-manifesto works to Canadian Wikilivres is not feasible, many can be found on the web.--Jusjih (talk) 03:57, 22 October 2009 (UTC)
- In which case, we need to consider each and every work that is currently tagged PD-Manifesto. I have already said before that I believe each work with this tag should have a detailed rationale. Those for which a detailed rationale cannot be provided should be deleted. Jude (talk) 10:10, 20 October 2009 (UTC)
-
-
- Yes, each and every work should be considered individually. "PD-Manifesto" is a problematic term. I would prefer a "w:Public interest" category, though I am aware of the vulnerability of that term to abuse. This would primarily refer to certain political speeches. Most people who make such speeches do so in the moment without any consideration of copyright, and they only receive it because of the broad automatic operation of the law. A detailed rationale would have the same effect, but if, for example, we considered replies to a State of the Union message it should not be necessary to go into detail for each of these. A single rationale for a class of speeches should suffice. Eclecticology - the offended (talk) 04:13, 22 October 2009 (UTC)
-
I think we should copy the speeches there, not move it. The latter would (presumably) mean deleting the speeches from Wikisource, which I don't think is necessary. --Ixfd64 (talk) 19:44, 11 November 2009 (UTC)
- Wikisource doesn't discuss what Wikilivres does; if they or anyone else wants to copy the speeches, they are free to do so. But Wikilivres says "You are welcome to publish texts here if they cannot be accepted in Wikisource. Texts that can be accepted in Wikisource should be published there." So the discussion here is getting rid of a mess of content that's questionably Free content at best.--Prosfilaes (talk) 20:17, 11 November 2009 (UTC)
- To respond to Ixfd64, Wikilivres never wants to duplicate anything acceptable here. As PD-manifesto is merely the presumed but not necessarily confirmed public domain, it is not really acceptable. M. L King, Jr's "I Have a Dream" is rejected here due to confirmed case law with no evidence of licensing compatible with GFDL and CC. Tagging {{PD-manifesto}} deprecated, I may want to start clearing the 100+ pages through WS:COPYVIO to discuss whether to keep them here (with compatible license), move to Wikilivres, or delete without move.--Jusjih (talk) 02:52, 14 November 2009 (UTC)
[edit] Mark as patrolled
I suggest that we make a change in the "Mark as patrolled" logic that will automatically "Mark as patrolled" when that edit has been reverted. I find a number of vandalism's that have been reverted but are still NOT "Mark as patrolled". Kind of a waste of patrolling when checking for un-patrolled edits if they have been reverted. Jeepday (talk) 01:00, 1 November 2009 (UTC)
- If they are REVERTED by an admin using ROLLBACK, they are marked as patrolled. If someone uses UNDO, then they stay as are. Maybe this is the one reason why Rollbacker permission is useful. -- billinghurst (talk) 04:25, 1 November 2009 (UTC)
[edit] New bot proposal
Please see older thread at #Bot extraction of text already in DjVu file. I'd like to set up a pywikipedia bot to go through the nonexistent pages at Index:Dramatic_Moments_in_American_Diplomacy_(1918).djvu and fill them in using the djvutext.py script. This is the only task I plan for the bot for now. --LarryGilbert (talk) 19:48, 1 November 2009 (UTC)
- I have now done a brief trial run of the bot and had it add four pages. I confess I overlooked the guideline to throttle the updates back to 1 per minute, but I hope the restricted length of the run excuses that. I will be more careful with that in the future. Please let me know here or on my talk page if there are any problems. --LarryGilbert (talk) 21:58, 4 November 2009 (UTC)
- Looks fine. Can you test it to see what it does if it is faced with the same pages again and that it doesn't hang or get confusled.-- billinghurst (talk) 22:38, 4 November 2009 (UTC)
- I did another run using the same range of pages plus one. As expected, the bot skipped over the pages where text already existed, adding only the single new page. On this run, I neglected to run login.py first to have the bot log in to its account. I was asked for a password when I started up the bot, so I expected that it would log in at that point, but apparently it didn't (see history for Page:Dramatic Moments in American Diplomacy (1918).djvu/121). I take this to be a bug in the pywikipedia code, so I will have to take care to run login.py before any run of the bot. --LarryGilbert (talk) 01:58, 5 November 2009 (UTC)
- If the bot was posting identical text over the top, MediaWiki would have treated it as a null edit, and it would look like no edit was made. You should apply some fixes to one of those pages, then run the bot over it again, and see if the bot reverts those fixes. Hesperian 03:02, 5 November 2009 (UTC)
- I proofed Page:Dramatic Moments in American Diplomacy (1918).djvu/117 and ran the bot again with the same range of pages as the last run (the page I edited was the first in that range). It recognized that all five pages had text and skipped over them. --LarryGilbert (talk) 05:23, 5 November 2009 (UTC)
- If the bot was posting identical text over the top, MediaWiki would have treated it as a null edit, and it would look like no edit was made. You should apply some fixes to one of those pages, then run the bot over it again, and see if the bot reverts those fixes. Hesperian 03:02, 5 November 2009 (UTC)
- I did another run using the same range of pages plus one. As expected, the bot skipped over the pages where text already existed, adding only the single new page. On this run, I neglected to run login.py first to have the bot log in to its account. I was asked for a password when I started up the bot, so I expected that it would log in at that point, but apparently it didn't (see history for Page:Dramatic Moments in American Diplomacy (1918).djvu/121). I take this to be a bug in the pywikipedia code, so I will have to take care to run login.py before any run of the bot. --LarryGilbert (talk) 01:58, 5 November 2009 (UTC)
- Looks fine. Can you test it to see what it does if it is faced with the same pages again and that it doesn't hang or get confusled.-- billinghurst (talk) 22:38, 4 November 2009 (UTC)
- I ran djvutext.py as Schlep once more. I just had it add 4 more pages. All looks well from here. --LarryGilbert (talk) 18:53, 9 November 2009 (UTC)
I still don't understand why it is desirable to post uncorrected OCR, when the OCR is preloaded when you edit the page for the first time. Posting OCR doesn't help, and arguably it slows us down by giving us an extra link to click in order to edit the page. Hesperian 23:27, 4 November 2009 (UTC)
- In this case, there was uncorrected OCR text already existing for this book, no better than what is in the DjVu file; in fact, I suspect the original uploader just took the text straight out of the same DjVu file and uploaded it as one big long page (this at a time when the ProofreadPage extension didn't exist). Replacing the existing text with text divided across the Page namespace is effectively making an improvement. --LarryGilbert (talk) 00:25, 5 November 2009 (UTC)
-
-
- "I still don't understand why it is desirable to post uncorrected OCR, when the OCR is preloaded when you edit the page for the first time.", because that way we can set up a text using the PAGEs; even if it's not corrected. A badly-parsed text is better than no text at all, for the next 18 months until it gets proofread. Sherurcij Collaboration of the Week: Author:Khwaja Kamal-ud-Din. 00:38, 5 November 2009 (UTC)
- I agree with Sherurcij. A further benefit of having even the raw OCR text online is that it enables semi-automated proofing and correction of the OCR’ed text (as User:TarmstroBot is presently doing with the United States Statutes at Large), leaving less work for human editors when they do eventually turn their attention to the page. Tarmstro99 (talk) 01:34, 5 November 2009 (UTC)
- "I still don't understand why it is desirable to post uncorrected OCR, when the OCR is preloaded when you edit the page for the first time.", because that way we can set up a text using the PAGEs; even if it's not corrected. A badly-parsed text is better than no text at all, for the next 18 months until it gets proofread. Sherurcij Collaboration of the Week: Author:Khwaja Kamal-ud-Din. 00:38, 5 November 2009 (UTC)
-
After several runs of the bot, I'm feeling confident that it's doing what it's supposed to do. Could that account be given a bot flag, or is that still premature? --LarryGilbert (talk) 06:49, 12 November 2009 (UTC)
- I've granted its flag. There was no opposition and after almost two weeks.—Zhaladshar (Talk) 14:28, 12 November 2009 (UTC)
- Thank you! --LarryGilbert (talk) 01:22, 13 November 2009 (UTC)
[edit] Remote nomination of English texts at Canadian Wikilivres
In addition to my initiative to discuss the moving of speeches tagged as "PD-manifesto" to Canadian Wikilivres since 15 October 2009, I would also like to propose allowing users right here to remotely nominate deletion of English texts at Canadian Wikilivres because Wikilivres pages, including Wikilivres:Wikilivres:Possible_copyright_violations and Wikilivres:Wikilivres:Delete corresponding to our WS:COPYVIO and WS:DEL, forbid IP edits without logging in with usernames after excessive vandalism. Unless exception can be allowed there, I consider allowing remote control possibly needed to discuss if any English texts at Canadian Wikilivres should be deleted. Even if exception is granted there, I see 1,506 registered users, of which 10 (or 0.66%) have Sysops rights, with unknown number of "active" users, so I doubt if sufficient community exists. In case a work tagged PD-manifesto here is moved there but found without any permission to reproduce, remote control can discuss. Please discuss if we should start my proposed remote control as I also plans to propose this at Chinese Wikisource due to extremely limited Chinese users at Wikilivres.--Jusjih (talk) 03:57, 14 November 2009 (UTC)
- I'm uncomfortable with entangling the two, for legal reasons and because it makes things more complex. They run on different copyright laws and policy than we do and it would be confusing to handle both. Also, it would be help Wikilivres itself if they did this instead of depending on us; figure out how to get the sysops and get things done.--Prosfilaes (talk) 04:11, 14 November 2009 (UTC)
- If no consensus to do this here, I am willing to close this discussion, but I am waiting for Wikilivres response. For English pages, If anything hosted there was unacceptable here but is now acceptable here with evidence, where should the request to move the page in be posted? Perhaps there, but until any change, only if logged in with username. After all, I would like to ask any users interested in Wikilivres, especially administrators, to consider registering usernames there, before yours are already taken by someone else. User:Yann is the webmaster, bureaucrat, and checkuser there. Even if you are not interested in editing there, all nine administrators including myself speak English with different skill levels, then whether something should be deleted there can be decided with limited comments.--Jusjih (talk) 02:07, 15 November 2009 (UTC)
- Wikilivres:Wikilivres:Community_Portal/en#Allowing IPs to edit deletion requests? reports that exceptionally allowing IPs to edit its deletion request is not feasible, so once again, say you do not consider remote deletion nomination feasible here, please consider registering Wikilivres usernames before someone else takes yours.--Jusjih (talk) 04:32, 16 November 2009 (UTC)
- If no consensus to do this here, I am willing to close this discussion, but I am waiting for Wikilivres response. For English pages, If anything hosted there was unacceptable here but is now acceptable here with evidence, where should the request to move the page in be posted? Perhaps there, but until any change, only if logged in with username. After all, I would like to ask any users interested in Wikilivres, especially administrators, to consider registering usernames there, before yours are already taken by someone else. User:Yann is the webmaster, bureaucrat, and checkuser there. Even if you are not interested in editing there, all nine administrators including myself speak English with different skill levels, then whether something should be deleted there can be decided with limited comments.--Jusjih (talk) 02:07, 15 November 2009 (UTC)
[edit] Flagging obscure words, names, etc. for further research
I was working on the proofread of the month, and on a page from the Transactions of the Geological Society I found an interesting word: "contre." It's so interesting that I can't find a reference to it in the OED, even though it seems plainly to be in English usage here. It occurred to me that it might be nice to have a special template for flagging obscure words, phrases, names, or what have you; this would allow making them available in one spot for interested parties to see whether they can find more information on them, and if necessary add that information to one of the Wikimedia projects. Good idea or a waste of time? --LarryGilbert (talk) 19:17, 14 November 2009 (UTC)
- I cheat. I wikilink to wikt:, and if it doesn't exist, I create the request there, and then add in the page reference so that they can find it and utilise it. Then I run away very quickly.
-- billinghurst (talk) 23:09, 14 November 2009 (UTC)
[edit] Janitorial collaboration each month
Validation month is going excellently (and I say that as a reluctant WSer who hates pagescans with a passion), and it occurs to me we might want to do something similar in the future having a janitorial task in the site header each month, perhaps in December we could do pages that have {{numbers}} problems, in January we tackle {{refs}} problems, in February we focus on {{incomplete}} works and such. Again, to encourage participation, I think it helps when we have a list of the works that need fixing - and then strikethru them as they get done. Seems much more "rewarding" than simply removing the template from a work and knowing nobody will ever "see" your progress. Thoughts? Do we have room for one more "Collaboration"? Sherurcij Collaboration of the Week: Author:Khwaja Kamal-ud-Din. 05:53, 28 November 2009 (UTC)
- After flicking through the history of pages at WhatLinksHere&target=Template%3ANumbers, the solution to this 'problem' would be to stop dumping uncorrected OCR into mainspace. Cygnis insignis (talk) 09:52, 28 November 2009 (UTC)
-
- Excellent idea, and the solution to the 'problem' of incomplete texts is to stop abandoning works halfway through adding them. However, that is exactly why we have clean-up templates, because it's better for WS to have a badly formatted work that contains page numbers, than to not contain the work at all...but best of all to have the work and have it properly formatted. Sherurcij Collaboration of the Week: Author:Thomas Carlyle. 15:33, 28 November 2009 (UTC)
-
- I am in favour of getting our house in order and improving our framework, though I do have a preference for pagescans as they seem the best means of robust validation. It was my intention to keep a list of documents (though smaller) needing validation and to add it to WS:PotM, and I was thinking of having it replace the previous list. Rather than a new venture, I think that they interchange well with(in) both CotW and PotM depending on what it is we are looking to fix. billinghurst (talk) 12:45, 28 November 2009 (UTC)
[edit] Other discussions
[edit] Video transcripts
Apparently, Commons is going to start doing transcripts on the audio of videos. It's a tech thing, so the feature is inevitable and who knows when anyone is going to start working out where to put the metadata and the rules on translations, much less working with us on it.--Prosfilaes (talk) 00:30, 10 November 2009 (UTC)
[edit] Questions
[edit] Index page without and with Table of Contents
index without TOC and with TOC, can we remove the blank column when adding a TOC? Or should we use a layout similar to fr:Livre:Bulletin_de_la_société_géologique_de_France_-_Tome_1.djvu (layout provided by MediaWiki:Proofreadpage index template) Phe (talk) 10:20, 7 October 2009 (UTC)
- I very much prefer the layout at fr:Livre:Bulletin de la société géologique de France - Tome 1.djvu, which wastes much less screen real estate. But I do not think it can be implemented here without further editing our own copy of MediaWiki:Proofreadpage index template, which I am personally nervous about doing. Tarmstro99 (talk) 14:15, 19 November 2009 (UTC)
[edit] Stale page link coloring on index page?
I recently finished proofreading Index:Scheme - An interpreter for extended lambda calculus.djvu, but a seemingly random selection of page links that used to be colored yellow have become red again. (Viewing the actual pages confirms that they are still proofread/yellow.)
I thought it might be a caching issue, but i have edited some of the pages in the mean time, and the problem seems to erratically persist. Does anyone know what's going on? --Piet Delport (talk) 00:24, 13 October 2009 (UTC)
- This is a known issue; all we can tell you about it is at User talk:ThomasV#Another problem?. Hesperian 00:40, 13 October 2009 (UTC)
-
- Ah, thank you for the pointer. I guess i'll wait for a fix; please let me know if i can assist in debugging in some way. --Piet Delport (talk) 01:43, 13 October 2009 (UTC)
-
-
- I do know that a null edit of individual pages fixes the issue, and for me doing it with AWB was easiest, though somewhat manual. There is also the sporadic nature of Related changes that come and go, and again are fixed with a null edit of the Index: page.-- billinghurst (talk) 04:56, 13 October 2009 (UTC)
-
This problem got worse, not better, over this past weekend. Today a large number of pages in Index:United States Statutes at Large Volume 1.djvu that are in fact validated (such as this one), and which were showing up as green links last Friday, have reverted to the red “not-proofread” background on the index page (which, in the case of the above-cited example, means that the index is now more than a year behind the actual current page status). Problems of this sort really need to be ironed out before, not after, the site launches a special campaign to get users involved in validating pages; it makes us look foolish otherwise. The solution can’t just be “make null edits to the pages to update the status,” because even that fails to make the new status “stick” in the index (and certainly new users would have no reason to know that they must check back in at random intervals to “re-validate” a page they had previously marked as validated). Tarmstro99 (talk) 14:24, 9 November 2009 (UTC)
- According to the Bug filed on Bugzilla, the problem is resolved. I would imagine in a couple days everything will go back to normal.—Zhaladshar (Talk) 15:07, 10 November 2009 (UTC)
- It hasn't stopped, and it is ... a waste of time. Can we have the link to the report on bugzilla please. Cygnis insignis (talk) 18:22, 14 November 2009 (UTC)
- Resolved in the coding, and now in SVN. Awaiting next code update to be implemented. As usual ThomasV has to wait for their window of opportunity for launch. billinghurst (talk) 23:03, 14 November 2009 (UTC)
- As discussed on my talk page, ThomasV and Phe indicated that this is due to a deeper change, which in this case it was due to our modifying some underlying templates. I am talking to AWB programmers to help me out so I can null edit the affected pages. (Takes time as it is not a std implementation, and in fact railed against for WP implementation of AWB) billinghurst (talk)
- Resolved in the coding, and now in SVN. Awaiting next code update to be implemented. As usual ThomasV has to wait for their window of opportunity for launch. billinghurst (talk) 23:03, 14 November 2009 (UTC)
- It hasn't stopped, and it is ... a waste of time. Can we have the link to the report on bugzilla please. Cygnis insignis (talk) 18:22, 14 November 2009 (UTC)
[edit] ye
Has anyone ever see the rendering of "ye" in the image caption of Page:Makers of British botany.djvu/83 before? The plate is a reproduction of a figure from a 1683 book. Presumably this is an archaic rendering? Does anyone know if it is possible to reproduce it in wikicode/html/css/whatever, or is my "ye" the best I can hope for? Hesperian 07:41, 21 October 2009 (UTC)
- Interested in the answer as I have seen similar overlay with the with the abbreviation for Number where the o was superscript and the period centred directly underneath.-- billinghurst (talk) 09:02, 21 October 2009 (UTC)
-
- Like "Nọ? Eclecticology - the offended (talk) 22:36, 21 October 2009 (UTC)
- w:Thorn (letter) covers the ground of these variants, but uses an image to reproduce this one with the e on top. Charles Matthews (talk) 09:51, 21 October 2009 (UTC)
-
- I just use a normal looking 'ye' because I haven't found a good alternative, and the positioning of the 'e' isn't really that important. But I wouldn't use an image for any portion of non-symbolic text because readers would not be able to copy and paste that portion of text. Special symbols could be images if needed. —Mike 02:25, 27 October 2009 (UTC)
[edit] Indexes and TOC
I completed formatting the indexes of Popular Science Monthly Volume 1, and in the process, I also created a page number ordered Table of Contents, although I have to edit the description column for the proper wording order. I placed it in Ineuw/TOC. Could someone let me know if it's of any use? Ineuw (talk) 02:38, 26 October 2009 (UTC)
- To note that
- I would suggest that you make them subpages of the volume, so to do that with relative links, add a '/' like [[/...
- one does not have to slavishly align page and document pagename with or header text, especially if it starts to make urls butt ugly. I try to maintain a balance between usability, reasonability and any future need to disambiguate within the nomenclature, e.g. I would not have linked the words (Illustrated); and instead of something like
[["The To-morrow of Death" (by Louis Figuier), Notice of]]may simply be sufficient to label as"[[/The To-morrow of Death /]]" (by Louis Figuier), Notice of. No need for the " " in the wikilinks unless absolutely necessary. Also note the '/' used at the end of the relative link to pipe the output differently. - Titles like "Sociology, the Study of" would be better as "The Study of Sociology", and use {{DEFAULTSORT:Sociology, the Study of}}
- billinghurst (talk) 05:17, 26 October 2009 (UTC)
-
- Many of these suggestions are matters of opinion. Indexing a work under "The" or "A" makes no sense at all. Who would look for these works there? In the short term, as long as these pages only exist individually as part of the broader publication the suggested the suggested method is reasonable, but eventually, when the articles get split out as separate works they would do better linking to the works themselves. Eclecticology - the offended (talk) 19:06, 26 October 2009 (UTC)
- Another view, stay flexible with 'naming rules' , but defer to the publisher's text; for example, abbreviating titles may be misleading. In billinghurst's example, someone searching for the "work" would find a single paragraph 'notice of'. I have to use exact match in searches when I'm looking for articles on the web, to sift out the similar titles, reviews, or citations.You may find a half title in the text, but otherwise put what you see and change it only if makes it more accessible. Users read pages, not urls, our title is insignificant unless it interferes with access. Get the full title as a redirect at least, there are millions of titles and it doesn't take long to find problems when you cutting corners. You don't need to put a 150 characters into the header template display, a subpage only needs to be distinguished from other parts of the work. As they say, redirects are cheap and default sorts can drop 'the' and 'A' for indexing (which are sometimes sensible titles).
What is meant by 'eventually split out'; some things may justify multiple transclusions of the same page: namespace, but this seems to imply the larger work would eventually be disassembled. Cygnis insignis (talk) 20:23, 26 October 2009 (UTC)
- Flexibility, by all means! - beginning with the meaning of "a work". The second article in the first volume of Popular Science is "The Recent Eclipse of the Sun" by R. A. Proctor, Author:Richard Anthony Proctor. It was not an original publication, but was copied from an unspecified issue of Cassell's Magazine where it may or may not have had the same title. Later in the same volume his "A Giant Planet" was taken from Cornhill Magazine. The articles here are what would be the works, and not the magazine. Splitting out would put emphasis on Proctor's work rather than on the magazine. On the other hand, the "Notes" page may be better treated as a sub-element of the magazine. Certain "Review" publications such as Quarterly Review don't really give titles to the articles, and one long article frequently reviews a number of other publications together. Many 17th and 18th century works have titles that amount to whole paragraphs; many of these should likely be reduced. Displacing a leading "a" or "the" is a matter of established bibliographic conventions. Yes, there are exceptional cases where it would be important to retain the article in place, but not all searching should be electronic; we need to allow for browsing by humans. In a library the experience of browsing the shelves is qualitatively different from the experience of finding something in the card catalogue. Eclecticology - the offended (talk) 19:52, 27 October 2009 (UTC)
-
-
- The same thing can be created with a redirect. The publication is the work we are dealing with, so as we don't necessarily know any exact earlier source, or if it is an exact copy, I would propose that we keep it in situ with the published work, and create a link from the root level in the main namespace to the article. This also assists disambiguation. billinghurst (talk) 21:03, 27 October 2009 (UTC)
- I think the best way to handle works that have seen publication in multiple meta-works is like this. Hesperian 23:33, 27 October 2009 (UTC)
-
-
- Definitely agree with Hesperian on the matter. That {{versions}} should be used when we have more than one copy of an article, or where we know that more than one copy exists. And to do that, I do feel that we need to have the works as subpages to manage their locality and provenance. billinghurst (talk) 02:32, 28 October 2009 (UTC)
-
- That seems excessive. For every British novel we host, I can find distinct editions in British and American spelling. Quite a few of our major novels have abridgments and variant editions in the PD. Preemptively creating versions pages when we have but one version is a lot of work for little to no gain.--Prosfilaes (talk) 13:52, 28 October 2009 (UTC)
- Let me modify my statement to more reflect my brain thoughts {{versions}} should be used when we have more than one copy of an article, or where we know that more than one copy exists and need to disambiguate it.
- To illustrate the extent of the problem, I've provisionally added Popular Science/Volume 1#Table of Contents for the first number. For now we need only concern ourselves with material where copyright is not an problem. A significant number of the articles in that number first appeared elsewhere. In the nineteenth century it was very common for American publications to copy from European ones (especially British) without any permission or payment whatsoever, and often without credit. In theory, using the Wiki-is-not-paper principle, we could host them all, but it may not be practical to document the small variations that distinguish two or more versions. The risk is that we can too easily oversimplify the problems. For a play like Macbeth, how important is it to have an expurgated school-boy version that omits the scene with the porter? Eclecticology - the offended (talk) 00:56, 29 October 2009 (UTC)
- I don't see a problem. Posting of multiple versions will be mediated by user enthusiasm. We will have (if we don't already) users who see value in posting Thomas Bowdler's expurgated versions of Shakespeare alongside the originals. We will have (if we don't already) users who see value in posting different versions, with different provenances, of important manuscripts and speeches. We will have (if we don't already) users who see value in posting every single public domain version of the Bible. But I doubt we'll ever have users who are willing to waste their time posting and proofreading multiple versions of the same document, where the differences are trivial and entirely lacking in scholarly importance and interest. It is only in this last case that any problems arise. Why structure our work around the anticipation of a problem that has never happened and probably never will? Hesperian 02:34, 29 October 2009 (UTC)
- True enough, but in that last case we can't know if there are any differences until the two are compared, and that's as much work as proofreading. The problem is not as unlikely as you believe. It has come up in the comparison of the original Dictionary of National Biography and its first reprint. A large number of changes were made in the reprint. This was primarily to correct errors, but that often necessitated other changes to avoid having the corrections spill over onto another page. An added sentence required removing material elsewhere on that page. Probably a majority of articles did not require changes at all; what benefit is there to duplicating these? Eclecticology - the offended (talk) 08:24, 29 October 2009 (UTC)
- I use a text editor to compare documents, by proof-reading a few random pages or using the machine read text in the page scan to get an indication of whether it is worthwhile. If a user was interested in a particular entry in DNB, or an article in the above, they may get around to comparing the available editions. If another work gave special reference to these versions, for what they think is a notable reason, they can be linked in someway. This is perhaps less work than annotating the trivial to the substantial variants in the notes (or worse, the text), which may be subjective, unknown or incomplete. The versions can only increase the integrity of works projected for completion, it doesn't detract from them. Placing similar versions in their context is good for that browsing factor mentioned above, it is qualitatively different. That references given in works like DNB use these details surely indicates is not always trivial. This sort of thing is actually and potentially useful, or harmless. (On the subject of displacing or omitting the article in the title, I still reckon that is not always appropriate; the need to retain them is more than rare and it sometimes ungainly. It is good for bibliographies ... ) Cygnis insignis (talk) 21:09, 29 October 2009 (UTC)
- The most practical way to show changes, which are most often short, is right in the text. Thus
old textnew text. Previously, I was putting the new text in square brackets, but that led to some ambiguous situations when square brackets were already used in the original. Another point to be made about magazine titles is that many with a long history such as Popular Science or National Geographic dropped the article somewhere along the way. Eclecticology - the offended (talk) 10:04, 2 November 2009 (UTC)
- The most practical way to show changes, which are most often short, is right in the text. Thus
- I use a text editor to compare documents, by proof-reading a few random pages or using the machine read text in the page scan to get an indication of whether it is worthwhile. If a user was interested in a particular entry in DNB, or an article in the above, they may get around to comparing the available editions. If another work gave special reference to these versions, for what they think is a notable reason, they can be linked in someway. This is perhaps less work than annotating the trivial to the substantial variants in the notes (or worse, the text), which may be subjective, unknown or incomplete. The versions can only increase the integrity of works projected for completion, it doesn't detract from them. Placing similar versions in their context is good for that browsing factor mentioned above, it is qualitatively different. That references given in works like DNB use these details surely indicates is not always trivial. This sort of thing is actually and potentially useful, or harmless. (On the subject of displacing or omitting the article in the title, I still reckon that is not always appropriate; the need to retain them is more than rare and it sometimes ungainly. It is good for bibliographies ... ) Cygnis insignis (talk) 21:09, 29 October 2009 (UTC)
- True enough, but in that last case we can't know if there are any differences until the two are compared, and that's as much work as proofreading. The problem is not as unlikely as you believe. It has come up in the comparison of the original Dictionary of National Biography and its first reprint. A large number of changes were made in the reprint. This was primarily to correct errors, but that often necessitated other changes to avoid having the corrections spill over onto another page. An added sentence required removing material elsewhere on that page. Probably a majority of articles did not require changes at all; what benefit is there to duplicating these? Eclecticology - the offended (talk) 08:24, 29 October 2009 (UTC)
- I don't see a problem. Posting of multiple versions will be mediated by user enthusiasm. We will have (if we don't already) users who see value in posting Thomas Bowdler's expurgated versions of Shakespeare alongside the originals. We will have (if we don't already) users who see value in posting different versions, with different provenances, of important manuscripts and speeches. We will have (if we don't already) users who see value in posting every single public domain version of the Bible. But I doubt we'll ever have users who are willing to waste their time posting and proofreading multiple versions of the same document, where the differences are trivial and entirely lacking in scholarly importance and interest. It is only in this last case that any problems arise. Why structure our work around the anticipation of a problem that has never happened and probably never will? Hesperian 02:34, 29 October 2009 (UTC)
- To illustrate the extent of the problem, I've provisionally added Popular Science/Volume 1#Table of Contents for the first number. For now we need only concern ourselves with material where copyright is not an problem. A significant number of the articles in that number first appeared elsewhere. In the nineteenth century it was very common for American publications to copy from European ones (especially British) without any permission or payment whatsoever, and often without credit. In theory, using the Wiki-is-not-paper principle, we could host them all, but it may not be practical to document the small variations that distinguish two or more versions. The risk is that we can too easily oversimplify the problems. For a play like Macbeth, how important is it to have an expurgated school-boy version that omits the scene with the porter? Eclecticology - the offended (talk) 00:56, 29 October 2009 (UTC)
-
-
[edit] What is the go with {{Indent}}?
I see that Template:Indent is still on the books and used, though is noted as deprecated with people being pushed other ways. Not knowing the history of the deprecation, is anyone able to provide further information? Are we looking to better retire such templates? -- billinghurst (talk) 05:51, 26 October 2009 (UTC)
- A chequered history. It was invented to handle line staggering in poems. This functionality became redundant when the <poem> tag was invented. Nonetheless the template was retained because it had found a new and unanticipated use: preservation of paragraph indentation. For that purpose it was implemented very badly indeed, and so it was updated to support indentation via css styling. This latter functionality was called in a slightly different way, so that it was not necessary to remove the old, crappy implementation in order to bring on board the new implementation. Keeping the old functionality meant that it would still be possible to view old versions of pages that use the old version of the template. Thus we ended up with a template that does two different things; or rather, the same thing in two different ways. Meanwhile, I became irritated that {{indent}} required the indentation size to be the second parameter rather than the first; i.e. {{indent|Blah blah blah|1em}}<nowiki> I felt that <nowiki>{{indent|1em|Blah blah blah}} was much more intuitive, especially when wrapping large blocks of text; plus the latter was in keeping with {{hanging indent}} which I had created to use the latter calling order. Therefore I forked {{indent}} to {{text-indent}}, with the latter using the better calling order, and which I recommend as a replacement. Hesperian 07:10, 26 October 2009 (UTC)
-
- As there is no further comment, I propose that all the works using {{indent}} get converted to {{text-indent}} and
is sent to the great template retirement home in the sky. billinghurst (talk) 01:02, 4 November 2009 (UTC)
- As there is no further comment, I propose that all the works using {{indent}} get converted to {{text-indent}} and
-
-
- I support this, but (in case he doesn't turn up to say so himself) I think JV does not. He has a point: once you delete a template, old revisions that use that template don't render properly any more. That argument doesn't carry much weight with me, because that line of thinking tell us that every time we modify a template we are falsifying the history of every article that has ever used it; but that has never stopped us improving our templates. Nonetheless that point of view exists, so I thought I should report it. Hesperian 06:18, 21 November 2009 (UTC)
-
[edit] IRC channel?
Do we have a dedicated IRC channel? Ineuw (talk) 17:53, 27 October 2009 (UTC)
- We have the one IRC channel for WS, and it is not too crowded and a reasonable place to be. There is a link at the top of this page, and also at Special:RecentChanges. billinghurst (talk) 20:57, 27 October 2009 (UTC)
[edit] Portals
I'm trying to formulate (as are others) a good way to handle reference material that is naturally divided into "topics" texts (e.g. discrete encyclopedia articles or biographies). One thing that has come up is that the Portal namespace is not so used here; and so maybe it should be, to organise such topic-related material by topic. Before speculating too much about that, I thought I should check out what exists. Three kinds of things: Portal:French literature looks like a traditional portal; Portal:Australia redirects to Wikisource:Australia; and Portal:Speeches is a kind of preamble to Wikisource:Speeches. Evidently this is all very top-down, while the problem needing a solution for "topics" is that where there are several texts relating to the same topic, there is a need to present to the reader a choice or menu. Considered as a disambiguation issue, you disambiguate the titles somehow: but this leaves the real issue, which is that everything is now nice and distinct, but not more findable than before. If there is a dab page, how are people going to find it?
Anyway, the theory of portals is that they are about user-friendly ways in. There seems to be some tension with the Wikisource: namespace and its function(s). I'm really angling for some discussion of a way in which Microportal:Napoleon (say) could exist and perform more like an Author page, collecting various internal and other links, without having to be a page about an author, and could coexist with Microportal:Duke of Wellington and other such material. All without prejudice to anything anybody else would wish to have in Portal space. Charles Matthews (talk) 09:41, 30 October 2009 (UTC)
- I 100% agree with you that the Wikisource: namespace should be for project management and other meta- issues, and we are misusing it by putting topic indices (i.e. the contents of Category:Wikisource index pages in there. I also agree that these are not portals. Methinks we need a Topic: namespace. Hesperian 11:37, 30 October 2009 (UTC)
-
- If we do. I would like to see whatever we do have a close link to other wikis so that there is an opportunity to x-wiki between Portals. That plays on the strength of wikis, and helps to align and drive traffic. billinghurst (talk) 11:48, 30 October 2009 (UTC)
-
-
- After a brief look around, it doesn't look as if many Wikisources use portals a lot. French Wikisource seems to have more than most, and organised in a fairly clear way. Others are similar to English Wikisource, including Portuguese Wikisource or Italian Wikisource, in that they have few portals which don't seem that coherent at the moment. - AdamBMorgan (talk) 12:33, 30 October 2009 (UTC)
-
-
-
-
- Portals are strange creatures, quite unlike our current Wikisource Index pages; they mash together templates of "Did you know...", "Recent additions..." and such. Not really something into which we should change our Wikisource:Mermaids, Wikisource:Wars, Wikisource:Islam or Wikisource:Education in my opinion. As ABM says, Portals aren't used a whole lot, especially in Wikisources - and the idea of having a Portal:Napoleon Bonaparte in addition to a Portal:Duke of Wellington seems to strain the limits of my vision for Wikisource. Works about a person go on that person's author page (even when, like some of our authors, they wrote no works themselves). To be honest, I'd be in favour of nixing "Portal" namespace altogether, and just adding "Index", and then we could have Index:Education and Index:Islam -- but that's probably a much larger decision than just en. can handle. Sherurcij Collaboration of the Week: Author:Khwaja Kamal-ud-Din. 13:02, 30 October 2009 (UTC)
-
-
-
-
-
-
- In this regard, I was thinking Portals at Wikipedia:, more like those listed at w:Portal:Contents/Portals billinghurst (talk) 13:03, 30 October 2009 (UTC)
-
-
-
It looks as if there is a strong preference for namespaces with clear functional boundaries (kind of content both in terms of how generated and for what purpose). Which is to be expected. Perhaps the point to make next is that a portal implies a page looking for relevant content, while what I'm really talking about is a bunch of topic texts looking for relevant indexation.
One schematic would be to note that we already have a category system, so that the way a Topic namespace might fit in would be in a layer cake: Portal pages, over categories dedicated to topic pages, over pages in Topic space itself. This provides an odd definition, in that Topic pages would be potential orphans except that they should be clearly categorised in relevant ways by a special part of the category system, and would exist to index texts of factual and reference value. And so they are really there because you can't annotate entries on a category page?
Perhaps there is an idea here trying to get out, that Topic pages should be by intention "atoms" designed to transclude into bigger "molecular" pages. That at least expresses the notion that while Portal pages are naturally top-down, there is some missing notion of bottom-up indexation driven by the fact that we may have five texts on the same topic and currently just juggle them. Charles Matthews (talk) 14:21, 30 October 2009 (UTC)
- There was some discussion of the problem with the Index: namespace at Wikisource:Scriptorium/Archives/2008-08#Standardize index linking. One underlying problem is that Wikisource undertook an idiosyncratic use of "Index" to organize the Page: namespaces. This is contrary to what most people would normally expect from indexes as a way of finding information within works. I would very much agree that some repurposing is in order here, and that much of the existing idiosyncrasies should be moved to the Page: namespace in the format "Page:Name of publication/Contents". This should not conflict with existing material in the Page: namespace since these are already distinguished by page numbers.
-
- From my limited understanding of the coding, there are specific differences in how Index: and Page: work, and that is handled by different templates and javascript. I think that we have to accept that they retain their own namespaces. Trying to move from Index: at this point does not represent a value proposition IMNSHO. This discussion would be better targeted at what we can reasonably and practicably, and what alternate terms exist for what we want.
-
-
- Perhaps so. If these must be separate namespaces, can the Index: namespace be renamed? But if traditional indexing needs to be in a new and different namespace this is not going to be the most important aspect of what we are addressing. Eclecticology - the offended (talk) 10:03, 3 November 2009 (UTC)
-
- The broader question here is to the general purposing of all namespaces and how they might best be utilized. I agree that the Wikisource namespace should be limited to broad issues of project management, and anything there that guides the reader through works or helps a person to find content should be moved elsewhere. The non-contributing reader is not likely to have any interest in project management issues.
- A word of definition about what we mean by "bottom-up" and "top-down". Categories are bottom-up because they start from individual articles and look to place those articles in a topical home. Portals or Wikipedia's "List of ..." pages are top-down because they begin from a topical home and look for content to live there. Both are valid approaches to organizing knowledge. Curently we have both author categories and author wikisource pages. The problem with the latter is its need for maintenance. Unless a person goes there to add a new author it doesn't happen. As a top-down scheme it could just as easily include red links to authors for whom we do not yet have pages.
- The Portal: namespace is indeed underutilized. In the short term it's use would be better restricted to major topic areas, but that could be expanded at a later stage. Well done as it may be, Portal:French literature may still be too specific for a top level portal; Portal:Literature would certainly be a better top level portal that guides us to the literatures of various languages and/or countries. I reserve opinion on a possible Topic: namespace; it may be a good idea, but where it goes depends on how we see it as related to the other namespaces. Eclecticology - the offended (talk) 18:09, 2 November 2009 (UTC)
-
- For me part of the discussion resides with searchability and findability. I think that many people never get out of the main namespace, and don't understand other aspects of namespaces and how they are used. We don't help. I am open for the discussion, it should be good. billinghurst (talk) 23:35, 2 November 2009 (UTC)
-
-
- To sharpen one of the main points being made, navigation is good, but need it be through the part of the site devoted to project management? This would seem to take you immediately backstage. And the demarcation of projects is not the same as any natural division by subject area. The granularity appropriate to such a division is what really needs to be addressed. If the Portal namespace is to remain the main tool, then the thinking on granularity has to be more flexible; if not, then we need need another namespace and spec. Charles Matthews (talk) 08:08, 3 November 2009 (UTC)
-
-
-
-
- First, let me make sure that we understand the word "granularity" in the same way. I read it as the ability to classify data into progressively more specific topics.
-
-
-
-
-
- I agree that [wiki]projects are not the same as topical divisions, but again let's define our terms clearly. There is an ambiguity to the term "project management" My first inclination was to refer to that term as relating to the overall management of Wikisource as a project in itself. That approach would result in the Wikisource: namespace being used for such things as policy development and operational issues. I was concerned that you could be referring to the management of wikiprojects such as those for DNB and EB. The management of wikiprojects could merit a portal in its own right.
-
-
-
-
-
- "Findability" and "flexibility" could be important features of a portal based systems. "Searchability" is primarily a machine function, and may be more related to software development than portals. I think that mediawiki search functions are seriously underdeveloped for such large databases as are found in Wikimedia. Solving that problem is well out of the scope if what we can do.
-
-
-
-
-
- Rather than calling portals a "main tool" I would prefer to see them as top-level classifications. If one or more new namespaces need to be developed, that's fine, but I would avoid acting prematurely on this. For any top down classification system it is important to consider what might be the optimum number of divisions at any stage in the process. Too few, and we haven't divided things at all; too many and readers get tired before they can get through the whole list. When someone is Google searching how does he set the default for the number of hits on a page, and how many Google pages does he look through before giving up? What are the hazards of search terms that are either too broad or too specific? The problems that we need to address are not too different from those in Google searches. Eclecticology - the offended (talk) 10:03, 3 November 2009 (UTC)
-
-
-
-
- Checking out Category:Categories, I think I see one end of the problem that needs discussion: none of the top-level categories within it addresses the idea that someone might come to Wikisource to research some given area of content. And being conscious that this all started from the idea that the same topic may be treated in the ED and DNB and other reference works, I feel that the comments before about indexation have merit: there are things needing to be indexed. Category:Portals is now a subcategory of Category:Wikisource, but it would be the work of a moment to change that. Would it be a good idea to place it as a subcategory of Category:Topical classification, as a psychological step in differentiating portals from project management, and suggesting that some work at least should be done within the category system on findability by topic? Charles Matthews (talk) 14:53, 3 November 2009 (UTC)
-
-
-
-
- That could be a positive step, or even a name that suggests that this is a temporary category for only those things that we want moved out of the Wikisource namespace, and into something more meaningful. The more I look into some of this the more convoluted the problem appears. In part there seems to be a great unwillingness to clean out old ad hoc concepts that never got very far, and were effectively abandoned when their creators got bored with Wikisource. The detailed discussion required before these can be cleaned up discourages any serious efforts. Take Portal:Influential Books as an example. It may have been a good idea when it started but it doesn't contain a lot of material and hasn't had a substantive edit since January 2007. Could it be deleted without a lot of drama?
-
-
-
-
-
- Of course, marking pages in the Wikisource namespace for moving, or efforts to delete abandoned ideas does not address the more positive question about what we want. I think we still need both an organized bottom-up category system, and a more or less parallel top-down system with portals at its top.
-
-
-
-
-
- We can argue that some system of indexation should be available to link the appearances of the same person in various encyclopedic works, but the theory is more powerful than the practice. Looking to see if the individual DNB biography has a counterpart in EB or some other encyclopedic work is not an intuitive process for the editor who happens to be focused on DNB at the moment. How is he to know which other encyclopedic works also contain biographical material about the same person? Eclecticology - the offended (talk) 01:06, 4 November 2009 (UTC)
-
-
-
-
- Concrete suggestion: a category like Category:Topical classification (don't mind the precise name) to be created, to house at least Category:Portals, to be an over-category of Category:Works by subject which obviously remains a subcategory of Category:Works too — and to have some other pages in it. Those other pages would be the pages in the notional Topic namespace, so defining what that namespace would be for ought to be the same debate as what further classification pages are required. It seems clear that there is a tendency for the Wikisource: namespace to act as a sort of Bermuda Triangle of projectspace, drawing in organisational material. So some work probably should go on, looking at which pages could be recategorised or moved around or split up in some way that would make their place clearer to those readers who are not active project members. Charles Matthews (talk) 11:26, 4 November 2009 (UTC)
- Sounds like the beginning of a plan.
- At this stage perhaps a more ephemeral sounding name like Category:WS pages to be moved would emphasize that this category will eventually no longer be needed. This will start rescuing material from the Bermuda Triangle.
- Unless there is a reasonable chance that the Index namespace could be repurposed, I would propose a "Topic" namespace. It would be a top down parallel to categories. It would include red links for things to be done, and provide additional information about its separate elements that cannot be included in the Categories. In the short term this could be a pseudonamespace until we can be sure that people will use it.
- I would also propose a "Project" namespace that could be used to marshal the various wikiprojects. Many of these are too easily forgotten once the initiator loses interest. By making these more visible we may be able to convince new people to work on completing these projects.
- The "Portal" namespace needs further development. It should operate to bring together topics, categories and projects related to top level concepts. There may be some differences of opinion about which concepts are top level. I am willing to put some effort on developing these, beginning with the most obvious. Eclecticology - the offended (talk) 08:29, 5 November 2009 (UTC)
- Sounds like the beginning of a plan.
- Concrete suggestion: a category like Category:Topical classification (don't mind the precise name) to be created, to house at least Category:Portals, to be an over-category of Category:Works by subject which obviously remains a subcategory of Category:Works too — and to have some other pages in it. Those other pages would be the pages in the notional Topic namespace, so defining what that namespace would be for ought to be the same debate as what further classification pages are required. It seems clear that there is a tendency for the Wikisource: namespace to act as a sort of Bermuda Triangle of projectspace, drawing in organisational material. So some work probably should go on, looking at which pages could be recategorised or moved around or split up in some way that would make their place clearer to those readers who are not active project members. Charles Matthews (talk) 11:26, 4 November 2009 (UTC)
-
I rather like the idea of moving all "Index" (DJVU) pages to the PAGE namespace, then moving the WS:Works pages to the Index: namespace. But this shouldn't be attempted by a single user anxious to see this issue concluded, we have thousands of backlinks that will need changing, etc. Sherurcij Collaboration of the Week: Author:Khwaja Kamal-ud-Din. 12:52, 4 November 2009 (UTC)
- While I support this in principle, I'm conscious of the possible technical difficulties raised by Billinghurst. If you think that this can work try moving the Index (DJVU) pages of one or two books to see what glitches arise. If they are too serious we may have to accept that it won't work. Eclecticology - the offended (talk) 08:29, 5 November 2009 (UTC)
- Just to get the desired end result straight, do you intend to end up with a hierarchy the looks something like this?:
-
- Portal:Religious texts
- a) Topic:Baha'ism, Topic:Buddhism, Topic:Christianity, Topic:Hinduism, Topic:Islam, Topic:Judaism, Topic:Wicca, etc.;
b) Project:Wiki Bible, Project:Hinduism etc. (from Wikisource:WikiProject Wiki Bible and Wikisource:WikiProject Hinduism);
c) Category:Religion and sub-categories - The actual texts (ie. The Bible, Qur'an etc.)
-
- (Using Religious texts from Wikisource:Works as an example because it is a convenient size.) - AdamBMorgan (talk) 12:37, 5 November 2009 (UTC)
- Just to get the desired end result straight, do you intend to end up with a hierarchy the looks something like this?:
-
-
- Essentially, yes. Still I would prefer the simpler, Portal:Religion. "Texts" is somewhat of a redundancy, since the whole of Wikisource is about texts. When editors need to link to something they are probably happier when it can be expressed in one word. I'll see if I can start developing the idea at Wikisource:Portal. (Probably more intuitive to have this named "Portals" in the plural.) That's the sort of page that should stay in the Wikisource namespace because it's about portals ... unless somebody can make a convincing argument that it belongs in the Help namespace. Eclecticology - the offended (talk) 17:41, 5 November 2009 (UTC)
-
[edit] Bot extraction of text already in DjVu file
I am having trouble finding an answer to this. There is a DjVu file Index:Dramatic Moments in American Diplomacy (1918).djvu that already has OCR'd text in it, but it hasn't all been extracted and put into the "Pages" namespace yet. Is there a bot that can do this while leaving the already-placed pages intact? (billinghurst, is this what you were trying to do with it before?) --LarryGilbert (talk) 18:40, 30 October 2009 (UTC)
- Yes, the djvutext.py script for the pywikipedia bot enables this sort of functionality, although I believe it would be necessary to supply the script with a list of pages to be uploaded manually to avoid clobbering those that are already there. At the moment my little bot is fully occupied autocorrecting the United States Statutes at Large scans, but I would be happy to queue this job up for it after it has completed its current run (unless somebody else does it for you first). Tarmstro99 (talk) 19:38, 30 October 2009 (UTC)
- I think I'd heard of pywikipedia somewhere, but I'd forgotten about it. This may be a good opportunity for me to learn it and put it through some paces. Thanks for the suggestion! --LarryGilbert (talk) 20:29, 30 October 2009 (UTC)
- There is no need for that now. The text layer is automatically imported if it exists when the page is created. This avoids creating 1000s of pages which won't be proofreaded before ages. See where I look? ;o) Yann (talk) 21:46, 30 October 2009 (UTC)
- In this case, though, the text is already on the site, just not linked to any scanned pages. The easiest way to connect the two seems to be to replace the existing text with what's in the DjVu file. --LarryGilbert (talk) 00:27, 31 October 2009 (UTC)
-
-
- When I looked at that work, it was not proofread, nor sorted to chapters, it was an flat long file of not proofread. Given that, it was preferable to do the work in the Page: namespace where we can work with the text to do side by side proofreading.
-
-
-
- BTW I did ask JV about whether he was going to update his script and he said he would put it into the queue for 2011. :-/ billinghurst (talk) 03:27, 31 October 2009 (UTC)
-
[edit] Indexes and TOC for The Popular Science Monthly
Created several variations for a proposed page hierarchy for TPSM. Please look look here at my proposals. Which naming style is best suited for Wikisource? Your input is much appreciated. Ineuw (talk) 19:10, 30 October 2009 (UTC)
- Thanks for providing those mockups! Your proposal #4 (full name of publication (not abbreviated) / volume number / issues within volume / works listed within that issue) looks the most consistent with what I have seen done with other works here. Tarmstro99 (talk) 19:45, 30 October 2009 (UTC)
- I vote for number four as well. It most confirms to our standards. -Mattwj2002 (talk) 03:58, 31 October 2009 (UTC)
- Number 4 it is. I will complete the month of May first and let you know. Ineuw (talk) 19:31, 31 October 2009 (UTC)
- I disagree to the actual proposition, I'm unsure if people liking this idea noticed the Page: prefix before a title or took it as a generic name for main:, Ineuw proposition consists of moving the hierarchy for the TPSM to the Page: namespace, which is really a bad idea imho. Ineuw proposition #4 is the best but must be done in main:, not in Page:*. Phe (talk) 15:20, 1 November 2009 (UTC)
[edit] Page structure for The Popular Science Monthly
Phe, thanks for your input. I had some problems trying to define a page (not a Page:), as you have suggested. I will test it again and will post here. In the meanwhile, the current structure I am using is here. Ineuw (talk) 02:10, 2 November 2009 (UTC)
- I find the aim of this thread a little confusing. It's also important for the Table of Contents to show the article author, and the original source of the article when that is known. Eclecticology - the offended (talk) 10:09, 2 November 2009 (UTC)
[edit] Change of MediaWiki:Proofreadpage pagenum template
Hi, can an admin change this template to allow direct links to a page number ? It requires insertion of <span id="{{{num}}}"></span> between <span id="zzz" style="display:none;"></span> and the <nowiki>. This is used by <pages >, the old template:Page do this but not the new. Example of use : Transactions of the Geological Society, 1st_series, vol. 1/Index (to test it, change the template, do a dummy edit of the target page and try the link) Phe (talk) 17:54, 31 October 2009 (UTC)
Done billinghurst (talk) 22:58, 31 October 2009 (UTC)
- Thanks, for people interested in linking from a TOC, I created an helper template, {{TOC link}}. The intent is to to provide a way to link from a djvu page to a page inside the same book but when transcluded to link to a subpage of the same book, in the shortest possible way. Phe (talk) 15:08, 1 November 2009 (UTC)
- I have found that {{TOC link}} is for use when the main namespace TOC is used a subpage, so I have semi-forked the code for {{TOC mainlink}}. If there is someone clever who can work out how to amalgamate the two functions into the one template, then go for it. billinghurst (talk) 02:09, 22 November 2009 (UTC)
- Both template can be merged using something ala (simplified code) {{#ifexist: ../{{{2}}} | [[../{{{2}}}]] | [[/{{{2}}}]]}} but it exists probably a better way to do it, I don't see exactly how but I suspect we can get the needed information by a clever comparison of {{PAGENAME}} {{SUBPAGENAME}} and {{#titleparts}} without adding any parameter. One case that must be handled is book structure as book_name/Val_1/Chapter 1 where the TOC can be transcluded either in book_name or in book_name/Vol1 Phe (talk) 10:55, 23 November 2009 (UTC)
- I'd much prefer these templates were substituted. It is hard enough for newbs to make sense of this place without piling on esoterica like this. Hesperian 11:41, 23 November 2009 (UTC)
- They can't, their expansion depend on the transclusion context, and {{TOC link|13|Account of Guernsey, and the other channel island|1}} is a lot of easier to read than the alternate way [[<noinclude>Page:Transactions of the Geological Society, 1st series, vol. 1.djvu/49</noinclude><includeonly>../Account of Guernsey, and the other channel island</includeonly>|1]]. Anyway only simple template should be substed. Phe (talk) 11:58, 23 November 2009 (UTC)
- I'd much prefer these templates were substituted. It is hard enough for newbs to make sense of this place without piling on esoterica like this. Hesperian 11:41, 23 November 2009 (UTC)
- Both template can be merged using something ala (simplified code) {{#ifexist: ../{{{2}}} | [[../{{{2}}}]] | [[/{{{2}}}]]}} but it exists probably a better way to do it, I don't see exactly how but I suspect we can get the needed information by a clever comparison of {{PAGENAME}} {{SUBPAGENAME}} and {{#titleparts}} without adding any parameter. One case that must be handled is book structure as book_name/Val_1/Chapter 1 where the TOC can be transcluded either in book_name or in book_name/Vol1 Phe (talk) 10:55, 23 November 2009 (UTC)
- I have found that {{TOC link}} is for use when the main namespace TOC is used a subpage, so I have semi-forked the code for {{TOC mainlink}}. If there is someone clever who can work out how to amalgamate the two functions into the one template, then go for it. billinghurst (talk) 02:09, 22 November 2009 (UTC)
- Thanks, for people interested in linking from a TOC, I created an helper template, {{TOC link}}. The intent is to to provide a way to link from a djvu page to a page inside the same book but when transcluded to link to a subpage of the same book, in the shortest possible way. Phe (talk) 15:08, 1 November 2009 (UTC)
[edit] Template help is sought
I need a template to raise the caps of words like THE, and I created a copy of the dropcap named raisecap. Raising the first character works OK, but it inserts a line break immediately afterwards. Don't know enough about HTML to find the problem. Would someone please look at it? Thanks. Ineuw (talk) 19:47, 31 October 2009 (UTC)
- Hi, see the edit comment [2] Phe (talk) 20:28, 31 October 2009 (UTC)
- We do already have Template:Largeinitial. That doesn't do the job? billinghurst (talk) 22:54, 31 October 2009 (UTC)
- Thanks to all. Template:Largeinitial is fine. Phe, thanks for the correction. — Ineuw (talk) 01:18, 1 November 2009 (UTC)
[edit] Proper placement of copyright tags/boxes
Is there a guideline or best practice for placing a copyright box? I'm afraid I've searched for an answer without any luck. Does it belong on the main page of a text or on its discussion page? Near the top or the bottom? Anything else? --LarryGilbert (talk) 02:52, 4 November 2009 (UTC)
- I stick it on the front page of the work and at the bottom. That is what I saw done, and I want I now do. So there will be quite a few done that way. :-) billinghurst (talk) 03:06, 4 November 2009 (UTC)
- Same as Billinghurst; it's an important part of the metadata we need to present, so it has to be on the main page, but the big box doesn't really need to be at the top of the page. I wouldn't mind putting it in normal size type in header; "This is a US work that wasn't renewed {tiny image to link to full text}", "This was published before 1923 {tiny link}", "and this was published by an author who died 57 years ago", etc. But the current big boxes should go to the bottom.--Prosfilaes (talk) 15:03, 4 November 2009 (UTC)
- For once I generally agree with you. Whatever the notice says the boxes are butt-ugly. Many sites place their copyright notice discretely at the bottom without increasing or decreasing whatever rights they have. The important thing is that the information is available. Eclecticology - the offended (talk) 19:03, 6 November 2009 (UTC)
[edit] Problem with interface
In the Page namespace I am having trouble using the drop-down controls. I can navigate every symbol except ^ . When I move my cursor down the list, the whole list disappears at "image" when the pop-up image appears to the left. The list also disappear if I move my cursor off of the list and attempt to get at ^ sideways. Does any one else have this issues or know of a fix?--BirgitteSB 19:18, 6 November 2009 (UTC)
- I am not quite sure to what you are referring. Are you referring to Mediawiki:Edittools that is appended to the bottom of all pages when in edit mode? Is it that you want the circumflex in symbols, or that you cannot navigate to the Circumflexes drop down mode? Or something different? billinghurst (talk) 13:23, 7 November 2009 (UTC)
- No I am talking about the navigation for Page:Foo to Index:Foo. There is a tab witch is a drop-down list and and the last item is listed as ^ which signifies the navigation to the associated index page.--BirgitteSB 01:02, 9 November 2009 (UTC)
- Which skin are you using? I don't have those options in my monobook version, I just have the tabs across the top and have a tab at the end with the ^. No dropdown boxes to be seen. billinghurst (talk) 02:04, 9 November 2009 (UTC)
- No I am talking about the navigation for Page:Foo to Index:Foo. There is a tab witch is a drop-down list and and the last item is listed as ^ which signifies the navigation to the associated index page.--BirgitteSB 01:02, 9 November 2009 (UTC)
[edit] Understanding <section> tags and their use with <pages> tags
I'm adding a book by Michael Faraday and I'm having trouble understanding the proper use of the <section> and <pages> tags.
Let's take the first "chapter" from the book. It starts on page 1 and finishes a short way down page 5. I thought it would be enough to put <section begin="chapter1"/> at the start (p. 1) and <section end="chapter1"/> at the end (p. 5), then use something like this to put the whole article on its own page:
<pages index="Experimental_researches_in_chemistry_and.djvu" from=16 to=20
fromsection="chapter1" tosection="chapter1"/>
But this does not work, so either I'm misunderstanding <pages> or I'm expecting too much from it, I fear. By experimenting with including or omitting the "tosection" parameter, the only thing I can change is whether all of page 5 is included or not. I can't get just the relevant portion to be included and leave the rest out.
I imagine I could work around this by using multiple {{Page}} templates, but I thought that was the sort of thing the <pages> tag was designed to avoid.
--LarryGilbert (talk) 00:11, 9 November 2009 (UTC)
- Off the top of my head, I suggest you try it without the quotes aroudn the section titles. I have a vague memory of having the same problem myself a while back. Hesperian 00:27, 9 November 2009 (UTC)
- On re-reading, I see the problem. Section tags are used to demarcate an individual page into sections; it doesn't work across pages. What you want to transclude is:
- Just the chapter one part of page 1
- all of page 2
- all of page 3
- all of page 4
- just the chapter one part of page 5
- In order to do the first of these, you need to define the chapter one part of page 1; to do that you wrap the chapter one part of page 1 in <section begin="chapter1"/> and <section end="chapter1"/>. Then you do the same thing on page 5. That should get you what you want. Hesperian 00:32, 9 November 2009 (UTC)
- That's exactly what was needed. Thanks! --LarryGilbert (talk) 00:39, 9 November 2009 (UTC)
[edit] Fundraiser banner
At w: there's an option in My preferences|Gadgets to "Suppress display of the fundraiser banner". Can this be given here also? Moondyne (talk) 06:54, 11 November 2009 (UTC)
[edit] Odd problem with transclusion of footnote
ProofreadPage gurus, any hints on why the content of the footnote at the bottom of this page gets mangled when it gets transcluded to here? It looks like some boldfacing is added that isn’t present in the original, and one of the internal wikilinks vanishes entirely. Tarmstro99 (talk) 16:26, 12 November 2009 (UTC)
- It's because the links in the Footnote point to the same page that it's being transcluded in. For example, I'm linking to the Scriptorium: Wikisource:Scriptorium (notice how it's in bold, but the source is typical wiki-markup). If you want to to be fixed, the only way I can think of is to remove the links altogether, because all you're doing is linking a page to itself.—Zhaladshar (Talk) 16:33, 12 November 2009 (UTC)
- Aha! Thanks. Tarmstro99 (talk) 18:34, 12 November 2009 (UTC)
[edit] DJVU Viewer App Available for the iPod Touch/iPhone Now
I just wanted to let everyone that there is a DJVU viewer app available for the iPod touch/iPhone now. The app is called Heru. It cost $1.99 usd. You have to manually copy and paste the URL of the file to download. I just thought you guys would be interested since we use DJVU so much with Wikisource. This makes the iPod Touch and iPhone a nice eBook reader. :) Anyways, what do you guys think? I just thought I would spread the word. --Mattwj2002 (talk) 06:02, 13 November 2009 (UTC)
[edit] Batch-moving pages
I had Commons rename File:1960 FBI primer on Nation of Islam.djvu to File:1965 FBI monograph on Nation of Islam.djvu due to the incorrect year in the file name itself. Now I am in the middle of moving all of the pages formerly under Index:1960 FBI primer on Nation of Islam.djvu to Index:1965 FBI monograph on Nation of Islam.djvu--obviously a tedious task. It occurred to me that there must be an existing tool I can use to do a "batch" move like this without having to fire up a bot to do the task. Does anyone know of such a thing, or should I just dust off my own bot to do the work? --LarryGilbert (talk) 18:50, 14 November 2009 (UTC)
- I will have a go and see how it works out. billinghurst (talk) 23:26, 14 November 2009 (UTC)
Done was manually with the bot application, but not horrific. billinghurst (talk) 23:50, 14 November 2009 (UTC)
[edit] Formating when proofreading
Hi, I would like to ask if it is necessary to take car of text format when proofreading? Look at this example: [3]. I was trying to figure out how to format the text to look simmilar to the text from scan, but I have found no way.--Juan de Vojníkov (talk) 08:48, 17 November 2009 (UTC)
- You could either use {{indent}} or the <poem>...</poem> tags. I've edited the page using the latter (and validated the page). The text between the poem flags is formated as it is presented, leading spaces and all.- AdamBMorgan (talk) 17:34, 17 November 2009 (UTC)
[edit] Is there any easy way to upload external OCR?
I'm looking at Index:A grammar of the Bohemian or Cech language.djvu which has Tessaract OCR. But one of the most tedious things for some of us is entering all those Czech letters. I could take the images and dump them through my local copy of ABBYY Finereader, which will handle the Czech OCR just fine, but I have no way to quickly upload the OCRed pages.--Prosfilaes (talk) 23:17, 18 November 2009 (UTC)
- If you e-mail the (good) OCR output to me as a set of zipped text files, I can have User:TarmstroBot replace the existing text with the amended versions en masse. (I assume you do not wish to replace any pages that have already progressed to Proofread or Validated status.) Tarmstro99 (talk) 02:25, 19 November 2009 (UTC)
complete. Tarmstro99 (talk) 19:53, 20 November 2009 (UTC)
[edit] Something weird happening
This morning when I click on the links to scans button I see this:
You are looking at the HTML representation of the XML format. HTML is good for debugging, but probably is not suitable for your application. See complete documentation, or API help for more information. <?xml version="1.0"?> <api />
Any ideas? Is it just me? Moondyne (talk) 01:16, 19 November 2009 (UTC)
- The similar links to scanned pages gives can’t run the script “displayOptionText("links to scanned pages","display:none;"” blah blah, ... Cygnis insignis (talk) 02:19, 19 November 2009 (UTC)
{{Option}} is doing it too, so it is clearly a problem with handling of the "OptionText" class—the mechanism by which it is made possible to turn the page links on and off. No-one has edited Mediawiki:Common.js recently, nor even Mediawiki:Common.css, so it would seem to be a backend issue.
There has been talk at Template talk:Page#Missing page numbers? about how the disappearance of page numbers only causes confusion, and to no advantage, and therefore the "links to scanned pages" display option would be better off removed. Since there was no opposition there after quite a while, and since this functionality is broken at present anyhow, it seems like there is no better time than the present to implement it. Done.[4][5] Hesperian 03:26, 19 November 2009 (UTC)
See also Wikipedia:Wikipedia:Village pump (technical)#Scripts are down. Hesperian 03:31, 19 November 2009 (UTC)
[edit] Unvalidating a Page
Can someone please undo my edit to Page:Fair Circumvention.djvu/1? I read and validated the page but this blanked it for some reason. I can't undo my own edit because that would involve changing the proof reading status of the page. Thanks, AdamBMorgan (talk) 20:31, 19 November 2009 (UTC)
- done Sherurcij Collaboration of the Week: Author:Khwaja Kamal-ud-Din. 20:52, 19 November 2009 (UTC)
[edit] Question about Corriegenda and Addenda
I have been working on proofreading A Dictionary of Music and Musicians, volume 1. Today I transcluded the Beethoven article to mainspace. However, in volume 4 of the dictionary there is an Appendix which contains Addenda and Corriegenda (for the Beethoven article this is on page 533 through to 542). Should the corrections be made to the original article? Should the addenda be transcluded to the end of the original article? Or should they be transcluded separately? Beeswaxcandle (talk) 09:06, 20 November 2009 (UTC)
- I like to maintain the errors, but make a note of anything corrected in the Errata; e.g. see An introduction to physiological and systematical botany/Chapter 10. Hesperian 11:14, 20 November 2009 (UTC)
[edit] Ancient Greek?
Hi, I think I have completed A Highland Poem... which is part of the collaboration of the week. Its the first book Ive done, so I'm not sure if I can do anything more. Specific help is required with one of the poems as it has a non A-Z title ... I think it is ancient Greek (its rendered OK in the DjVU version). Anyone good at converting this to HTML? At present I have included it as "???" and its halfway down the table of contents. Thanks Victuallers (talk) 17:22, 21 November 2009 (UTC)
There are a lot of poems there; which one is it in? Angr 17:27, 21 November 2009 (UTC)- Never mind, I found it. I've moved the page to τέτλαθι δὴ κραδίη. Angr 17:34, 21 November 2009 (UTC)
- I also corrected Miserrere to Miserere. Angr 17:40, 21 November 2009 (UTC)
- Thanks, for both corrections, Angr Victuallers (talk) 20:03, 21 November 2009 (UTC)
[edit] New texts
Who are the Ausralians?
-
- I think that they are related to the Ancient Geeks ;-)
Does "New texts" have an age requirement? (Maybe we can rate it "R".)
An inquiring mind 05:07, 25 November 2009 (UTC) (ResScholar)
-
- Thanks, fixed. Hesperian 05:18, 25 November 2009 (UTC)
- Lest I be accused of insensitivity to Alcee Hastings' indignant complaints as entitled with one of Cirt's typographical errors, I would direct other inquiring minds to w:Keith Windschuttle for example. ResScholar (talk) 05:36, 25 November 2009 (UTC)
[edit] Wikimedia Strategy
Hi, good people of Wikisource.
I've come over from strategy.wikimedia.org. We're interested to know two things about how you work here on Wikisource.
First, do you have any competitions? On en:wp there are quite a few different competitions that seem to help motivate editors to do good work and more of it.
Here's an example:
http://en.wikipedia.org/wiki/Wikipedia:CUP
More can be found at:
http://strategy.wikimedia.org/wiki/Editor_awards_and_rewards#Contests
Does Wikisource run anything like that?
Also on en:wp there are a number of WikiProjects which help editors to bond as smaller communities within the larger one.
Here's an example:
http://en.wikipedia.org/wiki/Wikipedia:WikiProject_Military_history
More can be found at:
http://en.wikipedia.org/wiki/Wikipedia:WIKIPROJECT
Can you point to any sort of sub-communities within Wikisource which help editors bond as a smaller group within the project as a whole?
Answers to these questions will be valuable to us as we work on Wikimedia Strategy. I will be grateful for any information you can provide. --Bodnotbod (talk) 18:18, 25 November 2009 (UTC)
- I've located these groups:
-
- Wikisource:Collaboration of the Week
- Main page highlights new texts and Feature text.
- Wikisource:Proofread of the Month and recently every participant gets recognition for participating each month, eg. User:Mattwj2002 no matter how large, or how small. This month we had 8 newbies participate on the project, and it is a great way to learn the ropes.
-
- Noting that we are a smaller community, so we are able to more closely manage matters wholistically with a smaller group of volunteers/admins, we are at that point of close working relationship, and assisting editors in their parts or in the whole. It seems to be working well as disputes are not prevalent here, and newbies seem to feel welcomed, and work progresses. At this time we are more collegiate than competitive, even across the languages of WS there is a level of cooperation.
-
- Pretty easy ... We have our customised {{welcome}} aiming to get all our new users. And that covers general welcome, our special projects, and our Wikisource:WikiProjects. Apart from the customised welcome, because we have auto-patrolling, each visitor is essentially assisted to our methods & means; given guidance and assistance in joining our projects or assistance with their works of interest to our standardised format.
-
- We have been working quite productively on Wikisource:WikiProject DNB in recent times, making progressive jumps and getting quite productive in getting more source and crosslinks with information to Wikipedia. billinghurst (talk) 02:09, 26 November 2009 (UTC)
- Wikisource:WikiProject DNB is an excellent example of collaboration project here. I'll add it to our list. The people working on the strategic planning wiki and focusing on the community health of wikis are brainstorming about the different ways that the the local wikis and the Foundation might be more helpful in making collaborative projects work better. My general observation is that the success of most of these projects (on all wikis) is driven by the people that start them. And too often if the people organizing them get busy or loss interest then the project stop. It would great if we could find a way to get enough momentum behind them that they don't cease to exist. FloNight (talk) 08:26, 26 November 2009 (UTC)
- The DNB project here is "twinned" in a cross-wiki way with the one on enWP, w:Wikipedia:WikiProject Missing encyclopedic articles/DNB. This was quite deliberately done (the part on WP is technically a subproject of a bigger WikiProject, but still). I'm not aware of other examples of twin projects of this kind, but it would be interesting to know more. Charles Matthews (talk) 08:43, 26 November 2009 (UTC)
- Wikisource:WikiProject DNB is an excellent example of collaboration project here. I'll add it to our list. The people working on the strategic planning wiki and focusing on the community health of wikis are brainstorming about the different ways that the the local wikis and the Foundation might be more helpful in making collaborative projects work better. My general observation is that the success of most of these projects (on all wikis) is driven by the people that start them. And too often if the people organizing them get busy or loss interest then the project stop. It would great if we could find a way to get enough momentum behind them that they don't cease to exist. FloNight (talk) 08:26, 26 November 2009 (UTC)
- We have been working quite productively on Wikisource:WikiProject DNB in recent times, making progressive jumps and getting quite productive in getting more source and crosslinks with information to Wikipedia. billinghurst (talk) 02:09, 26 November 2009 (UTC)
[outdent]
- sometimes it is about branding; sometimes it is about critical mass. I like the work of biographical works as they are directly related to the crux of putting referenced works onto WP. Similarly, the pairing of other works where the original content is at WS and the encyclopædic article is at WP. There is the scope for a Poetry project. Similarly some of the history projects could do well to consider housing relevant texts at WS, or vice versa we let specific projects know that a work exists, eg. the PotM work Notes on the churches in the counties of Kent, Sussex, and Surrey is relevant to the projects on those three counties of the UK. Though often requires good means to provide ready references (special templates per work) from WP to WS. billinghurst (talk) 10:49, 26 November 2009 (UTC)
Golly, it looks like we're going to end up presenting ourselves as a community rich in structured collaborations, where in fact the opposite is true. My personal view is that our low community overhead is an ingredient in our success. We're not running around tagging talk pages into projects, arguing best practice across a range of style manuals, running competitions and dishing out barnstars left right and centre. We're just getting on with the job. Maybe the strategy dudes need to be hearing this, rather than an enumeration of the occasional collaborative effort. Hesperian 11:14, 26 November 2009 (UTC)
- Data shows that Wikimedia wikis, including Wikisource, are not retaining editors as well as they could. See data. The ideas is to identify what motivates people to contribute to wikis so that we can do a better job of keeping people interested in editing. Research studies show that positive recognition of contributions of volunteers is a way to retain volunteers. Or course people that don't want to participate in Wikiprojects don't need to get involved in them. But there is some indication that these Projects do encourage content creation and give satisfaction to people involved with them. For the most part, instead of inventing loads of new projects or processes under the Strategic Planning process, I think we should support and expand the existing ones. That is the reason that I want to collect the information about the current projects on the various wiki. FloNight (talk) 12:29, 26 November 2009 (UTC)
-
- I thank Hesperian for his comment, we need to hear precisely what people are thinking; any input is marvellous. To address points raised or implied by Hesperian: the primary aim of any sort of contest must be that it improves the content of the project. If a contest isn't doing that it is cruft. Some contests on en:wp have been very good at increasing the quality of articles and encouraging people to provide great pictures. Essentially, where something works well we want to think how we can support that. But if a project says "we don't run contests because x, y and z" then that's all taken to heart too.
-
- In terms of creating sub-communities, some projects have quite a small population as it is. They likely don't need additional ways of bringing people together because the primary editors are already well aware of each other. On Wikibooks the fact that each book has a tendency to be a community of its own likely means that sub-communities occur naturally there. It is more of an issue in en:wp where there is an increasing atmosphere of hostility; newbies are becoming increasingly alienated by harsh treatment when they make their first edits. On a smaller project people may be treated much more kindly. So it is worth considering if introducing sub-communities to en:wp might engender a bit more kindness and give newbies a way in where they will be given a helping hand. Wikisource may not require anything like that and that's what I'm here to find out. --Bodnotbod (talk) 14:18, 26 November 2009 (UTC)
-
- I'm not sure I interpret that graph the same way, Flonight. For most of the projects there, including Wikisource, they're keeping the number of editors about stable, which is what we'd expect in the long run. But as for Hesperian's comment, part of what causes me to take time from Distributed Proofreaders for here is that fact that I can do things so much quicker and flexibly here; I can type in Alice's Adventures in Cambridge, without fighting about page scans and images, and it will get done sooner than through DP. But the dark mirror of that is that projects that are created in DP are inexorably drawn towards a complete, well-proofread end, whereas here, there's too many projects that are started and sit in limbo indefinitely. At least personally, any project that encourages the completion of projects, especially validation of them, would make me feel more happy about spending my time here.--Prosfilaes (talk) 15:09, 26 November 2009 (UTC)
-
-
- And to put it terms of community, we do a lot of individual projects; more group validation and the like, and more emphasis on it, would help reduce the feeling like everything was one-man projects.--Prosfilaes (talk) 15:14, 26 November 2009 (UTC)
-
-
-
-
- What Hesperian said! & What Prosfilaes said! Very nice! They are both right, and that is very much how are little patch is successful, and pretty much without the bickering. Common goals, no particular savage ego, though our subject matter lends less to content disputes. Noting also why we had the suggestion for November WS:PotM, and in some ways it success.
-
-
-
-
-
- One reason recently for our combined efforts is work of ThomasV on his valuable Proofread Page extension that has allowed us to add further rigour to our work, and is very much a joint tool, and one that is very supportive of how we can do our work to a consistent quality and cooperatively by allowing us to move in a direction and to lift our standards, do note however that getting such extensions into place in the Mediawiki upgrade regime is becoming problematic. Being able to look to how there is a smoother and more time efficient process to get valuable extensions in place through the rigamarole would be sweet!
-
-
-
-
-
- For our project it often varies on why people are here
- Some to produce a quality product of the work as a meritable work that deserves sharing
- Some to exhibit the opus of works of a specific person
- Some to bring an old text out for modern reading
- Some as source material for other places/projects, especially WP.
- For our project it often varies on why people are here
-
-
-
-
- All can lend themselves to projects, though the character of each project relates to the commonality of the goals. Each will have its own quirks and each has a different level of requirement for support.
-
-
-
- The issue about projects is that they need administrative support, and at WS, many of us want to get on with the transcribing and reproducing the work, not sit back with the less glamorous (boring?) components. So some of the thing that would be helpful for us are (quick brain dump) into the start of a wishlist. billinghurst (talk) 21:12, 26 November 2009 (UTC)
-
[edit] Wishlist
- Getting our help and support pages into a applicable framework
- Assistance in standardisation of some of our templates
- A search engine that gives us better results, and part of this may also be assistance with good meta data management
- Finding some vols who love the process of organisation
- A means outside of enWS to exhibit some of our projects for wider consumption
[edit] Copyright notices for Gutenberg?
Can anyone tell me what copyright notices are required for a Gutenberg text posted in Wikisource? In particular, I began to insert de Maupassant's novel [[Bel Ami] Ineuw (talk) 19:06, 26 November 2009 (UTC)
- That it is a Gutenberg text is irrelevant as they cannot own the copyright for transcribing a work (no intellectual property rights). The copyright tag will be what is relevant to apply to the author. You might attribute the source as being from Gutenberg in the Notes, alternatively on the talk page using {{textinfo}}. billinghurst (talk) 20:38, 26 November 2009 (UTC)
[edit] Quran translations in 40 languages
Guys, I found something awesome on the Internet Archive it is still being proceed by the Internet Archive. So far it has been processing almost a month. It is the Quran in 40 languages. The link can be found here. It will take a long while before we have all of the DJVU files, but when we do we will have the Quran on a bunch of wikis. Anyways, what do you guys think? --Mattwj2002 (talk) 09:25, 28 November 2009 (UTC)
- That's interesting, although there are two issues: the name of the translators are not mentioned (I think it is important in this case), and I am a bit suspicious about the neutrality of the translation when it is promoted by the Saoudi government. I feel the same about the Bible's translation distributed by the Jehovah Witnesses, as an example (see w:Criticism of Jehovah's Witnesses#Biblical criticisms). Yann (talk) 10:20, 28 November 2009 (UTC)
- For what it's worth, most modern translations you'll find of the Quran are distributed by the Saudi government; it's a huge deal for them, and the vast majority of (North American) Muslims will use it as their English copy without any concern over accuracy. In part this is because there are no blatant differences between it and other translations, and in part because it's part of the religion that any translation is flawed and not to be trusted, only as a "primer" until you can learn the Arabic original. I wouldn't worry too much about the Saudi aspect as we'd certainly mention it in the header text, just so long as the copyright license is acceptable. Sherurcij Collaboration of the Week: Author:Thomas Carlyle. 16:05, 28 November 2009 (UTC)