2. Re: How to use Mwdumper? (Brion Vibber)

2. Re: How to use Mwdumper? (Brion Vibber)

J Apple

Hello,

The README is not very newbie-friendly...

I don't even know what to do with the .jar file.
When I double-click it, nothing happens on my machine.
Can anyone give me more detailed instructions on what
I'm supposed to do with it?

Thanks!
--- [hidden email] wrote:

> Today's Topics:
>
>    1. Re: Re: New anti-spam initiatives (Brion Vibber)
>    2. Re: How to use Mwdumper? (Brion Vibber)
>    3. Re: New anti-spam initiatives (Aerik Sylvan)
>    4. Re: German wikipedia to be printed - where's the stable
>       version feature? (Tim Starling)
>    5. Re: Re: German wikipedia to be printed - where's the stable
>       version feature? (Domas Mituzas)
>    6. Re: MediaWiki API (Daniel Wunsch)
>    7. Re: Re: New anti-spam initiatives (Brion Vibber)
>    8. MathML support in MediaWiki (David Harvey)
>    9. Re: how to add an extension to wp (Petr Kadlec)
>   10. Re: how to add an extension to wp (Oscar Quintanilla)
>
>
>
> ----------------------------------------------------------------------
>
> Message: 1
> Date: Sat, 28 Jan 2006 16:42:14 -0800
> From: Brion Vibber <[hidden email]>
> Subject: Re: [Wikitech-l] Re: New anti-spam initiatives
> To: Wikimedia developers <[hidden email]>
> Message-ID: <[hidden email]>
> Content-Type: text/plain; charset="iso-8859-1"
>
> Aerik Sylvan wrote:
> > Brion - it looks like you're collecting the same info I set out to
> > get, plus some more - but is this running live anywhere?  I was under
> > the impression that the captcha is still totally in development?
>
> Running currently on all but the top 20 Wikipedias plus meta, commons,
> and a few misc.
>
> -- brion vibber (brion @ pobox.com)
>
>
> ------------------------------
>
> Message: 2
> Date: Sat, 28 Jan 2006 16:42:44 -0800
> From: Brion Vibber <[hidden email]>
> Subject: Re: [Wikitech-l] How to use Mwdumper?
> To: Wikimedia developers <[hidden email]>
> Message-ID: <[hidden email]>
> Content-Type: text/plain; charset="iso-8859-1"
>
> J Apple wrote:
> > Hello,
> > I'm a newbie.
> > I installed MW 1.5.5
> >
> > then I downloaded an SQL dump from wikipedia and
> > extracted the bzip2 to xml.
> >
> > Then how do I open/install/use mwdumper to get the DB
> > into my mediawiki?
>
> See the README.
>
> -- brion vibber (brion @ pobox.com)
>
>
> ------------------------------
>
> Message: 3
> Date: Sun, 29 Jan 2006 01:03:13 +0000 (UTC)
> From: Aerik Sylvan <[hidden email]>
> Subject: [Wikitech-l] Re: New anti-spam initiatives
> To: [hidden email]
> Message-ID: <[hidden email]>
> Content-Type: text/plain; charset=us-ascii
>
> Brion Vibber <brion@...> writes:
>
> > Running currently on all but the top 20 Wikipedias plus meta, commons,
> > and a few misc.
> >
> Great!  How do I go about getting the logs?
>
> Thanks!
> Aerik
>
>
>
>
>
> ------------------------------
>
> Message: 4
> Date: Mon, 30 Jan 2006 02:22:51 +1100
> From: Tim Starling <[hidden email]>
> Subject: [Wikitech-l] Re: German wikipedia to be printed - where's the
>       stable version feature?
> To: [hidden email]
> Message-ID: <drimkh$mg$[hidden email]>
> Content-Type: text/plain; charset=ISO-8859-1; format=flowed
>
> I'm replying to this wikipedia-l post in wikitech-l, it's more relevant
> here.
>
> Brion Vibber wrote:
> > I'd been waiting on Tim's in-progress code to compare. Apparently
> > there's not really anything much of that left (his work mostly
> > transmogrified into the templatelinks temple) so I'm poking at
> > Magnus's code now.
>
> Salvatore's moderation feature was implemented in a similar way to
> Magnus' one, in that it used an extra revision ID field in the page
> table to point to the relevant version. Salvatore's used parameters
> passed back to Revision to determine whether page_latest or
> page_verified should be used, whereas Magnus's code operated mainly at
> the UI level, redirecting to a page with an oldid parameter, IIRC.
>
> Neither of them had the structure required for efficient caching, that
> is, page/tag retrieval instead of page/revision retrieval. The basic
> problem is that tugela, which we are now using instead of memcached, has
> no efficient means for identifying and purging expired keys. In fact at
> the moment, this garbage collection is not done at all. To limit the
> growth of the cache under these circumstances, it's better to index the
> parser cache by page and tag, rather than page and revision ID. I
> thought that the best way to implement a tag concept, to merge Magnus's
> and Salvatore's features while minimising MySQL index space, would be to
> put the tag information in its own table.
>
> Then there's the problem of template and link colour changes. I posted
> to wikitech-l about that before. Magnus's suggestion of storing the
> wikitext with the templates expanded at save time is a quite reasonable
> solution.
>
>
=== message truncated ===


_______________________________________________
Wikitech-l mailing list
[hidden email]
http://mail.wikipedia.org/mailman/listinfo/wikitech-l
RE: 2. Re: How to use Mwdumper? (Brion Vibber)

Dorożyński Janusz
| -----Original Message-----
| From: ... J Apple
| Sent: Monday, January 30, 2006 6:31 PM
|
| The READ Me is not very newbie friendly...

:-))

| Don't even know what to do with the .jar file.
| WHen I double click it nothing happens on my machine.

Any details about your machine's OS? Anyway, mwdumper has no GUI
front end yet; it works only from the command line. So open a shell or
your command-line tool, then type and run a command like

java -jar proper-path-to/mwdumper.jar --output=file:your-wiki-dump.sql --format=sql:1.5 --progress=10000 your-wiki-dump.xml

Of course "proper-path-to", "/", and "your-wiki-dump" are only examples;
replace them with your real names and path separator. You also need JRE
1.4, or preferably 1.5, installed on your OS. And remember that the --output
argument must come before the --format argument if --format is used.
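
As a rough sketch of the whole job in two steps (convert the XML dump to
SQL, then load the SQL into MySQL): the file names are the placeholders
from the command above, while the database user "wikiuser", the database
"wikidb", and the DRY_RUN switch are made-up names for illustration only.
The mysql step assumes the standard mysql command-line client is installed.
By default the script only prints the commands so you can check them first.

```shell
#!/bin/sh
# Placeholder names: replace them with your own paths and credentials.
MWDUMPER_JAR="${MWDUMPER_JAR:-proper-path-to/mwdumper.jar}"
DUMP_XML="${DUMP_XML:-your-wiki-dump.xml}"
DUMP_SQL="${DUMP_SQL:-your-wiki-dump.sql}"
DB_USER="${DB_USER:-wikiuser}"   # hypothetical MySQL user
DB_NAME="${DB_NAME:-wikidb}"     # hypothetical MediaWiki database
DRY_RUN="${DRY_RUN:-1}"          # 1 = only print the commands

# Step 1: convert the XML dump to SQL (--output comes before --format).
convert_dump() {
    if [ "$DRY_RUN" = "1" ]; then
        echo "java -jar $MWDUMPER_JAR --output=file:$DUMP_SQL --format=sql:1.5 --progress=10000 $DUMP_XML"
    else
        java -jar "$MWDUMPER_JAR" "--output=file:$DUMP_SQL" \
            --format=sql:1.5 --progress=10000 "$DUMP_XML"
    fi
}

# Step 2: feed the generated SQL to the mysql client (it prompts for a password).
import_sql() {
    if [ "$DRY_RUN" = "1" ]; then
        echo "mysql -u $DB_USER -p $DB_NAME < $DUMP_SQL"
    else
        mysql -u "$DB_USER" -p "$DB_NAME" < "$DUMP_SQL"
    fi
}

convert_dump && import_sql
```

Run it once as-is to see the two commands, then again with DRY_RUN=0 once
the names are right.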

Regards,

 Janusz "Ency" Dorozynski

Re: 2. Re: How to use Mwdumper? (Brion Vibber)

Bugzilla from rowan.collins@gmail.com
On 30/01/06, J Apple <[hidden email]> wrote:
> Don't even know what to do with the .jar file.
> WHen I double click it nothing happens on my machine.
> Can anyone give me more detailed instructions on what
> I'm supposed to do with it?

1) Near the beginning of the readme, it says:
> Sample command line for a direct database import:
>   java -server -jar mwdumper.jar --format=sql:1.5 pages_full.xml.bz2 |
>   mysql -u <username> -p <databasename>
That means you need to be at a shell/command prompt and type in that
command. There is no user interface built into the tool ("* GUI
frontend(s)" is in the TODO list), so double-clicking will not do
anything, even if you have .jar files associated with the appropriate
Java interpreter.

Try typing that exact command, and see if it works; if not, play with
dumping into a file or connecting directly to the DB (see below).
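
Before trying the command, it may help to confirm that Java is reachable
from the prompt at all. This is only a small sketch; the function name
check_java is made up for illustration, and it assumes a POSIX-style
shell (on Windows, typing "java -version" at the command prompt does the
same job).

```shell
#!/bin/sh
# Report whether a Java runtime can be found on the PATH.
check_java() {
    if command -v java >/dev/null 2>&1; then
        echo "java found at: $(command -v java)"
        return 0
    else
        echo "java not found on PATH; install a JRE first"
        return 1
    fi
}

# Show the installed Java version only when java is actually available.
if check_java; then
    java -version
fi
```

If this reports that java is missing, the mwdumper command cannot work
until a JRE is installed.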

2) Daunting as they look, the options are neatly divided into 3 categories:
a) where do you want your output (e.g. to a file; directly to the
screen or another command; directly to a database);
b) what format do you want (e.g. XML, like "export" dumps; SQL data
suitable for input to a particular version of MediaWiki);
c) do you want to do any filtering while you're at it.
Ignore the fact that you can have several of each, and just pick one
"--output" and one "--format", in that order.

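To make the three categories concrete, here is a small sketch that only
prints a command line built from one choice per category; it does not run
mwdumper. The helper name build_cmd is made up, the file names are
placeholders, and two things are assumptions to check against your copy
of the README: that "latest" is one of the available filters, and that
the filter option belongs after the format.

```shell
#!/bin/sh
# Assemble an mwdumper command line from one choice per category.
# Usage: build_cmd OUTPUT FORMAT FILTER INPUT  (pass "" to skip a category)
build_cmd() {
    cmd="java -jar mwdumper.jar"
    if [ -n "$1" ]; then cmd="$cmd --output=$1"; fi   # a) where the output goes
    if [ -n "$2" ]; then cmd="$cmd --format=$2"; fi   # b) which format to write
    if [ -n "$3" ]; then cmd="$cmd --filter=$3"; fi   # c) optional filtering
    echo "$cmd $4"
}

# SQL for MediaWiki 1.5 written to a file, no filtering.
build_cmd "file:your-wiki-dump.sql" "sql:1.5" "" "your-wiki-dump.xml"

# Same, keeping only the latest revision of each page
# ("latest" is assumed from the README's filter list; check your copy).
build_cmd "file:your-wiki-dump.sql" "sql:1.5" "latest" "your-wiki-dump.xml"
```

Leaving the first argument empty drops the --output option, which
corresponds to the README sample that pipes the result into mysql.
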
3) As a minor "netiquette" issue, please avoid quoting entire message
digests (or even whole messages) when you reply; instead, trim the
quoted part to just enough to see what you're referring to.

--
Rowan Collins BSc
[IMSoP]