PS Archiving Project- Techies help requested
  • Hi all

    I have a simpler solution for archival.

    We will extract all messages with author and put it in a database -
    using some technology

    Then, somebody who is compiling has to just give the title of the
    article and message numbers - and the order of the messages

    Then the software will autocompile the article

    This will save us lot of time

    Anybody with Java / .Net experience who wants to contribute some time
    to this exercise - pl mail
  • Hi Gokul,

    I found a archiving tool in Freshmeat for yahoo. I will try it this evening and
    let you know.

    http://freshmeat.net/projects/ygma/
  • Thanks shiva
  • I am not sure about the status of the archival process. Meantime I was
    able to find a way to archive close to 1000 messages to MySQL database.

    You can have a look at the individual messages in html format as well
    as the sql file here http://ps.namo-namaha.net/. I used xampp mysql
    bundle which had version "Server version: 5.0.27-community".

    The database structure is shown here
    http://ps.namo-namaha.net/ygrp-archive.jpg

    The Full Text search available with mysql seems to very good and
    better than the yahoo group search. I haven't had a chance to provide
    a php interface to it.

    Based on the feedback from you guys, if the approach is viable and
    meets the expectation, I can work on it.

    We can tag each message and search on it as well (the way gmail label
    the messages). Once the old messages are archived, I can configure a
    email id to receive new group mails and dump to the database to make
    it sync as soon as a message posted.

    Please let me know your inputs.
  • Hey,

    That's interesting. The archival process is yet to take off. I saw
    some other tool (which i have posted some time back),
    but yet to evaluate it. I think this seems a good tool. I try to
    understand both the tools and come back.
  • The other tool is not available on the net. I wrote a java program
    using htmlunit, javamail and jdbc.

    if you are interested and know php, we can work on the interface. I
    don't have experience in php.

    But before going further, I would like to know the feedback from the
    group on this process and design decisions. I read one user's concern
    about Intellectual Property, etc., I don't want to create any
    violation of rules, I just joined the group and saw the post and
    wanted to contribute.

    I may need other techies cooperation as well to archive all the
    messages since yahoo would ban if you flood their servers and eat all
    the bandwidth. You have to pull 1000-2000 messages each time and wait
    for 2-3 hrs.
  • Very nice work, Thiru

    I'll look at the messages and get back to you

Howdy, Stranger!

It looks like you're new here. If you want to get involved, click one of these buttons!

Top Posters