• html table to csv

    From Brian Jordan@21:1/5 to All on Wed Nov 1 15:44:21 2023
    I have a lot of html tables which I want to convert and then, via
    Fireworkz, use to produce draw files for use in Ovation Pro and PrintPDF
    to eventually make a pdf booklet. The bit from Fireworkz onwards is tried
    and tested here and is fairly straightforward; the table to csv bit less
    so. It looks like some fairly heavy duty searching and replacing will be
    needed at first unless there is a program somewhere which might help me;
    is there such a thing?
    I am aware of some online stuff under Windows which might help but would
    really like to do the whole job under RISC OS. Any thoughts appreciated.
    Thanks
    B

    --
    _____________________________________________________________________

    Brian Jordan
    brian.jordan9@btinternet.com
    RISC OS 5.28 (16-Dec-20) on Raspberry Pi
    _____________________________________________________________________

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Paul Sprangers@21:1/5 to Brian Jordan on Wed Nov 1 16:56:12 2023
    In article <5afc8e1380brian.jordan9@btinternet.com>,
    Brian Jordan <brian.jordan9@btinternet.com> wrote:

    It looks like some fairly heavy duty searching and replacing will be
    needed at first unless there is a program somewhere which might help me;
    is there such a thing?

    You might give !ConvText a try. It's at https://riscos.sprie.nl/sprang.riscos/Downloads/ConvText.zip

    Paul

    --
    https://riscos.sprie.nl

  • From Brian Jordan@21:1/5 to Paul Sprangers on Wed Nov 1 16:04:30 2023
    In article <5afc8f291dPaul@sprie.nl>,
    Paul Sprangers <Paul@sprie.nl> wrote:
    Thanks

    [Snip]

    You might give !ConvText a try. It's at https://riscos.sprie.nl/sprang.riscos/Downloads/ConvText.zip

    Paul
    B

    --
    _____________________________________________________________________

    Brian Jordan
    brian.jordan9@btinternet.com
    RISC OS 5.28 (16-Dec-20) on Raspberry Pi
    _____________________________________________________________________

  • From Chris Newman@21:1/5 to brian.jordan9@btinternet.com on Wed Nov 1 16:32:16 2023
    In article <5afc8e1380brian.jordan9@btinternet.com>, Brian Jordan <brian.jordan9@btinternet.com> wrote:
    I have a lot of html tables which I want to convert and then, via
    Fireworkz, use to produce draw files for use in Ovation Pro and
    PrintPDF to eventually make a pdf booklet. The bit from fireworkz
    onwards is tried and tested here and is fairly straightforward, the
    table to csv bit less so. It looks like some fairly heavy duty
    searching and replacing will be needed at first unless there is a
    program somewhere which might help me; is there such a thing? I am
    aware of some online stuff under Windows which might help but would
    really like to do the whole job under RISC OS. Any thoughts
    appreciated. Thanks B

    CSV Edit Bernard Veasey

    CSVamp Ray Favre

    --
    Chris

  • From Chris Newman@21:1/5 to brian.jordan9@btinternet.com on Wed Nov 1 16:41:59 2023
    In article <5afc8e1380brian.jordan9@btinternet.com>, Brian Jordan <brian.jordan9@btinternet.com> wrote:
    I have a lot of html tables which I want to convert and then, via
    Fireworkz, use to produce draw files for use in Ovation Pro and
    PrintPDF to eventually make a pdf booklet. The bit from fireworkz
    onwards is tried and tested here and is fairly straightforward, the
    table to csv bit less so. It looks like some fairly heavy duty
    searching and replacing will be needed at first unless there is a
    program somewhere which might help me; is there such a thing? I am
    aware of some online stuff under Windows which might help but would
    really like to do the whole job under RISC OS. Any thoughts
    appreciated. Thanks B

    I have !UnHTML (Mike Williams, 1997). Very old, but it loaded in R5.19. Its
    purpose is described below, from its Help file. I can send it to you. It is
    so old that I presume there are no copyright problems.

    Converts HTML to Plain Text, Impression, or Draw Textarea format.
    Extracts bookmarks from links found on HTML pages.

    Author: Mike Williams
    mike@econym.demon.co.uk

    Usage: Choose the output format you require from the iconbar menu.
    Drag a HTML file to the UnHTML icon on the iconbar
    After a few seconds a save box will open
    Drag the resulting text to a filer or application

    Plain Text Format:
    The HTML tags are ripped out, special characters are converted,
    but no formatting is applied.

    --
    Chris

  • From Brian Jordan@21:1/5 to Chris Newman on Wed Nov 1 16:34:47 2023
    Thanks

    In article <5afc927659mec@npost.uk>,
    Chris Newman <mec@npost.uk> wrote:

    [Snip]

    CSV Edit Bernard Veasey

    CSVamp Ray Favre

    B

    --
    _____________________________________________________________________

    Brian Jordan
    brian.jordan9@btinternet.com
    RISC OS 5.28 (16-Dec-20) on Raspberry Pi
    _____________________________________________________________________

  • From Jean-Michel@21:1/5 to Jean-Michel on Wed Nov 1 20:52:51 2023
    In message <eb30a4fc5a.jmb@jmc.bruck.orange.fr>
    Jean-Michel <jmc.bruck@orange.fr> wrote:

    In message <5afc92b180brian.jordan9@btinternet.com>
    Brian Jordan <brian.jordan9@btinternet.com> wrote:

    Thanks

    In article <5afc927659mec@npost.uk>,
    Chris Newman <mec@npost.uk> wrote:

    [Snip]

    CSV Edit Bernard Veasey

    CSVamp Ray Favre

    B

    Thanks for pointing out these programs, they are always useful.
    Some time ago I worked on !Psifs and used the SIBO to RISC OS converters
    (they are on Thomas Milius's site). Very convenient.

    I just dug them out and was able to do the conversion you asked for from
    a csv file exported from !Fireworkz.


    I have sent you an example to test at your address.

    Sorry, I just reread your message and the conversion must be done the
    other way!!! :-(

    HTML to CSV not CSV to HTML....

    --
    Jean-Michel

  • From Jean-Michel@21:1/5 to Brian Jordan on Wed Nov 1 20:45:54 2023
    In message <5afc92b180brian.jordan9@btinternet.com>
    Brian Jordan <brian.jordan9@btinternet.com> wrote:

    Thanks

    In article <5afc927659mec@npost.uk>,
    Chris Newman <mec@npost.uk> wrote:

    [Snip]

    CSV Edit Bernard Veasey

    CSVamp Ray Favre

    B

    Thanks for pointing out these programs, they are always useful.
    Some time ago I worked on !Psifs and used the SIBO to RISC OS converters
    (they are on Thomas Milius's site). Very convenient.

    I just dug them out and was able to do the conversion you asked for from
    a csv file exported from !Fireworkz.


    I have sent you an example to test at your address.

    --
    Jean-Michel

  • From Harriet Bazley@21:1/5 to Brian Jordan on Wed Nov 1 21:41:53 2023
    On 1 Nov 2023 as I do recall,
    Brian Jordan wrote:

    I have a lot of html tables which I want to convert and then, via
    Fireworkz, use to produce draw files for use in Ovation Pro and PrintPDF
    to eventually make a pdf booklet. The bit from fireworkz onwards is tried
    and tested here and is fairly straightforward, the table to csv bit less
    so. It looks like some fairly heavy duty searching and replacing will be needed at first unless there is a program somewhere which might help me;
    is there such a thing?

    Current versions of EasiWriter can load HTML - you won't get any
    cascading style sheet formatting, but I've tested it on some pages with
    tables that I originally hand-crafted before uploading them to the host
    site, and I can load them back into EasiWriter and select the 'Table'
    region to save as a selection. EW exports tables as TSV, not CSV, but
    I think most things that understand the latter also understand the
    former; in the case of Fireworkz, it is perfectly possible to import tab-separated files into a document as tables provided that they are *filetyped* as CSV (&DFE).

    So it depends how your HTML tables were originally created and how clean
    the coding of them is, I suspect. If they genuinely are just tables of
    data and not messed up with all sorts of layout stuff then you can load
    the pages into EasiWriter, save the tables out as tab-separated text selections, and bulk-filetype those files as CSV in order to import them
    into Fireworkz as tables and/or spreadsheet cells, depending on what you
    want to do with them there.
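
    If some other program turns out to insist on real commas rather than
    tabs, the tab-separated export could be rewritten as true CSV with a few
    lines of Python. This is only a minimal sketch: it assumes a Python 3
    interpreter is to hand (nothing in this thread says so), and the function
    name tsv_to_csv is made up for illustration.

```python
import csv
import io


def tsv_to_csv(tsv_text):
    """Return tab-separated text rewritten as comma-separated text.

    Hypothetical helper: quote characters in the input are treated as
    ordinary text, and fields that contain commas are quoted on output.
    """
    # newline="" stops StringIO from translating line endings itself.
    source = io.StringIO(tsv_text, newline="")
    reader = csv.reader(source, delimiter="\t", quoting=csv.QUOTE_NONE)

    target = io.StringIO()
    writer = csv.writer(target, lineterminator="\n")
    writer.writerows(reader)
    return target.getvalue()
```

    The csv module does the quoting, so a cell such as a name written
    "Surname, Initial" survives as a single field instead of being split at
    its comma.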


    --
    Harriet Bazley == Loyaulte me lie ==

    I mean to live forever - or die trying!

  • From Brian Jordan@21:1/5 to Harriet Bazley on Thu Nov 2 00:14:58 2023
    Many thanks,

    I have EasiWriter here although whether it's the current version will be discovered in the morning.

    In article <2fcfaefc5a.harriet@bazleyfamily.co.uk>,
    Harriet Bazley <harriet@bazleyfamily.co.uk> wrote:
    On 1 Nov 2023 as I do recall,
    Brian Jordan wrote:

    [Snip my original query]

    Current versions of EasiWriter can load HTML - you won't get any
    cascading style sheet formatting, but I've tested it on some pages with tables that I originally hand-crafted before uploading them to the host
    site, and I can load them back into EasiWriter and select the 'Table'
    region to save as a selection. EW exports tables as TSV, not CSV, but
    I think most things that understand the latter also understand the
    former; in the case of Fireworkz, it is perfectly possible to import tab-separated files into a document as tables provided that they are *filetyped* as CSV (&DFE).

    That sounds promising.

    So it depends how your HTML tables were originally created and how clean
    the coding of them is, I suspect. If they genuinely are just tables of
    data and not messed up with all sorts of layout stuff then you can load
    the pages into EasiWriter, save the tables out as tab-separated text selections, and bulk-filetype those files as CSV in order to import them
    into Fireworkz as tables and/or spreadsheet cells, depending on what you
    want to do with them there.

    I created the majority of these tables in lovingly hand-crafted HTML back
    in the mid '90s, and before publishing them they were run through HTML
    Tidy until they were squeaky clean. I am quite hopeful.

    B

    --
    _____________________________________________________________________

    Brian Jordan
    brian.jordan9@btinternet.com
    RISC OS 5.28 (16-Dec-20) on Raspberry Pi
    _____________________________________________________________________

  • From Brian Jordan@21:1/5 to Brian Jordan on Thu Nov 2 09:53:00 2023
    Even better!

    In article <5afcbcd322brian.jordan9@btinternet.com>,
    Brian Jordan <brian.jordan9@btinternet.com> wrote:
    Many thanks,

    I have EasiWriter here although whether it's the current version will be discovered in the morning.

    Not only do I have a version which works as Harriet describes, it does
    even better: exporting as pdf is available in my version, thus cutting
    out a number of middle men.

    [Snip]
    B

    --
    _____________________________________________________________________

    Brian Jordan
    brian.jordan9@btinternet.com
    RISC OS 5.28 (16-Dec-20) on Raspberry Pi
    _____________________________________________________________________

  • From Harriet Bazley@21:1/5 to Brian Jordan on Thu Nov 2 10:40:51 2023
    On 2 Nov 2023 as I do recall,
    Brian Jordan wrote:

    Even better!

    In article <5afcbcd322brian.jordan9@btinternet.com>,
    Brian Jordan <brian.jordan9@btinternet.com> wrote:
    Many thanks,

    I have EasiWriter here although whether it's the current version will be discovered in the morning.
    Not only do I have a version which works as Harriet describes it does
    even better in that exporting as pdf is available in my version thus
    cutting out a number of middle men.


    Oh, I assumed you actually needed the data in Fireworkz for calculation purposes rather than layout....

    --
    Harriet Bazley == Loyaulte me lie ==

    Eschew Obfuscation.

  • From Brian Jordan@21:1/5 to Harriet Bazley on Thu Nov 2 11:18:49 2023
    In article <ec1ff6fc5a.harriet@bazleyfamily.co.uk>,
    Harriet Bazley <harriet@bazleyfamily.co.uk> wrote:
    On 2 Nov 2023 as I do recall,
    Brian Jordan wrote:

    [Snip]

    Oh, I assumed you actually needed the data in Fireworkz for calculation purposes rather than layout....

    Fair assumption but to explain...
    The files I am converting are old Championship tables for a motor racing
    club. I can't remember how they were produced, presumably in a
    spreadsheet, but all calculations were completed before creating the web tables. I no longer have the original files but am able to grab the
    tables from the, soon to be closed, site. The club has asked if I can
    grab all of the tables from 1996 to the present and produce an inclusive
    pdf document. I have all the recent (post 2010) files here as Fireworkz
    files from which I produce pdfs and HTML (Using Paul Vigay's Webworkz)
    and the Fireworkz route for the old files suggested itself to me. The
    EasiWriter solution makes it all so much easier, thank you.
    B

    --
    _____________________________________________________________________

    Brian Jordan
    brian.jordan9@btinternet.com
    RISC OS 5.28 (16-Dec-20) on Raspberry Pi
    _____________________________________________________________________

  • From Harriet Bazley@21:1/5 to Brian Jordan on Thu Nov 2 18:59:52 2023
    On 2 Nov 2023 as I do recall,
    Brian Jordan wrote:


    The files I am converting are old Championship tables for a motor racing club. I can't remember how they were produced, presumably in a
    spreadsheet, but all calculations were completed before creating the web tables. I no longer have the original files but am able to grab the
    tables from the, soon to be closed, site. The club has asked if I can
    grab all of the tables from 1996 to the present and produce an inclusive
    pdf document. I have all the recent (post 2010) files here as Fireworkz
    files from which I produce pdfs and HTML (Using Paul Vigay's Webworkz)
    and the Fireworkz route for the old files suggested itself to me. The Easiwriter solution makes it all so much easier, thank you.

    Excellent news!

    --
    Harriet Bazley == Loyaulte me lie ==

    Those of you who think you know everything are annoying those of us who do.

  • From Richard Torrens (News)@21:1/5 to Brian Jordan on Fri Nov 3 10:42:28 2023
    In article <5afc8e1380brian.jordan9@btinternet.com>,
    Brian Jordan <brian.jordan9@btinternet.com> wrote:
    I have a lot of html tables which I want to convert and then, via
    Fireworkz, use to produce draw files for use in Ovation Pro and PrintPDF
    to eventually make a pdf booklet. The bit from fireworkz onwards is tried
    and tested here and is fairly straightforward, the table to csv bit less
    so. It looks like some fairly heavy duty searching and replacing will be needed at first unless there is a program somewhere which might help me;
    is there such a thing?
    I am aware of some online stuff under Windows which might help but would really like to do the whole job under RISC OS. Any thoughts appreciated. Thanks
    B

    If you have Iris - it can export as Text. It uses TAB chars between cells.

    --
    ------------------------------------------------------------------
    Richard Torrens. News email address is valid - for a limited time only.
    You must use the full News+number@Torrens.org as in the From address.
    http://www.Torrens.org for genealogy, natural history, wild food, walks, cats and more!

  • From Brian Jordan@21:1/5 to News+19662@Torrens.org on Fri Nov 3 11:55:14 2023
    In article <5afd7a1c06news*@Torrens.org>,
    Richard Torrens (News) <News+19662@Torrens.org> wrote:
    In article <5afc8e1380brian.jordan9@btinternet.com>,
    Brian Jordan <brian.jordan9@btinternet.com> wrote:

    [Snip my original request]

    If you have Iris - it can export as Text. It uses TAB chars between
    cells.

    I do, and this knowledge has added a further string to my bow, many thanks.
    In the last few days, through the help of folks in these parts, I have gone
    from an "I wonder if..." to a cup overflowing situation. Thanks to all for
    your help.
    B

    --
    _____________________________________________________________________

    Brian Jordan
    brian.jordan9@btinternet.com
    RISC OS 5.28 (16-Dec-20) on Raspberry Pi
    _____________________________________________________________________

  • From Harriet Bazley@21:1/5 to All on Sat Nov 4 12:57:51 2023
    On 3 Nov 2023 as I do recall,
    Richard Torrens (News) wrote:

    In article <5afc8e1380brian.jordan9@btinternet.com>,
    Brian Jordan <brian.jordan9@btinternet.com> wrote:
    I have a lot of html tables which I want to convert and then, via Fireworkz, use to produce draw files for use in Ovation Pro and PrintPDF
    to eventually make a pdf booklet. The bit from fireworkz onwards is tried and tested here and is fairly straightforward, the table to csv bit less so. It looks like some fairly heavy duty searching and replacing will be needed at first unless there is a program somewhere which might help me;
    is there such a thing?
    I am aware of some online stuff under Windows which might help but would really like to do the whole job under RISC OS. Any thoughts appreciated. Thanks
    B

    If you have Iris - it can export as Text. It uses TAB chars between cells.

    Even Netsurf does that...

    --
    Harriet Bazley == Loyaulte me lie ==

    "An American is a man with two arms and four wheels".

  • From Richard Torrens (News)@21:1/5 to Harriet Bazley on Sun Nov 5 14:50:15 2023
    In article <fc560afe5a.harriet@bazleyfamily.co.uk>,
    Harriet Bazley <harriet@bazleyfamily.co.uk> wrote:

    If you have Iris - it can export as Text. It uses TAB chars between
    cells.

    Even Netsurf does that...

    But it uses spaces - not TABs!

    --
    ------------------------------------------------------------------
    Richard Torrens. News email address is valid - for a limited time only.
    You must use the full News+number@Torrens.org as in the From address.
    http://www.Torrens.org for genealogy, natural history, wild food, walks, cats and more!

  • From Harriet Bazley@21:1/5 to All on Sun Nov 5 16:12:46 2023
    On 5 Nov 2023 as I do recall,
    Richard Torrens (News) wrote:

    In article <fc560afe5a.harriet@bazleyfamily.co.uk>,
    Harriet Bazley <harriet@bazleyfamily.co.uk> wrote:

    If you have Iris - it can export as Text. It uses TAB chars between cells.

    Even Netsurf does that...

    But it uses spaces - not TABs!

    I'm definitely getting tabs, both from select-and-drag and from
    Export->Text. Maybe it depends on the way the table was defined/laid
    out in the first place? I've only been testing it on my own tables....
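
    Whichever separator a given export turns out to use, a small Python
    helper could cope with both. This is a sketch under stated assumptions:
    Python 3 is available, the name split_columns is hypothetical, and a
    space-padded export is assumed to put at least two spaces between
    columns (which may not hold for every table).

```python
import re


def split_columns(line):
    """Split one line of exported table text into its cells.

    Hypothetical helper: tabs are used if the line contains any;
    otherwise runs of two or more spaces are taken as column gaps,
    which is an assumption about how a space-padded export looks.
    """
    if "\t" in line:
        return line.split("\t")
    return re.split(r" {2,}", line.strip())
```

    Single spaces inside a cell (between a first name and a surname, say)
    are left alone; only wider gaps are treated as column breaks.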

    --
    Harriet Bazley == Loyaulte me lie ==

    Those who can't write, write manuals.

  • From Vince M Hudd@21:1/5 to Brian Jordan on Wed Jan 3 16:52:24 2024
    On 01/11/2023 15:44, Brian Jordan wrote:


    I have a lot of html tables which I want to convert and then, via
    Fireworkz, use to produce draw files for use in Ovation Pro and PrintPDF
    to eventually make a pdf booklet. The bit from fireworkz onwards is tried
    and tested here and is fairly straightforward, the table to csv bit less
    so. It looks like some fairly heavy duty searching and replacing will be needed at first unless there is a program somewhere which might help me;
    is there such a thing?

    I see I'm late to the party on this one (this is my 'annual' usenet
    catchup!) so you already have solutions suggested, but I may as well add
    that WebChange (with the aid of a suitable script) can do this.

    (Although I no longer have the script to hand that I was using at the
    time, it's one of the things I used to do as a demo of the software).

    The only flaw was that it would only be able to handle the first table
    it encountered.
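
    For pages holding more than one table, a short Python script could pull
    out each table in turn. The following is a minimal sketch and not the
    WebChange script mentioned above: it assumes Python 3 is available, that
    the HTML is clean (explicit closing tags, as HTML Tidy would produce) and
    that tables are not nested. The names TableExtractor and
    html_tables_to_csv are made up for illustration.

```python
import csv
import io
from html.parser import HTMLParser


class TableExtractor(HTMLParser):
    """Collect every <table> in a page as a list of rows of cell text.

    Assumes explicit closing tags and no nested tables. HTMLParser
    lower-cases tag names, so <TABLE> and <table> are treated alike.
    """

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.tables = []    # finished tables, each a list of rows
        self._table = None  # rows of the table being read, or None
        self._row = None    # cells of the row being read, or None
        self._cell = None   # text fragments of the current cell, or None

    def handle_starttag(self, tag, attrs):
        if tag == "table":
            self._table = []
        elif tag == "tr" and self._table is not None:
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            # Collapse runs of whitespace inside the cell to single spaces.
            self._row.append(" ".join("".join(self._cell).split()))
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self._table.append(self._row)
            self._row = None
            self._cell = None
        elif tag == "table" and self._table is not None:
            self.tables.append(self._table)
            self._table = None

    def handle_data(self, data):
        # Text outside a cell is ignored.
        if self._cell is not None:
            self._cell.append(data)


def html_tables_to_csv(html_text):
    """Return one CSV string per table found in html_text, in page order."""
    parser = TableExtractor()
    parser.feed(html_text)
    parser.close()
    results = []
    for table in parser.tables:
        buffer = io.StringIO()
        csv.writer(buffer, lineterminator="\n").writerows(table)
        results.append(buffer.getvalue())
    return results
```

    Each string in the returned list could then be saved as its own file
    and filetyped as CSV before loading it elsewhere.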

    --
    Vince M Hudd

  • From Vince M Hudd@21:1/5 to Vince M Hudd on Sat Jan 6 18:21:19 2024
    On 03/01/2024 16:52, Vince M Hudd wrote:

    I see I'm late to the party on this one (this is my 'annual' usenet
    catchup!) so you already have solutions suggested, but I may as well add
    that WebChange (with the aid of a suitable script) can do this.

    (Although I no longer have the script to hand that I was using at the
    time, it's one of the things I used to do as a demo of the software).

    The only flaw was that it would only be able to handle the first table
    it encountered.
    I was reminded today that I'd let the webchange.co.uk domain go, and
    hadn't updated the softrock.co.uk site accordingly - so WebChange has been missing in action for a while. (h/t to Bernard Boase for pointing it out)

    As a quick fix, I've created a new subdomain - webchange.softrock.co.uk
    and mapped it to the server space that webchange.co.uk was previously
    using, and I've updated the link on the WebChange page on softrock.co.uk
    to point to it.

    So as of now, WebChange can once again be downloaded.

    https://www.softrock.co.uk/products/webchange.html
    http://webchange.softrock.co.uk/

    The main site itself hasn't actually been properly updated, though (i.e. I haven't run WebChange on it) - so other links remain broken. That'll give
    lots of "we found problems on your website" spammers even more reasons to
    email me. ;)

    --
    Vince M Hudd
