• RootsMagic Users

    From J. Hugh Sullivan@21:1/5 to All on Tue Jun 12 02:10:04 2018
    I started with Bruce when it was Family Origins, went to RootsMagic
    but changed to Legacy. I now have both on the computers.

    I exported a Legacy GED to RM. I have hundreds of people who go by
    their middle name - an old time Southern tradition.

    My name shows up on the import from Legacy James "Hugh" "Hugh"
    Sullivan. Of course the second "Hugh" is a "nickname" in RM.

    How do I delete the duplicate en masse?

    I always go by First Initial, Middle Name and Surname - I will fight
    that battle to the end with doctors, governors, licenses, etc.

    Us WWII vets are sorta hard-nosed - and I've been married 70 years to
    the same auburn-haired gal.

    Hugh

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Steve Hayes@21:1/5 to Sullivan on Tue Jun 12 05:34:57 2018
    On Tue, 12 Jun 2018 02:10:04 GMT, Eagle@bellsouth.net (J. Hugh
    Sullivan) wrote:

    I started with Bruce when it was Family Origins, went to RootsMagic
    but changed to Legacy. I now have both on the computers.

    I exported a Legacy GED to RM. I have hundreds of people who go by
    their middle name - an old time Southern tradition.

    My name shows up on the import from Legacy James "Hugh" "Hugh"
    Sullivan. Of course the second "Hugh" is a "nickname" in RM.

    How do I delete the duplicate en masse?

    I'm not sure how you do it en masse. Please let us know if you
    discover how.

    I too have both Legacy and RM, but find I'm using RM more these days.


    --
    Steve Hayes
    http://www.khanya.org.za/stevesig.htm
    http://khanya.wordpress.com

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From J. Hugh Sullivan@21:1/5 to hayesstw@telkomsa.net on Tue Jun 12 08:40:55 2018
    On Tue, 12 Jun 2018 05:34:57 +0200, Steve Hayes
    <hayesstw@telkomsa.net> wrote:

    On Tue, 12 Jun 2018 02:10:04 GMT, Eagle@bellsouth.net (J. Hugh
    Sullivan) wrote:

    I started with Bruce when it was Family Origins, went to RootsMagic
    but changed to Legacy. I now have both on the computers.

    I exported a Legacy GED to RM. I have hundreds of people who go by
    their middle name - an old time Southern tradition.

    My name shows up on the import from Legacy James "Hugh" "Hugh"
    Sullivan. Of course the second "Hugh" is a "nickname" in RM.

    How do I delete the duplicate en masse?

    I'm not sure how you do it en masse. Please let us know if you
    discover how.

    I too have both Legacy and RM, but find I'm using RM more these days.


    --
    Steve Hayes
    http://www.khanya.org.za/stevesig.htm
    http://khanya.wordpress.com

    Thanks, Steve.

    I'm going to see if I now like RM better. I like the person to person
    movement better on Legacy (up and down) and the name source does not
    switch back to name from RM to Legacy (but I can live with that).

    I wonder if a GED could be ported to Excel and use Find All to
    identify NICK in red, sort red fonts to the top and delete. Even if
    NICK "Hugh" appeared I could selectively erase the part I did not
    want.

    I asked because a 90 year old man doesn't learn as fast as he used to.

    Hugh

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From J. Hugh Sullivan@21:1/5 to Sullivan on Tue Jun 12 10:06:42 2018
    On Tue, 12 Jun 2018 08:40:55 GMT, Eagle@bellsouth.net (J. Hugh
    Sullivan) wrote:

    I wonder if a GED could be ported to Excel and use Find All to
    identify NICK in red, sort red fonts to the top and delete. Even if
    NICK "Hugh" appeared I could selectively erase the part I did not
    want.

    Hugh

    The method works - so far.

    The GED is composed in .txt format. Select All, Copy and Paste in
    Excel.

    Find NICK(s), Replace with NICK(s) (in color font).

    Delete NICK(s) that are duplicates. Don't Delete real nicknames.

    The problem: I have about 300,000 lines of which 520 are NICK.
    Scrolling and Deleting will take awhile.

    Hugh

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From J. Hugh Sullivan@21:1/5 to Sullivan on Tue Jun 12 12:23:08 2018
    On Tue, 12 Jun 2018 10:06:42 GMT, Eagle@bellsouth.net (J. Hugh
    Sullivan) wrote:

    On Tue, 12 Jun 2018 08:40:55 GMT, Eagle@bellsouth.net (J. Hugh
    Sullivan) wrote:

    I wonder if a GED could be ported to Excel and use Find All to
    identify NICK in red, sort red fonts to the top and delete. Even if
    NICK "Hugh" appeared I could selectively erase the part I did not
    want.

    Hugh

    The method works - so far.

    Next problem: The Excel file will not convert to a GED file to be
    imported into RM.

    OOPS!

    Hugh

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Charlie Hoffpauir@21:1/5 to Sullivan on Tue Jun 12 09:33:53 2018
    On Tue, 12 Jun 2018 02:10:04 GMT, Eagle@bellsouth.net (J. Hugh
    Sullivan) wrote:

    I started with Bruce when it was Family Origins, went to RootsMagic
    but changed to Legacy. I now have both on the computers.

    I exported a Legacy GED to RM. I have hundreds of people who go by
    their middle name - an old time Southern tradition.

    My name shows up on the import from Legacy James "Hugh" "Hugh"
    Sullivan. Of course the second "Hugh" is a "nickname" in RM.

    How do I delete the duplicate en masse?

    I always go by First Initial, Middle Name and Surname - I will fight
    that battle to the end with doctors, governors, licenses, etc.

    Us WWII vets are sorta hard-nosed - and I've been married 70 years to
    the same auburn-haired gal.

    Hugh

    I'm thinking that it could be done on a GEDCOM using a text editor
    like EditPlus or Notepad++ and Regular Expressions. You would need a
    small GEDCOM showing how the names appear coming from Legacy, and
    another GEDCOM from RM showing how they should appear "after" you
    corrected RM to show the names the way you want them to be. RE gives
    you lots of latitude in doing a mass edit... the goal to edit the
    Legacy GEDCOM to match that of the corrected RM GEDCOM.

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Charlie Hoffpauir@21:1/5 to Sullivan on Tue Jun 12 09:48:46 2018
    On Tue, 12 Jun 2018 10:06:42 GMT, Eagle@bellsouth.net (J. Hugh
    Sullivan) wrote:

    On Tue, 12 Jun 2018 08:40:55 GMT, Eagle@bellsouth.net (J. Hugh
    Sullivan) wrote:

    I wonder if a GED could be ported to Excel and use Find All to
    identify NICK in red, sort red fonts to the top and delete. Even if
    NICK "Hugh" appeared I could selectively erase the part I did not
    want.

    Hugh

    The method works - so far.

    The GED is composed in .txt format. Select All, Copy and Paste in
    Excel.

    Find NICK(s), Replace with NICK(s) (in color font).

    Delete NICK(s) that are duplicates. Don't Delete real nicknames.

    The problem: I have about 300,000 lines of which 520 are NICK.
    Scrolling and Deleting will take awhile.

    Hugh

    This is exactly what you would do using a text editor with Regular
    Expressions capability.... only it would be a one-time command.... no
    scrolling and deleting.... that would be done by the software.

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Charlie Hoffpauir@21:1/5 to Sullivan on Tue Jun 12 10:25:37 2018
    On Tue, 12 Jun 2018 12:23:08 GMT, Eagle@bellsouth.net (J. Hugh
    Sullivan) wrote:

    On Tue, 12 Jun 2018 10:06:42 GMT, Eagle@bellsouth.net (J. Hugh
    Sullivan) wrote:

    On Tue, 12 Jun 2018 08:40:55 GMT, Eagle@bellsouth.net (J. Hugh
    Sullivan) wrote:

    I wonder if a GED could be ported to Excel and use Find All to
    identify NICK in red, sort red fonts to the top and delete. Even if
    NICK "Hugh" appeared I could selectively erase the part I did not
    want.

    Hugh

    The method works - so far.

    Next problem: The Excel file will not convert to a GED file to be
    imported into RM.

    OOPS!

    Hugh

    Actually, all you need to do is to copy the text from Excel to a file
    and rename it from TXT to GED.

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From J. Hugh Sullivan@21:1/5 to invalid@invalid.com on Tue Jun 12 16:12:08 2018
    On Tue, 12 Jun 2018 10:25:37 -0500, Charlie Hoffpauir
    <invalid@invalid.com> wrote:

    On Tue, 12 Jun 2018 12:23:08 GMT, Eagle@bellsouth.net (J. Hugh
    Sullivan) wrote:

    On Tue, 12 Jun 2018 10:06:42 GMT, Eagle@bellsouth.net (J. Hugh
    Sullivan) wrote:

    On Tue, 12 Jun 2018 08:40:55 GMT, Eagle@bellsouth.net (J. Hugh
    Sullivan) wrote:

    I wonder if a GED could be ported to Excel and use Find All to
    identify NICK in red, sort red fonts to the top and delete. Even if >>>>NICK "Hugh" appeared I could selectively erase the part I did not
    want.

    Hugh

    The method works - so far.

    Next problem: The Excel file will not convert to a GED file to be
    imported into RM.

    OOPS!

    Hugh

    Actually, all you need to do is to copy the text from Excel to a file
    and rename it from TXT to GED.

    I tried that. Maybe the problem was mine and not the computer.

    Hugh

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From J. Hugh Sullivan@21:1/5 to invalid@invalid.com on Tue Jun 12 16:13:16 2018
    On Tue, 12 Jun 2018 09:33:53 -0500, Charlie Hoffpauir
    <invalid@invalid.com> wrote:

    I'm thinking that it could be done on a GEDCOM using a text editor
    like EditPlus or Notepad++ and Regular Expressions. You would need a
    small GEDCOM showing how the names appear coming from Legacy, and
    another GEDCOM from RM showing how they should appear "after" you
    corrected RM to show the names the way you want them to be. RE gives
    you lots of latitude in doing a mass edit... the goal to edit the
    Legacy GEDCOM to match that of the corrected RM GEDCOM.

    I'm not familiar with RE - I'll give it a look.

    Hugh

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Charlie Hoffpauir@21:1/5 to Sullivan on Tue Jun 12 11:41:49 2018
    On Tue, 12 Jun 2018 16:13:16 GMT, Eagle@bellsouth.net (J. Hugh
    Sullivan) wrote:

    On Tue, 12 Jun 2018 09:33:53 -0500, Charlie Hoffpauir
    <invalid@invalid.com> wrote:

    I'm thinking that it could be done on a GEDCOM using a text editor
    like EditPlus or Notepad++ and Regular Expressions. You would need a
    small GEDCOM showing how the names appear coming from Legacy, and
    another GEDCOM from RM showing how they should appear "after" you
    corrected RM to show the names the way you want them to be. RE gives
    you lots of latitude in doing a mass edit... the goal to edit the
    Legacy GEDCOM to match that of the corrected RM GEDCOM.

    I'm not familiar with RE - I'll give it a look.

    Hugh

    For text editing I prefer EditPlus. However it's not the most popular,
    probably because it's not free. Notepad++ seems very capable, and it's
    free. I'd recommend downloading Notepad++ and reading a bit about
    Regular Expressions. If the cost doesn't bother you, I do think
    Editplus is easier to use (better learning curve). If the changes
    needed are simply a search and replace, maybe you could do that in
    Word.

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From J. Hugh Sullivan@21:1/5 to invalid@invalid.com on Tue Jun 12 17:34:47 2018
    On Tue, 12 Jun 2018 11:41:49 -0500, Charlie Hoffpauir
    <invalid@invalid.com> wrote:

    For text editing I prefer EditPlus. However it's not the most popular, >probably because it's not free. Notepad++ seems very capable, and it's
    free. I'd recommend downloading Notepad++ and reading a bit about
    Regular Expressions. If the cost doesn't bother you, I do think
    Editplus is easier to use (better learning curve). If the changes
    needed are simply a search and replace, maybe you could do that in
    Word.

    Cost is not a problem. But I imported the Legacy .fdb and the
    nicknames disappeared. That eliminates my real problem. And a
    descendant chart essentially solves my need for heresy.

    I used Sull Test for the RM base and I need to change that. I suppose
    I can just edit the folder name.

    I noticed the problem search and voided about 75% of them - really
    easy to do that.

    I had forgotten how good Bruce's program is.

    Thanks, Charlie.

    Hugh

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Steve Hayes@21:1/5 to Sullivan on Wed Jun 13 04:21:54 2018
    On Tue, 12 Jun 2018 12:23:08 GMT, Eagle@bellsouth.net (J. Hugh
    Sullivan) wrote:

    On Tue, 12 Jun 2018 10:06:42 GMT, Eagle@bellsouth.net (J. Hugh
    Sullivan) wrote:

    On Tue, 12 Jun 2018 08:40:55 GMT, Eagle@bellsouth.net (J. Hugh
    Sullivan) wrote:

    I wonder if a GED could be ported to Excel and use Find All to
    identify NICK in red, sort red fonts to the top and delete. Even if
    NICK "Hugh" appeared I could selectively erase the part I did not
    want.

    Hugh

    The method works - so far.

    Next problem: The Excel file will not convert to a GED file to be
    imported into RM.

    Better to use a text editor on the GED file itself.

    Do a search and replace.


    --
    Steve Hayes
    http://www.khanya.org.za/stevesig.htm
    http://khanya.wordpress.com

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From J. Hugh Sullivan@21:1/5 to hayesstw@telkomsa.net on Wed Jun 13 07:17:14 2018
    On Wed, 13 Jun 2018 04:21:54 +0200, Steve Hayes
    <hayesstw@telkomsa.net> wrote:

    Next problem: The Excel file will not convert to a GED file to be
    imported into RM.

    Better to use a text editor on the GED file itself.

    Do a search and replace.

    Steve Hayes
    http://www.khanya.org.za/stevesig.htm
    http://khanya.wordpress.com

    I do the S & R in Excel a lot but not a TE or Word.

    The NICK in Legacy doubled up as Alt. Name in RM and I missed that.

    I don't know Richard that well but I knew you and Charlie would come
    up with something - and, so far, I'm not too old to learn.

    Hugh

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Steve Hayes@21:1/5 to Sullivan on Thu Jun 14 03:53:47 2018
    On Wed, 13 Jun 2018 07:17:14 GMT, Eagle@bellsouth.net (J. Hugh
    Sullivan) wrote:

    On Wed, 13 Jun 2018 04:21:54 +0200, Steve Hayes
    <hayesstw@telkomsa.net> wrote:

    Next problem: The Excel file will not convert to a GED file to be >>>imported into RM.

    Better to use a text editor on the GED file itself.

    The thing is, a .GED file is a text file, and Excel doesn't really
    work with text files, at least not any of the versions I've used.

    I mostly use XyWrite for such things -- an old word processor that
    uses Ascii as its native format, and is actually more powerful as a
    word processor than most of the modern bloatware -- fewer bells and
    whistles, but more pistons and cylinders. One advantage over others is
    that it's not fussy about file endings -- it will accept .GED, and
    won't demand that you change it to .TXT. The disadvantage is that,
    being old, it only understands 8.3 file names.

    But using any text editor that does search and replace should be much
    easier than using something like Excel, which needs file conversion
    both ways.

    --
    Steve Hayes
    Web: http://hayesgreene.wordpress.com/
    http://hayesgreene.blogspot.com
    http://groups.yahoo.com/group/afgen/

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From john@21:1/5 to J. Hugh Sullivan on Thu Jun 14 14:11:28 2018
    On 14/06/2018 13:33, J. Hugh Sullivan wrote:
    On Thu, 14 Jun 2018 03:53:47 +0200, Steve Hayes
    <hayesstw@telkomsa.net> wrote:

    But using any text editor that does search and replace should be much
    easier than using something like Excel, which needs file conversion
    both ways.

    Steve Hayes

    I have grown older getting there. The file conversions left scars. :)

    The problem, so far, is...

    2 NICK (name)

    I can delete NICK en masse but it leaves 2 and (name). I have used
    Notepad, Wordpad and Word. Word does best but I have to eliminate more
    than 500 instances individually in more than 3,000 pages. I'm on Page
    300 now.

    I'll look at XyWrite if it is still available.

    Thankee,

    Hugh


    I think you will find XyWrite has not been available since 2003!
    According to the wiki article "Despite these advantages in speed,
    XyWrite does not have as many features as Word or OpenOffice.org. For
    example, XyWrite is unaware of Windows ANSI or Unicode character sets
    and Nota Bene does not support languages (such as Chinese) that require double-byte characters."

    Use the free Notepad++ from
    https://notepad-plus-plus.org/download/v7.5.6.html It will read and
    write a text GEDCOM - you can save it with the appropriate file extension.

    A useful Help sheet https://drive.google.com/file/d/0B86nuTd5nMTKaENHcmliUC1kdnc/edit

    Make a copy of your GEDCOM

    You can record a simple macro, save it and run it.

    I'm not sure of your exact problem but I think you need a macro to
    - search for 2 NICK (use Ctrl-F)
    - Delete Line (use Ctrl-L)
    you can then stop recording, run it a number of times or to the end of
    the file and can save the macro.

    By recording a macro you can see what your actions are on a copy of your
    main GEDCOM

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From J. Hugh Sullivan@21:1/5 to hayesstw@telkomsa.net on Thu Jun 14 11:33:52 2018
    On Thu, 14 Jun 2018 03:53:47 +0200, Steve Hayes
    <hayesstw@telkomsa.net> wrote:

    But using any text editor that does search and replace should be much
    easier than using something like Excel, which needs file conversion
    both ways.

    Steve Hayes

    I have grown older getting there. The file conversions left scars. :)

    The problem, so far, is...

    2 NICK (name)

    I can delete NICK en masse but it leaves 2 and (name). I have used
    Notepad, Wordpad and Word. Word does best but I have to eliminate more
    than 500 instances individually in more than 3,000 pages. I'm on Page
    300 now.

    I'll look at XyWrite if it is still available.

    Thankee,

    Hugh

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Dennis Lee Bieber@21:1/5 to All on Thu Jun 14 09:28:05 2018
    On Thu, 14 Jun 2018 11:33:52 GMT, Eagle@bellsouth.net (J. Hugh Sullivan) declaimed the following:


    The problem, so far, is...

    2 NICK (name)

    I can delete NICK en masse but it leaves 2 and (name). I have used
    Notepad, Wordpad and Word. Word does best but I have to eliminate more
    than 500 instances individually in more than 3,000 pages. I'm on Page
    300 now.


    Presuming ALL occurrences are on lines that start with "2 NICK", and do not break across lines, your search string needs to be a wild card/regex
    that specifies

    start of line
    2 NICK
    * (match anything)
    end of line

    Best I can find in Word is

    [*] Use wildcards

    Find
    ^l2 NICK ?@^l

    Replace
    ^l

    where ^l represents manual line break (If the file loads with paragraph marks, other processing will be needed -- since paragraph marks are NOT available when "use wildcards" is active)

    ?@ represents "any character""1 or more of previous [ie: any
    character]"

    So the whole find string is

    manual line break
    2 NICK
    any string of characters until
    manual line break

    and the replace string is

    manual line break

    (needed since we matched two line breaks, we need to put one back in)

    I'll look at XyWrite if it is still available.

    Might consider SciTE https://www.scintilla.org/SciTEDownload.html

    You'd enable regex and use as the find string

    ^2 NICK .+$

    ^ beginning of line
    2 NICK
    .+ any character, 1 or more of it
    $ end of line

    and an empty replace string

    SciTE regex work on lines, so the above should result in empty lines in the file. Depending on the line ending in the file (turn on View/End-of-Line),
    you can get rid of the empty lines by turning off regex, turning on
    backslash expressions and doing (assuming Windows standard <cr><lf>
    endings)

    find
    \r\n\r\n

    replace
    \r\n

    to compress two line endings into one line ending


    I suspect you don't have a Python interpreter installed -- the task is fairly simple as a Python script.

    -=-=-=-=- denick.py
    import sys

    for ln in sys.stdin:
    if not ln.startswith("2 NICK"):
    sys.stdout.write(ln) #might need (ln + "\n")

    -=-=-=-=-

    python denick.py <original.ged >edited.ged


    --
    bieber.genealogy@earthlink.net Dennis Lee Bieber
    HTTP://home.earthlink.net/~bieber.genealogy/

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Charlie Hoffpauir@21:1/5 to hayesstw@telkomsa.net on Thu Jun 14 09:30:17 2018
    On Thu, 14 Jun 2018 03:53:47 +0200, Steve Hayes
    <hayesstw@telkomsa.net> wrote:

    The thing is, a .GED file is a text file, and Excel doesn't really
    work with text files, at least not any of the versions I've used.

    Well, that's not really right. Excel will work quite well with text
    files, and I use it often to parse text into columnar data which I
    then import into Access. As an example, I took text from Father
    Hebert's books Southwest Louisiana Records, and using a combination of
    Excel and a text editor, parse that information into fields which I
    then imported into Access to create a searchable database.
    Excel has import from TEXT built right into the options:
    Data, Get Data, From File, From Text/CSV.
    Excel "prefers" text separated by commas or tabs. (It then
    automatically parses it into fields). But plain text will also
    import.... then it's up to the user to figure out how to parse it.
    For a GED file, there's no need to parse the text. Simply use the
    spreadsheets search and replace functions. so importing a GED into
    Excel puts everything into column A.

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From J. Hugh Sullivan@21:1/5 to bieber.genealogy@earthlink.net on Thu Jun 14 13:52:45 2018
    On Thu, 14 Jun 2018 09:28:05 -0400, Dennis Lee Bieber <bieber.genealogy@earthlink.net> wrote:

    On Thu, 14 Jun 2018 11:33:52 GMT, Eagle@bellsouth.net (J. Hugh Sullivan) >declaimed the following:


    The problem, so far, is...

    2 NICK (name)

    I can delete NICK en masse but it leaves 2 and (name). I have used
    Notepad, Wordpad and Word. Word does best but I have to eliminate more
    than 500 instances individually in more than 3,000 pages. I'm on Page
    300 now.


    Presuming ALL occurrences are on lines that start with "2 NICK", and do
    not break across lines, your search string needs to be a wild card/regex
    that specifies

    start of line
    2 NICK
    * (match anything)
    end of line

    Best I can find in Word is

    [*] Use wildcards

    Find
    ^l2 NICK ?@^l

    Replace
    ^l

    where ^l represents manual line break (If the file loads with paragraph >marks, other processing will be needed -- since paragraph marks are NOT >available when "use wildcards" is active)

    ?@ represents "any character""1 or more of previous [ie: any
    character]"

    So the whole find string is

    manual line break
    2 NICK
    any string of characters until
    manual line break

    and the replace string is

    manual line break

    (needed since we matched two line breaks, we need to put one back in)

    I'll look at XyWrite if it is still available.

    Might consider SciTE https://www.scintilla.org/SciTEDownload.html

    You'd enable regex and use as the find string

    ^2 NICK .+$

    ^ beginning of line
    2 NICK
    .+ any character, 1 or more of it
    $ end of line

    and an empty replace string

    SciTE regex work on lines, so the above should result in empty lines in the >file. Depending on the line ending in the file (turn on View/End-of-Line), >you can get rid of the empty lines by turning off regex, turning on
    backslash expressions and doing (assuming Windows standard <cr><lf>
    endings)

    find
    \r\n\r\n

    replace
    \r\n

    to compress two line endings into one line ending


    I suspect you don't have a Python interpreter installed -- the task is
    fairly simple as a Python script.

    -=-=-=-=- denick.py
    import sys

    for ln in sys.stdin:
    if not ln.startswith("2 NICK"):
    sys.stdout.write(ln) #might need (ln + "\n")

    -=-=-=-=-

    python denick.py <original.ged >edited.ged


    --
    bieber.genealogy@earthlink.net Dennis Lee Bieber HTTP://home.earthlink.net/~bieber.genealogy/

    I really appreciate your efforts. Not growing up with compuers is a
    real problem for us has-beens.

    My line appears

    2 NICK Hugh

    Others are

    2 NICK Tom
    2 NICK Dick
    2 NICK Harry
    2 NICK John
    2 NICK Bill

    I can delete the 2 NICK but not the name except one at the time

    I'll look a SciTE regex - I may can learn that before I'm 91.

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Charlie Hoffpauir@21:1/5 to john1@s145802280.onlinehome.fr on Thu Jun 14 09:39:55 2018
    On Thu, 14 Jun 2018 14:11:28 +0200, john
    <john1@s145802280.onlinehome.fr> wrote:

    On 14/06/2018 13:33, J. Hugh Sullivan wrote:
    On Thu, 14 Jun 2018 03:53:47 +0200, Steve Hayes
    <hayesstw@telkomsa.net> wrote:

    But using any text editor that does search and replace should be much
    easier than using something like Excel, which needs file conversion
    both ways.

    Steve Hayes

    I have grown older getting there. The file conversions left scars. :)

    The problem, so far, is...

    2 NICK (name)

    I can delete NICK en masse but it leaves 2 and (name). I have used
    Notepad, Wordpad and Word. Word does best but I have to eliminate more
    than 500 instances individually in more than 3,000 pages. I'm on Page
    300 now.

    I'll look at XyWrite if it is still available.

    Thankee,

    Hugh


    I think you will find XyWrite has not been available since 2003!
    According to the wiki article "Despite these advantages in speed,
    XyWrite does not have as many features as Word or OpenOffice.org. For >example, XyWrite is unaware of Windows ANSI or Unicode character sets
    and Nota Bene does not support languages (such as Chinese) that require >double-byte characters."

    Use the free Notepad++ from >https://notepad-plus-plus.org/download/v7.5.6.html It will read and
    write a text GEDCOM - you can save it with the appropriate file extension.

    A useful Help sheet >https://drive.google.com/file/d/0B86nuTd5nMTKaENHcmliUC1kdnc/edit

    Make a copy of your GEDCOM

    You can record a simple macro, save it and run it.

    I'm not sure of your exact problem but I think you need a macro to
    - search for 2 NICK (use Ctrl-F)
    - Delete Line (use Ctrl-L)
    you can then stop recording, run it a number of times or to the end of
    the file and can save the macro.

    By recording a macro you can see what your actions are on a copy of your
    main GEDCOM

    I strongly second John's advice. The only thing I'd add, is to look
    closely into using Regular Expressions. That's a very powerful feature
    that's in Notepad++ but NOT in any Word Processor that I've seen, and
    not in Notepad. RE allows you to do very complicated search and
    replace; like searching for a string of text that contains certain
    characters, and copy certain portions of the string.
    In the example above, you could easily do a search for occurances of
    the string 2 NICK and then change whatever follows 2 NICK to something
    else. (Not that you'd want to do exactly that in this case.) My point
    being that you can't do that kind of a search and replace in a word
    processor.

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From john@21:1/5 to J. Hugh Sullivan on Thu Jun 14 16:42:05 2018
    On 14/06/2018 15:52, J. Hugh Sullivan wrote:
    On Thu, 14 Jun 2018 09:28:05 -0400, Dennis Lee Bieber <bieber.genealogy@earthlink.net> wrote:

    On Thu, 14 Jun 2018 11:33:52 GMT, Eagle@bellsouth.net (J. Hugh Sullivan)
    declaimed the following:


    The problem, so far, is...

    2 NICK (name)

    I can delete NICK en masse but it leaves 2 and (name). I have used
    Notepad, Wordpad and Word. Word does best but I have to eliminate more
    than 500 instances individually in more than 3,000 pages. I'm on Page
    300 now.


    Presuming ALL occurrences are on lines that start with "2 NICK", and do >> not break across lines, your search string needs to be a wild card/regex
    that specifies

    start of line
    2 NICK
    * (match anything)
    end of line

    Best I can find in Word is

    [*] Use wildcards

    Find
    ^l2 NICK ?@^l

    Replace
    ^l

    where ^l represents manual line break (If the file loads with paragraph
    marks, other processing will be needed -- since paragraph marks are NOT
    available when "use wildcards" is active)

    ?@ represents "any character""1 or more of previous [ie: any
    character]"

    So the whole find string is

    manual line break
    2 NICK
    any string of characters until
    manual line break

    and the replace string is

    manual line break

    (needed since we matched two line breaks, we need to put one back in)

    I'll look at XyWrite if it is still available.

    Might consider SciTE https://www.scintilla.org/SciTEDownload.html

    You'd enable regex and use as the find string

    ^2 NICK .+$

    ^ beginning of line
    2 NICK
    .+ any character, 1 or more of it
    $ end of line

    and an empty replace string

    SciTE regex work on lines, so the above should result in empty lines in the >> file. Depending on the line ending in the file (turn on View/End-of-Line), >> you can get rid of the empty lines by turning off regex, turning on
    backslash expressions and doing (assuming Windows standard <cr><lf>
    endings)

    snip

    I really appreciate your efforts. Not growing up with compuers is a
    real problem for us has-beens.

    My line appears

    2 NICK Hugh

    Others are

    2 NICK Tom
    2 NICK Dick
    2 NICK Harry
    2 NICK John
    2 NICK Bill

    I can delete the 2 NICK but not the name except one at the time



    Which is much easier to do using Notepad++ and macros as I indicated
    earlier rather than struggling with regex expressions

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Charlie Hoffpauir@21:1/5 to Sullivan on Thu Jun 14 09:58:56 2018
    On Thu, 14 Jun 2018 13:52:45 GMT, Eagle@bellsouth.net (J. Hugh
    Sullivan) wrote:

    2 NICK Tom
    2 NICK Dick
    2 NICK Harry
    2 NICK John
    2 NICK Bill

    As I mentioned earlier, I use EditPlus. Here's the RE expression to
    delete all the above:

    2 NICK [A-Z][a-z]*\n


    in the replace box, enter nothing (leave it empty)

    You simply enter this in the search box a click use regular
    expressions, and all those entries are gone in an instant.

    I don't use Notepad++ but I'd imagine that the RE string would be very
    similar, if not identical.

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From J. Hugh Sullivan@21:1/5 to All on Thu Jun 14 14:36:09 2018
    On Tue, 12 Jun 2018 02:10:04 GMT, Eagle@bellsouth.net (J. Hugh
    Sullivan) wrote:

    Eureka - I think.

    The double name on a Legacy .ged was associated with 2 NICK. On RM it
    is an Alt Name fact.

    I imported the Legacy database to RM and exported an RM .ged. There is
    a place where one can include or exclude listed facts to be exported -
    one is Alt Name.

    I excluded Alt Name facts, did an RM .GED and imported back to RM.

    I have been looking at 3 different individuals all along to see if
    they changed. They finally disappeared with the above method.

    I appreciate all the help.

    Hugh

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Charlie Hoffpauir@21:1/5 to Sullivan on Thu Jun 14 12:52:46 2018
    On Thu, 14 Jun 2018 17:32:05 GMT, Eagle@bellsouth.net (J. Hugh
    Sullivan) wrote:


    At my age I'm supposed to have dementia - but I keep forgetting. :)

    LOL. You must be over 78 because I can't remember whether I've forgot
    something or not.

    Regular Expressions are kind of like leaning a second language. You
    can use as many of the features as you want, ignoring everything that
    seems complicated.
    In the example I posted elsewhere in the thread..

    2 NICK [A-Z][a-z]*\n

    The search string simply means

    Look for the string that starts with the phrase "2 NICK "
    followed by any one capitalized letter
    followed by any number of lower case letters strung together ([a-z]*)
    followed by an end of line character (\n)
    and replace all of that with nothing.

    These text processors also have macro capabilities, but like you, I
    tend to avoid trying to do that.

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From john@21:1/5 to J. Hugh Sullivan on Thu Jun 14 20:01:59 2018
    On 14/06/2018 19:35, J. Hugh Sullivan wrote:
    On Thu, 14 Jun 2018 16:42:05 +0200, john
    <john1@s145802280.onlinehome.fr> wrote:

    Which is much easier to do using Notepad++ and macros as I indicated
    earlier rather than struggling with regex expressions

    Macros scare me. I'm a control freak and I would never be sure of what
    I had done with a large volume of data.

    But I can't quit trying...

    Hugh


    That was why I suggested using a copy of your file to play with and get
    it working correctly to your satisfaction!

    You can see what the macro is doing as you are creating it step by step.

    Once created, NotePad++ allows you to run it however many times you want
    so you can see the effect or just let it run to the end of the file.
    Only when you are satisfied do you need to run it on a full GEDCOM.

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From J. Hugh Sullivan@21:1/5 to john1@s145802280.onlinehome.fr on Thu Jun 14 17:35:57 2018
    On Thu, 14 Jun 2018 16:42:05 +0200, john
    <john1@s145802280.onlinehome.fr> wrote:

    Which is much easier to do using Notepad++ and macros as I indicated
    earlier rather than struggling with regex expressions

    Macros scare me. I'm a control freak and I would never be sure of what
    I had done with a large volume of data.

    But I can't quit trying...

    Hugh

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From J. Hugh Sullivan@21:1/5 to invalid@invalid.com on Thu Jun 14 17:32:05 2018
    On Thu, 14 Jun 2018 09:39:55 -0500, Charlie Hoffpauir
    <invalid@invalid.com> wrote:

    I strongly second John's advice. The only thing I'd add, is to look
    closely into using Regular Expressions. That's a very powerful feature
    that's in Notepad++ but NOT in any Word Processor that I've seen, and
    not in Notepad. RE allows you to do very complicated search and
    replace; like searching for a string of text that contains certain >characters, and copy certain portions of the string.
    In the example above, you could easily do a search for occurances of
    the string 2 NICK and then change whatever follows 2 NICK to something
    else. (Not that you'd want to do exactly that in this case.) My point
    being that you can't do that kind of a search and replace in a word >processor.

    I did change all the 2 NICK lines to XXXXXXXXXXXXXXXXXXXXXXX but
    couldn't delete the names except one at a time.

    My attention span, recall and concentration is not what it used to be
    so I lack self-confidence trying to learn RE. I'm probably
    underselling myself but it would be an effort. I'll look into ++.

    I'm preparing to inventory my foreign stamp collection - not for
    value, just to see what I have and what I'm missing and want to buy on
    eBay. There are more than 3,000 entities that have issued stamps since
    1840 and several lists of those are available. However each is
    slightly differrent and all have inconsistencies I can't tolerate. I'm
    using Excel to create MY list. I like consistency in punctuation, capitalization, appearance, etc. Sounds like ++ might simplify my
    efforts.

    At my age I'm supposed to have dementia - but I keep forgetting. :)

    Hugh

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From J. Hugh Sullivan@21:1/5 to john1@s145802280.onlinehome.fr on Thu Jun 14 19:22:03 2018
    On Thu, 14 Jun 2018 20:01:59 +0200, john
    <john1@s145802280.onlinehome.fr> wrote:


    That was why I suggested using a copy of your file to play with and get
    it working correctly to your satisfaction!

    You can see what the macro is doing as you are creating it step by step.

    Comprende.

    Once created, NotePad++ allows you to run it however many times you want
    so you can see the effect or just let it run to the end of the file.
    Only when you are satisfied do you need to run it on a full GEDCOM.

    Fortunately RM seemes to have solved my problem - don't include Alt
    Name in exporting .ged.

    Hugh

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From J. Hugh Sullivan@21:1/5 to invalid@invalid.com on Thu Jun 14 19:18:37 2018
    On Thu, 14 Jun 2018 12:52:46 -0500, Charlie Hoffpauir
    <invalid@invalid.com> wrote:

    On Thu, 14 Jun 2018 17:32:05 GMT, Eagle@bellsouth.net (J. Hugh
    Sullivan) wrote:


    At my age I'm supposed to have dementia - but I keep forgetting. :)

    LOL. You must be over 78 because I can't remember whether I've forgot >something or not.

    I'm 90 1/2 - WWII Vet. My AcDu was enlisted but I retired from the
    Reserve as an 0-5.

    Regular Expressions are kind of like leaning a second language. You
    can use as many of the features as you want, ignoring everything that
    seems complicated.
    In the example I posted elsewhere in the thread..

    2 NICK [A-Z][a-z]*\n

    The search string simply means

    Look for the string that starts with the phrase "2 NICK "
    followed by any one capitalized letter
    followed by any number of lower case letters strung together ([a-z]*) >followed by an end of line character (\n)
    and replace all of that with nothing.

    I could do that. Is RE a program or an add-on to a program?

    I d'led ++ and didn't even like the first screen - every other word
    was black highlighted.

    Hugh

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Charlie Hoffpauir@21:1/5 to Sullivan on Thu Jun 14 15:10:00 2018
    On Thu, 14 Jun 2018 19:18:37 GMT, Eagle@bellsouth.net (J. Hugh
    Sullivan) wrote:

    On Thu, 14 Jun 2018 12:52:46 -0500, Charlie Hoffpauir
    <invalid@invalid.com> wrote:

    On Thu, 14 Jun 2018 17:32:05 GMT, Eagle@bellsouth.net (J. Hugh
    Sullivan) wrote:


    At my age I'm supposed to have dementia - but I keep forgetting. :)

    LOL. You must be over 78 because I can't remember whether I've forgot >>something or not.

    I'm 90 1/2 - WWII Vet. My AcDu was enlisted but I retired from the
    Reserve as an 0-5.

    Regular Expressions are kind of like leaning a second language. You
    can use as many of the features as you want, ignoring everything that
    seems complicated.
    In the example I posted elsewhere in the thread..

    2 NICK [A-Z][a-z]*\n

    The search string simply means

    Look for the string that starts with the phrase "2 NICK "
    followed by any one capitalized letter
    followed by any number of lower case letters strung together ([a-z]*) >>followed by an end of line character (\n)
    and replace all of that with nothing.

    I could do that. Is RE a program or an add-on to a program?

    I don't really know "technically"... I know the features of it are
    included in lots of software (that I've heard of but don't use). It's
    a part of EditPlus... that is, it's included in the software, but for
    all I know the makers of EditPlus might have to license it from
    someone.

    I d'led ++ and didn't even like the first screen - every other word
    was black highlighted.

    Yes, that is disconcerting. I've looked at Notepad++ after I heard
    about it.... but I'd already become comfortable with EditPlus, so I
    didn't think it worthwhile to try to be familiar with both.
    The thing is, both programs are designed primarily for software
    programmers. Notepad++ has lots of configuration options so that it
    can be tailored to editing different kinds of software. But for me,
    this just presented another learning curve.

    For me, EditPlus presents a cleaner view. Load a GEDCOM and it just
    looks like it would if you opened it in Word or Notepad.



    Hugh

    You mentioned working on your stamp collection earlier. Is it already
    in some form of text file, and you just want to edit it for
    consistancy? If you're working on it in Excel maybe you're treating it
    as a database. Excel is nice for working on what is basically a "flat
    file" database. The disadvantage is getting good reports out of it.
    Since Access is included with Office 365, it's really convenient to do
    the editing in Excel, load the Excel file into Access, and generate
    neat report(s) from there.

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From J. Hugh Sullivan@21:1/5 to invalid@invalid.com on Thu Jun 14 22:54:08 2018
    On Thu, 14 Jun 2018 15:10:00 -0500, Charlie Hoffpauir
    <invalid@invalid.com> wrote:

    For me, EditPlus presents a cleaner view. Load a GEDCOM and it just
    looks like it would if you opened it in Word or Notepad.

    EditPad Lite appears to do the same thing and can activate some
    functions that Excel does not without some maneuvering. Immediate all
    caps or lower case for example and it has RE.

    You mentioned working on your stamp collection earlier. Is it already
    in some form of text file, and you just want to edit it for
    consistancy? If you're working on it in Excel maybe you're treating it
    as a database. Excel is nice for working on what is basically a "flat
    file" database. The disadvantage is getting good reports out of it.
    Since Access is included with Office 365, it's really convenient to do
    the editing in Excel, load the Excel file into Access, and generate
    neat report(s) from there.

    I d'led one on the better listings of entities that produced stamps
    from 1840. There are multiple variations of form of government, war,
    colony, independent, republic, etc. for most entities. Of course
    almost every entity has produced 100s-1000s of different stamps. I
    trying to shrink to country and colony by printing years without
    losing valuable data to make eBay purchases manageable.

    I copy pasted the better list to Excel in text format. I'm doing
    combos, deletes, punctuation, i. e., personalization. All I want is a
    list of the type stamp I have and the missing types. I want to fill in
    the blanks up to a certain level of expense.

    Excel is perfect for that and I can port to EditPad for some action. I
    still enjoy busy work up to 10 hours per day.

    The first few lines look like below exept better formatted in columns.


    Major entity Minor entity Entity type Start End
    Scott
    Abu Dhabi British Protectorate 1964 1972 ?
    Aden British Colony 1937 1965 ?
    Aden Kathiri State of Seiyun Aden States 1942 1964
    ?
    Aden Kathiri State in Hadhramaut Aden States 1967
    1968 ?
    Aden Qu'aiti State of Shihr and Mukalla Aden States
    1942 1951 ?
    Aden Mahra State Qishn and Socotra Aden States 1967
    1967 ?
    Aden State of Upper Yafa Aden States 1967 1967
    ?
    Italy Aegean Islands EGEO Federation 1912 1912
    ?

    Hugh

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Charlie Hoffpauir@21:1/5 to Sullivan on Thu Jun 14 18:36:32 2018
    On Thu, 14 Jun 2018 22:54:08 GMT, Eagle@bellsouth.net (J. Hugh
    Sullivan) wrote:

    On Thu, 14 Jun 2018 15:10:00 -0500, Charlie Hoffpauir
    <invalid@invalid.com> wrote:

    For me, EditPlus presents a cleaner view. Load a GEDCOM and it just
    looks like it would if you opened it in Word or Notepad.

    EditPad Lite appears to do the same thing and can activate some
    functions that Excel does not without some maneuvering. Immediate all
    caps or lower case for example and it has RE.

    You mentioned working on your stamp collection earlier. Is it already
    in some form of text file, and you just want to edit it for
    consistancy? If you're working on it in Excel maybe you're treating it
    as a database. Excel is nice for working on what is basically a "flat
    file" database. The disadvantage is getting good reports out of it.
    Since Access is included with Office 365, it's really convenient to do
    the editing in Excel, load the Excel file into Access, and generate
    neat report(s) from there.

    I d'led one on the better listings of entities that produced stamps
    from 1840. There are multiple variations of form of government, war,
    colony, independent, republic, etc. for most entities. Of course
    almost every entity has produced 100s-1000s of different stamps. I
    trying to shrink to country and colony by printing years without
    losing valuable data to make eBay purchases manageable.

    I copy pasted the better list to Excel in text format. I'm doing
    combos, deletes, punctuation, i. e., personalization. All I want is a
    list of the type stamp I have and the missing types. I want to fill in
    the blanks up to a certain level of expense.

    Excel is perfect for that and I can port to EditPad for some action. I
    still enjoy busy work up to 10 hours per day.

    The first few lines look like below exept better formatted in columns.


    Major entity Minor entity Entity type Start End
    Scott
    Abu Dhabi British Protectorate 1964 1972 ?
    Aden British Colony 1937 1965 ?
    Aden Kathiri State of Seiyun Aden States 1942 1964
    ?
    Aden Kathiri State in Hadhramaut Aden States 1967
    1968 ?
    Aden Qu'aiti State of Shihr and Mukalla Aden States
    1942 1951 ?
    Aden Mahra State Qishn and Socotra Aden States 1967
    1967 ?
    Aden State of Upper Yafa Aden States 1967 1967
    ?
    Italy Aegean Islands EGEO Federation 1912 1912
    ?

    Hugh

    Seems perfect application for cleaning in Excel.
    The nice things about considering importing to Access:
    Access makes it really easy to get the data into Access.
    Once in Access it easy to extract whatever information you're
    interested in looking at closely.

    I don't keep up with the latest software so much now... but thanks for mentioning Editpad lite. I'll give it a look.

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Steve Hayes@21:1/5 to invalid@invalid.com on Fri Jun 15 08:00:59 2018
    On Thu, 14 Jun 2018 09:39:55 -0500, Charlie Hoffpauir
    <invalid@invalid.com> wrote:

    On Thu, 14 Jun 2018 14:11:28 +0200, john
    <john1@s145802280.onlinehome.fr> wrote:
    By recording a macro you can see what your actions are on a copy of your >>main GEDCOM

    I strongly second John's advice. The only thing I'd add, is to look
    closely into using Regular Expressions. That's a very powerful feature
    that's in Notepad++ but NOT in any Word Processor that I've seen, and
    not in Notepad. RE allows you to do very complicated search and
    replace; like searching for a string of text that contains certain >characters, and copy certain portions of the string.
    In the example above, you could easily do a search for occurances of
    the string 2 NICK and then change whatever follows 2 NICK to something
    else. (Not that you'd want to do exactly that in this case.) My point
    being that you can't do that kind of a search and replace in a word >processor.

    I'm not sure exactly what he does want to do, but in XyWrite (and I
    think most other word processors have the equivalent) it is dead easy
    to type
    cha |2 NICK Sim|2 NICK Something Else|

    Which changes this:

    0 @I3910@ INDI
    1 NAME Simeon /Growdon/
    2 GIVN Simeon
    2 SURN Growdon
    2 NICK Sim
    1 NAME Sim /Growdon/
    2 GIVN Sim
    2 SURN Growdon
    2 TYPE nickname
    1 SEX M
    1 _UID 5B42F25016F04D1FB4E46962C25817957A5A

    to

    0 @I3910@ INDI
    1 NAME Simeon /Growdon/
    2 GIVN Simeon
    2 SURN Growdon
    2 NICK Something Else
    1 NAME Sim /Growdon/
    2 GIVN Sim
    2 SURN Growdon
    2 TYPE nickname
    1 SEX M
    1 _UID 5B42F25016F04D1FB4E46962C25817957A5A

    and would do so throughout the file in less than a second.







    --
    Steve Hayes
    http://www.khanya.org.za/stevesig.htm
    http://khanya.wordpress.com

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Steve Hayes@21:1/5 to Sullivan on Fri Jun 15 08:11:46 2018
    On Thu, 14 Jun 2018 13:52:45 GMT, Eagle@bellsouth.net (J. Hugh
    Sullivan) wrote:

    My line appears

    2 NICK Hugh

    Others are

    2 NICK Tom
    2 NICK Dick
    2 NICK Harry
    2 NICK John
    2 NICK Bill

    I can delete the 2 NICK but not the name except one at the time

    Again, I'm not sure what the commands are in other word processors,
    but in XyWrite it is a trivial task.

    Using my previous example, typing

    cha |->@ NICK Sim||

    produced this:

    0 @I3910@ INDI?
    1 NAME Simeon /Growdon/?
    2 GIVN Simeon?
    2 SURN Growdon?
    1 NAME Sim /Growdon/?
    2 GIVN Sim?
    2 SURN Growdon?
    2 TYPE nickname?
    1 SEX M?
    1 _UID 5B42F25016F04D1FB4E46962C25817957A5A?

    the line 2 NICK Sim has gone (the -> is an approximation for the CR/LF
    in a XyWrite screen display).



    --
    Steve Hayes
    http://www.khanya.org.za/stevesig.htm
    http://khanya.wordpress.com

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Steve Hayes@21:1/5 to invalid@invalid.com on Fri Jun 15 08:15:41 2018
    On Thu, 14 Jun 2018 09:30:17 -0500, Charlie Hoffpauir
    <invalid@invalid.com> wrote:

    On Thu, 14 Jun 2018 03:53:47 +0200, Steve Hayes
    <hayesstw@telkomsa.net> wrote:

    The thing is, a .GED file is a text file, and Excel doesn't really
    work with text files, at least not any of the versions I've used.

    Well, that's not really right. Excel will work quite well with text
    files, and I use it often to parse text into columnar data which I
    then import into Access. As an example, I took text from Father
    Hebert's books Southwest Louisiana Records, and using a combination of
    Excel and a text editor, parse that information into fields which I
    then imported into Access to create a searchable database.
    Excel has import from TEXT built right into the options:
    Data, Get Data, From File, From Text/CSV.
    Excel "prefers" text separated by commas or tabs. (It then
    automatically parses it into fields). But plain text will also
    import.... then it's up to the user to figure out how to parse it.
    For a GED file, there's no need to parse the text. Simply use the >spreadsheets search and replace functions. so importing a GED into
    Excel puts everything into column A.

    I'l ask my wife about that -- she's an Excel fundi, and can do amazing
    things with spreadsheets.

    But yes, that does sound as if it would work just like a word
    processor.


    --
    Steve Hayes
    http://www.khanya.org.za/stevesig.htm
    http://khanya.wordpress.com

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From J. Hugh Sullivan@21:1/5 to hayesstw@telkomsa.net on Fri Jun 15 10:09:32 2018
    On Fri, 15 Jun 2018 08:00:59 +0200, Steve Hayes
    <hayesstw@telkomsa.net> wrote:

    I'm not sure exactly what he does want to do, but in XyWrite (and I
    think most other word processors have the equivalent) it is dead easy
    to type
    cha |2 NICK Sim|2 NICK Something Else|

    Which changes this:

    0 @I3910@ INDI
    1 NAME Simeon /Growdon/
    2 GIVN Simeon
    2 SURN Growdon
    2 NICK Sim
    1 NAME Sim /Growdon/
    2 GIVN Sim
    2 SURN Growdon
    2 TYPE nickname
    1 SEX M
    1 _UID 5B42F25016F04D1FB4E46962C25817957A5A

    to

    0 @I3910@ INDI
    1 NAME Simeon /Growdon/
    2 GIVN Simeon
    2 SURN Growdon
    2 NICK Something Else
    1 NAME Sim /Growdon/
    2 GIVN Sim
    2 SURN Growdon
    2 TYPE nickname
    1 SEX M
    1 _UID 5B42F25016F04D1FB4E46962C25817957A5A

    and would do so throughout the file in less than a second.

    But "2 NICK Something Else" is not removed. I can do that.

    I don't want to remove 2 NICK, I want to remove the "Something Else".

    Fortunately I discovered how to do that in the RM program export to
    ged and import of the result.

    Use of a proper "string", as suggested by John and Charlie, would
    work, I'm sure.

    Hugh

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From J. Hugh Sullivan@21:1/5 to invalid@invalid.com on Fri Jun 15 10:19:39 2018
    On Thu, 14 Jun 2018 18:36:32 -0500, Charlie Hoffpauir
    <invalid@invalid.com> wrote:

    Seems perfect application for cleaning in Excel.

    Works very well. After 6 months of work I have established workable
    parameters.

    The nice things about considering importing to Access:
    Access makes it really easy to get the data into Access.
    Once in Access it easy to extract whatever information you're
    interested in looking at closely.

    I post the number of stamps I have in each category in columns for
    1840-1940 and 1941 to whenever. The spot is blank if I don't have any
    stamps in the category. It's pretty simple to sort between entry and
    no entry - Find, Replace in red, Sort to top.

    But I have the list on the left of the screen and eBay availables on
    the right. Not making a mistake requires every i to be dotted and
    every t to be crossed.

    I don't keep up with the latest software so much now... but thanks for >mentioning Editpad lite. I'll give it a look.

    The Pro is $50 but a Lite version of most programs usually serves my
    minimal purposes.

    Hugh

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Charlie Hoffpauir@21:1/5 to hayesstw@telkomsa.net on Fri Jun 15 10:21:00 2018
    On Fri, 15 Jun 2018 08:11:46 +0200, Steve Hayes
    <hayesstw@telkomsa.net> wrote:

    On Thu, 14 Jun 2018 13:52:45 GMT, Eagle@bellsouth.net (J. Hugh
    Sullivan) wrote:

    My line appears

    2 NICK Hugh

    Others are

    2 NICK Tom
    2 NICK Dick
    2 NICK Harry
    2 NICK John
    2 NICK Bill

    I can delete the 2 NICK but not the name except one at the time

    Again, I'm not sure what the commands are in other word processors,
    but in XyWrite it is a trivial task.

    Using my previous example, typing

    cha |->@ NICK Sim||

    produced this:

    0 @I3910@ INDI?
    1 NAME Simeon /Growdon/?
    2 GIVN Simeon?
    2 SURN Growdon?
    1 NAME Sim /Growdon/?
    2 GIVN Sim?
    2 SURN Growdon?
    2 TYPE nickname?
    1 SEX M?
    1 _UID 5B42F25016F04D1FB4E46962C25817957A5A?

    the line 2 NICK Sim has gone (the -> is an approximation for the CR/LF
    in a XyWrite screen display).

    Maybe it's just semantics. I think I used XY write years ago, but if
    so, I've forgot what the features were.
    My point is that I haven't seen (at least recently) a word processor
    that will handle Regular Expressions... and Regular Expressions are
    really powerful tools in editing text.

    In the example that Hugh gave, where scattered throughout the GED file
    are nicknames that appear like the list

    2 NICK Tom
    2 NICK Dick
    2 NICK Harry
    2 NICK John
    2 NICK Bill

    and he wants to eliminate ALL of them. In his word processor (or in
    Excel) he can either do a search for <2 NICK> and replace with
    nothing, or he can simply search through the file, looking for the
    occurance of <2 NICK> and when found, delete the entire line. He was
    actually doing the latter, until he found that RM could remove them on
    export of a GED, which could then be re-imported thus removing them
    all.

    With RE capability, a single search/replace command can and will
    remove all the lines containing <2 NICK> AND will also delete anything following <2 NICK> as long as what follows "fits" the criteria of a
    single capitalized letter followed by any number of consecutive lower
    case letters (same as each other or different) followed by an end of
    line character.

    2 NICK [A-Z][a-z]*\n

    Think of what this means.....
    If somehow Hugh had used the phrase "John appears in the program
    GEDCOM as '2 NICK John' but I don't want that to show" within a note
    in his program, that phrase would not be deleted from the note,
    because it doesn't fit the strict structure of the RE expression. (no
    end of line character following everything else that "matched" the
    search string).

    But this example mearly scratches the surface of the capabilities of
    Regular Expressions.

    I'd really like to find a word processor program that will do this
    kind of editing. If XY Write will, I'd use it no matter how old it is.

    A final comment.... Hugh's final solution is actually much better than
    my suggestion, because there are a number of situations where my
    search and replace will fail. Say the nichname is <John Boy> or
    <Little-John> or even <littlejohn>. My search criteria fails to delete
    those from the GEDCOM. However, that doesn't mean that the principle
    fails. A more elaborate RE expression would correct those errors.

    2 NICK [A-z, ,-]*\n

    There, that seems to work.

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Dennis Lee Bieber@21:1/5 to All on Fri Jun 15 17:34:02 2018
    On Fri, 15 Jun 2018 10:21:00 -0500, Charlie Hoffpauir <invalid@invalid.com> declaimed the following:

    In the example that Hugh gave, where scattered throughout the GED file
    are nicknames that appear like the list

    2 NICK Tom
    2 NICK Dick
    2 NICK Harry
    2 NICK John
    2 NICK Bill

    and he wants to eliminate ALL of them. In his word processor (or in
    Excel) he can either do a search for <2 NICK> and replace with
    nothing, or he can simply search through the file, looking for the
    occurance of <2 NICK> and when found, delete the entire line. He was
    actually doing the latter, until he found that RM could remove them on
    export of a GED, which could then be re-imported thus removing them
    all.

    <SNIP>



    A final comment.... Hugh's final solution is actually much better than
    my suggestion, because there are a number of situations where my
    search and replace will fail. Say the nichname is <John Boy> or
    <Little-John> or even <littlejohn>. My search criteria fails to delete
    those from the GEDCOM. However, that doesn't mean that the principle
    fails. A more elaborate RE expression would correct those errors.

    2 NICK [A-z, ,-]*\n

    There, that seems to work.

    In SciTE

    ^2 NICK .*$

    converts (using your text...) to

    -=-=-=-=-
    In the example that Hugh gave, where scattered throughout the GED file
    are nicknames that appear like the list







    and he wants to eliminate ALL of them. In his word processor (or in
    Excel) he can either do a search for <2 NICK> and replace with
    nothing, or he can simply search through the file, looking for the
    occurance of <2 NICK> and when found, delete the entire line. He was
    actually doing the latter, until he found that RM could remove them on
    export of a GED, which could then be re-imported thus removing them
    all.
    -=-=-=-=-

    Unfortunately, SciTE is line-based, and while ^ is beginning of line, $ is end of line, the regex mode will not allow matching the line end (which
    in the cut&paste, is using \r\n ). One can, subsequently, change the toggle from regex to \mode, and do a search for

    find:
    \r\n\r\n
    replace:
    \r\n

    (and repeat a few times if needed) and get
    -=-=-=-=-
    In the example that Hugh gave, where scattered throughout the GED file
    are nicknames that appear like the list
    and he wants to eliminate ALL of them. In his word processor (or in
    Excel) he can either do a search for <2 NICK> and replace with
    nothing, or he can simply search through the file, looking for the
    occurance of <2 NICK> and when found, delete the entire line. He was
    actually doing the latter, until he found that RM could remove them on
    export of a GED, which could then be re-imported thus removing them
    all.
    -=-=-=-=-

    Saving the original clip and importing into M$ Word shows the first problem: the \r\n imports as a Word PARAGRAPH marker, and the advanced
    search feature doesn't allow for matching paragraph marks. So... First step

    Replace

    Find:
    ^p
    Replace:
    ^l

    Replace all... to convert paragraph marks to "manual line ends".

    Second pass, turn on [X] Use Wildcards

    Find:
    ^l2 NICK ?@^l
    Replace:
    ^l

    (since I'm matching a line break on both ends, I need to put one end back
    into the file)

    For some reason I had to hit Replace All three times... It might be okay in
    the real GEDCOM file, but it seems that the repeat starts after the
    replaced line break, and hence it only activated on the odd number lines (Tom/Harry/Bill removed first time, Dick second time, and finally John).

    Using a find string of
    <2 NICK ?@^l
    and empty replace string results in

    -=-=-=-=-
    In the example that Hugh gave, where scattered throughout the GED file
    are nicknames that appear like the list


    and he wants to eliminate ALL of them. In his word processor (or in
    Excel) he can either do a search for <2 NICK> and replace with
    nothing, or he can simply search through the file, looking for the
    occurance of <2 NICK> and when found, delete the entire line. He was
    actually doing the latter, until he found that RM could remove them on
    export of a GED, which could then be re-imported thus removing them
    all.
    -=-=-=-=-

    SO... M$ Word /can/ do this with the one added step of first converting paragraph breaks to manual line breaks. This first step is done with
    [ ] Use Wildcards
    off, then the second step has
    [X] Use Wildcards
    on, to enable M$ Word's form of regex.

    If you open the [More>>] version of find/replace, the [Special] drop down list will show you what you can search for in each state.





    --
    bieber.genealogy@earthlink.net Dennis Lee Bieber
    HTTP://home.earthlink.net/~bieber.genealogy/

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Steve Hayes@21:1/5 to invalid@invalid.com on Sat Jun 16 07:56:00 2018
    On Fri, 15 Jun 2018 10:21:00 -0500, Charlie Hoffpauir
    <invalid@invalid.com> wrote:

    On Fri, 15 Jun 2018 08:11:46 +0200, Steve Hayes
    <hayesstw@telkomsa.net> wrote:

    On Thu, 14 Jun 2018 13:52:45 GMT, Eagle@bellsouth.net (J. Hugh
    Sullivan) wrote:

    My line appears

    2 NICK Hugh

    Others are

    2 NICK Tom
    2 NICK Dick
    2 NICK Harry
    2 NICK John
    2 NICK Bill

    I can delete the 2 NICK but not the name except one at the time

    Again, I'm not sure what the commands are in other word processors,
    but in XyWrite it is a trivial task.

    Using my previous example, typing

    cha |->@ NICK Sim||

    produced this:

    0 @I3910@ INDI?
    1 NAME Simeon /Growdon/?
    2 GIVN Simeon?
    2 SURN Growdon?
    1 NAME Sim /Growdon/?
    2 GIVN Sim?
    2 SURN Growdon?
    2 TYPE nickname?
    1 SEX M?
    1 _UID 5B42F25016F04D1FB4E46962C25817957A5A?

    the line 2 NICK Sim has gone (the -> is an approximation for the CR/LF
    in a XyWrite screen display).

    Maybe it's just semantics. I think I used XY write years ago, but if
    so, I've forgot what the features were.
    My point is that I haven't seen (at least recently) a word processor
    that will handle Regular Expressions... and Regular Expressions are
    really powerful tools in editing text.

    In the example that Hugh gave, where scattered throughout the GED file
    are nicknames that appear like the list

    2 NICK Tom
    2 NICK Dick
    2 NICK Harry
    2 NICK John
    2 NICK Bill

    and he wants to eliminate ALL of them. In his word processor (or in
    Excel) he can either do a search for <2 NICK> and replace with
    nothing, or he can simply search through the file, looking for the
    occurance of <2 NICK> and when found, delete the entire line. He was
    actually doing the latter, until he found that RM could remove them on
    export of a GED, which could then be re-imported thus removing them
    all.

    With RE capability, a single search/replace command can and will
    remove all the lines containing <2 NICK> AND will also delete anything >following <2 NICK> as long as what follows "fits" the criteria of a
    single capitalized letter followed by any number of consecutive lower
    case letters (same as each other or different) followed by an end of
    line character.

    2 NICK [A-Z][a-z]*\n

    Think of what this means.....
    If somehow Hugh had used the phrase "John appears in the program
    GEDCOM as '2 NICK John' but I don't want that to show" within a note
    in his program, that phrase would not be deleted from the note,
    because it doesn't fit the strict structure of the RE expression. (no
    end of line character following everything else that "matched" the
    search string).

    But this example mearly scratches the surface of the capabilities of
    Regular Expressions.

    I'd really like to find a word processor program that will do this
    kind of editing. If XY Write will, I'd use it no matter how old it is.

    XyWrite can do that kind of of thing, though I'd have to do some
    reading in the manual to find out how. In addition to macros, it also
    has XPL, the XyWrite programming langualge, so it's more than a word
    processor. It's like a Lego kit word processor, you can build it to do
    various specialised tasks, but not, as someone pointed out, in
    Unicode.

    I believe AWK can do that kind of thing too, and at various tims I've
    tried to learn AWK because I've thought, from its description, that it
    could do amazing things with Gedcom files, but I've never managed to
    learn it well enough to do anything useful.

    But my problem is that I'm not sure exactly what Hugh wants to do.

    I gather he wants to leave the 2 NICK line in the Gedcom file, but
    with nothing following it. What is the point of that?


    --
    Steve Hayes
    http://www.khanya.org.za/stevesig.htm
    http://khanya.wordpress.com

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From john@21:1/5 to Charlie Hoffpauir on Sat Jun 16 10:41:34 2018
    On 15/06/2018 17:21, Charlie Hoffpauir wrote:
    On Fri, 15 Jun 2018 08:11:46 +0200, Steve Hayes
    <hayesstw@telkomsa.net> wrote:

    On Thu, 14 Jun 2018 13:52:45 GMT, Eagle@bellsouth.net (J. Hugh
    Sullivan) wrote:

    My line appears

    2 NICK Hugh

    Others are

    2 NICK Tom
    2 NICK Dick
    2 NICK Harry
    2 NICK John
    2 NICK Bill

    I can delete the 2 NICK but not the name except one at the time

    Again, I'm not sure what the commands are in other word processors,
    but in XyWrite it is a trivial task.

    Using my previous example, typing

    cha |->@ NICK Sim||

    produced this:

    0 @I3910@ INDI?
    1 NAME Simeon /Growdon/?
    2 GIVN Simeon?
    2 SURN Growdon?
    1 NAME Sim /Growdon/?
    2 GIVN Sim?
    2 SURN Growdon?
    2 TYPE nickname?
    1 SEX M?
    1 _UID 5B42F25016F04D1FB4E46962C25817957A5A?

    the line 2 NICK Sim has gone (the -> is an approximation for the CR/LF
    in a XyWrite screen display).

    Maybe it's just semantics. I think I used XY write years ago, but if
    so, I've forgot what the features were.
    My point is that I haven't seen (at least recently) a word processor
    that will handle Regular Expressions... and Regular Expressions are
    really powerful tools in editing text.

    In the example that Hugh gave, where scattered throughout the GED file
    are nicknames that appear like the list

    2 NICK Tom
    2 NICK Dick
    2 NICK Harry
    2 NICK John
    2 NICK Bill

    and he wants to eliminate ALL of them. In his word processor (or in
    Excel) he can either do a search for <2 NICK> and replace with
    nothing, or he can simply search through the file, looking for the
    occurance of <2 NICK> and when found, delete the entire line. He was
    actually doing the latter, until he found that RM could remove them on
    export of a GED, which could then be re-imported thus removing them
    all.

    With RE capability, a single search/replace command can and will
    remove all the lines containing <2 NICK> AND will also delete anything following <2 NICK> as long as what follows "fits" the criteria of a
    single capitalized letter followed by any number of consecutive lower
    case letters (same as each other or different) followed by an end of
    line character.

    2 NICK [A-Z][a-z]*\n

    Think of what this means.....
    If somehow Hugh had used the phrase "John appears in the program
    GEDCOM as '2 NICK John' but I don't want that to show" within a note
    in his program, that phrase would not be deleted from the note,
    because it doesn't fit the strict structure of the RE expression. (no
    end of line character following everything else that "matched" the
    search string).

    But this example mearly scratches the surface of the capabilities of
    Regular Expressions.

    I'd really like to find a word processor program that will do this
    kind of editing. If XY Write will, I'd use it no matter how old it is.

    A final comment.... Hugh's final solution is actually much better than
    my suggestion, because there are a number of situations where my
    search and replace will fail. Say the nichname is <John Boy> or
    <Little-John> or even <littlejohn>. My search criteria fails to delete
    those from the GEDCOM. However, that doesn't mean that the principle
    fails. A more elaborate RE expression would correct those errors.

    2 NICK [A-z, ,-]*\n

    There, that seems to work.


    One potential problem with Regular Expressions (regex_ in
    word-processing programs is how the embedded text formatting is handled.
    There is limited regex available in later versions of MS Word; see the
    bottom of this page https://support.office.com/en-us/article/find-and-replace-text-and-other-data-in-a-word-document-c6728c16-469e-43cd-afe4-7708c6c779b7?ocmsassetID=HA102350661&CorrelationId=946bf317-fe64-40a3-9201-0d9661c4fdbc&ui=en-US&rs=en-US&ad=US

    But you are probably better off using a text editor unless you really
    want to keep formatting. I earlier suggested NotePad+++. Another good
    free text editor is PSPad. Both are updated regularly, support different character encoding, support regex, etc. Like many better text editors,
    they both also have file compare. So you can see two files side-by-side
    with similar, differences, and additions highlighted (which, in this
    current GEDCOM case, can be useful comparing your editing actions with a
    copy of the original file).

    regex is very powerful if you use it often enough to remember all the functions. Hugh had a simple problem. If he hadn't solved it with RM I'm
    sure writing a simple macro would have been much easier, rather than
    having to work out the appropriate regex expression to handle all the
    cases which might have occurred in the file.

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From J. Hugh Sullivan@21:1/5 to hayesstw@telkomsa.net on Sat Jun 16 11:47:39 2018
    On Sat, 16 Jun 2018 07:56:00 +0200, Steve Hayes
    <hayesstw@telkomsa.net> wrote:

    But my problem is that I'm not sure exactly what Hugh wants to do.

    I gather he wants to leave the 2 NICK line in the Gedcom file, but
    with nothing following it. What is the point of that?

    I wanted to exclude the entire 2 NICK line but everything I tried
    deleted 2 NICK but left the name. I don't know macros or string
    searches.

    When entering a NAME I enter full name, name called in " " if the
    middle name, nickname in " ". So the data is all on one line. Legacy
    and RM also picked up names in " " as NICK or ALT NAME.

    As mentioned RM retains the FACT but allows the exclusion of some FACT
    data.

    Most likely I will return to RM as my base program. I had avoided
    using either program as the base for .ged transfers because the Name
    Source in RM moves to Unspecified in Legacy. Even so the name is still
    sourced in Legacy printouts. I have spent about 25 years trying not to
    care!

    At one time I kept REAL genealogy lines in one program and
    research/unconnected lines in the other. I'll export the latter to
    ged allowing me to only use 1 program.

    Hugh

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Charlie Hoffpauir@21:1/5 to hayesstw@telkomsa.net on Sat Jun 16 08:50:51 2018
    On Sat, 16 Jun 2018 07:56:00 +0200, Steve Hayes
    <hayesstw@telkomsa.net> wrote:

    XyWrite can do that kind of of thing, though I'd have to do some
    reading in the manual to find out how. In addition to macros, it also
    has XPL, the XyWrite programming langualge, so it's more than a word >processor. It's like a Lego kit word processor, you can build it to do >various specialised tasks, but not, as someone pointed out, in
    Unicode.

    Then it's really a shame that XyWrite is no longer available.

    I believe AWK can do that kind of thing too, and at various tims I've
    tried to learn AWK because I've thought, from its description, that it
    could do amazing things with Gedcom files, but I've never managed to
    learn it well enough to do anything useful.


    As you said, AWK will do exactly the same thing, and like you I looked
    at learning it some time ago but decided against it. I also toyed with Powershell, which most people don't realize is included or free with
    Windows, and is super capable. However at 78+, it doesn't seem a wise
    use of my time to learn a new language. For me, since I was already
    familiar with basic text editing, adding the capability to use Regular Expressions in the editing program was relatively painless.

    But my problem is that I'm not sure exactly what Hugh wants to do.

    I gather he wants to leave the 2 NICK line in the Gedcom file, but
    with nothing following it. What is the point of that?

    Actually his main goal was simply to avoid RM showing the "middle"
    name twice.... it was doing this, if I understand the situation right,
    because on export of a GEDCOM from Legacy, RM put what Hugh was using
    as the "call" name of people like himself and my Dad, who preferred to
    be called by their middle name, into the NICK field. This resulted in
    something like "J. Hugh (Hugh) Sullivan" showing up in RM. To solve
    this, he was attmpting to delete the entire line whenever 2 NICK
    xxxxxx showed up in the Legacy GED. He later found that he could
    accomplish his goal by instead of modifying the Legacy GED, rather
    exporting the GED from RM and in the export process, failing to
    include the NICK field...... so that he could then re-import the
    corrected GED back into RM thus eliminating the duplication.

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Charlie Hoffpauir@21:1/5 to john1@s145802280.onlinehome.fr on Sat Jun 16 09:02:51 2018
    On Sat, 16 Jun 2018 10:41:34 +0200, john
    <john1@s145802280.onlinehome.fr> wrote:

    One potential problem with Regular Expressions (regex_ in
    word-processing programs is how the embedded text formatting is handled. >There is limited regex available in later versions of MS Word; see the
    bottom of this page >https://support.office.com/en-us/article/find-and-replace-text-and-other-data-in-a-word-document-c6728c16-469e-43cd-afe4-7708c6c779b7?ocmsassetID=HA102350661&CorrelationId=946bf317-fe64-40a3-9201-0d9661c4fdbc&ui=en-US&rs=en-US&ad=US

    But you are probably better off using a text editor unless you really
    want to keep formatting. I earlier suggested NotePad+++. Another good
    free text editor is PSPad. Both are updated regularly, support different >character encoding, support regex, etc. Like many better text editors,
    they both also have file compare. So you can see two files side-by-side
    with similar, differences, and additions highlighted (which, in this
    current GEDCOM case, can be useful comparing your editing actions with a
    copy of the original file).

    regex is very powerful if you use it often enough to remember all the >functions. Hugh had a simple problem. If he hadn't solved it with RM I'm
    sure writing a simple macro would have been much easier, rather than
    having to work out the appropriate regex expression to handle all the
    cases which might have occurred in the file.

    I agree with almost everything you said. I guess everyone is
    different, and although I used to use Macros extensively years ago
    with Word, I really try to avoid using them any more.... because I
    tend to spend as much time trying to get the macro right as it would
    take to manually do the task. But that's just me. For me, adding the
    capability to use really simple RE expressions in a text editor was
    painless.

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Charlie Hoffpauir@21:1/5 to bieber.genealogy@earthlink.net on Sat Jun 16 09:18:28 2018
    On Fri, 15 Jun 2018 17:34:02 -0400, Dennis Lee Bieber <bieber.genealogy@earthlink.net> wrote:

    In SciTE

    ^2 NICK .*$

    converts (using your text...) to

    -=-=-=-=-
    In the example that Hugh gave, where scattered throughout the GED file
    are nicknames that appear like the list

    Quite right! I made the solution too specific, your's is more elegant.
    In Editplus the equivalent string is

    2 NICK .*\n

    The only difference the end of line character.

    I am going to download a copy of SciTE and give it a try.

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From shmartonak@gmail.com@21:1/5 to J. Hugh Sullivan on Sat Jun 16 13:28:41 2018
    On Tuesday, June 12, 2018 at 2:10:06 AM UTC, J. Hugh Sullivan wrote:
    I started with Bruce when it was Family Origins, went to RootsMagic
    but changed to Legacy. I now have both on the computers.

    I exported a Legacy GED to RM. I have hundreds of people who go by
    their middle name - an old time Southern tradition.

    My name shows up on the import from Legacy James "Hugh" "Hugh"
    Sullivan. Of course the second "Hugh" is a "nickname" in RM.

    How do I delete the duplicate en masse?

    I always go by First Initial, Middle Name and Surname - I will fight
    that battle to the end with doctors, governors, licenses, etc.

    Us WWII vets are sorta hard-nosed - and I've been married 70 years to
    the same auburn-haired gal.

    Hugh

    The *nix tool 'sed' can make the problem fairly simple:

    sed '/^2 NICK/d' < old.ged > new.ged

    This says to go through the file 'old.ged' and look for any line that begins with '2 NICK' and delete it, putting the result into the new file 'new.ged'

    The problem that was also given in the first message, as I understand it, was dealing with a line that had

    1 NAME firstname "nickname""nickname" /lastname/

    This can also be handled by 'sed'

    sed '/^1 NAME/s/"".*"/"/' < old.ged > new.ged

    This says to go through the file 'old.ged' and look at any line that starts with '1 NAME'. Look for two double quotes followed by anything followed by another double quote. Replace it with just a single double quote. What you're doing is just deleting
    the second nickname. The results go into the new file 'new.ged'

    'sed' can be extremely useful for simple editing at the command line.

    Steve

    --

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From J. Hugh Sullivan@21:1/5 to shmartonak@gmail.com on Sat Jun 16 21:33:12 2018
    On Sat, 16 Jun 2018 13:28:41 -0700 (PDT), shmartonak@gmail.com wrote:

    The *nix tool 'sed' can make the problem fairly simple:

    sed '/^2 NICK/d' < old.ged > new.ged

    This says to go through the file 'old.ged' and look for any line that begin= >s with '2 NICK' and delete it, putting the result into the new file 'new.ge= >d'

    The problem that was also given in the first message, as I understand it, w= >as dealing with a line that had

    1 NAME firstname "nickname""nickname" /lastname/

    This can also be handled by 'sed'

    You understand the problem I had and 'sed' is an interesting concept.

    Hugh

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)
  • From Steve Hayes@21:1/5 to Sullivan on Sun Jun 17 05:08:27 2018
    On Sat, 16 Jun 2018 11:47:39 GMT, Eagle@bellsouth.net (J. Hugh
    Sullivan) wrote:

    At one time I kept REAL genealogy lines in one program and >research/unconnected lines in the other. I'll export the latter to
    ged allowing me to only use 1 program.

    Yes, I keep my "reasl" genealogy in Legacy, and research/unconnected
    lines in RootsMagic.

    I'll continue to do that, and when I move "real" data to Legacy I
    will transfer the details via FamilySearch.

    ---
    Ignore the following - it's spambot fodder.

    djbasharat50@gmail.com
    searchdatingsingle@gmail.com

    --- SoupGate-Win32 v1.05
    * Origin: fsxNet Usenet Gateway (21:1/5)