Topic: Vedit.vdf - dictionary of Vedit macro commands (1 of 13), Read 66 times, 1 File Attachment
Conf: VEDIT User Applications
From: Scott Lambert
Date: Saturday, July 10, 2010 02:34 PM

Vedit.vdf is a dictionary of Vedit macro commands for Scribe.

I created it by using Scribe's Learn New Words feature on the vedit.syn file and then adding newer commands from the veditlog.txt file I got off Vedit.com (whole process took less then 10 minutes).

I am unsure why anyone would need this VDF, but I offer it, because someone might come up with a brilliant use for it.

The only reason, I can come up with is if you want to spell check the comments in your vdm file, but that seems like using a chainsaw when a butter knife is called for.

If any of you find an use for it, please let me know.

Scott

 
VEDIT.VDF (6KB)

 


Topic: Vedit.vdf - dictionary of Vedit macro commands (2 of 13), Read 60 times
Conf: VEDIT User Applications
From: Peter Rejto
Date: Tuesday, July 13, 2010 01:24 PM

Thanks Scott,

I would be interested in a related question:

To formulate this related question, let me go back to Ian's comment on Firefox. My understanding is that Firefox is using Hunspell. So, my specific question is,

Can I use a Hunspell dictionary with Scribe ?


-peter

 


Topic: Vedit.vdf - dictionary of Vedit macro commands (3 of 13), Read 68 times
Conf: VEDIT User Applications
From: Scott Lambert
Date: Tuesday, July 13, 2010 02:02 PM


Hi Peter,

To use a dictionary with Scribe, the dictionary must be a plain text file with one word per line.

If it is a plain text file, but the words are not on one line, you can use the Learn New Words feature of Scribe to generate a new vdf file. In your case, you would want a language where the user vdf, and the main vdf exist but are empty when you run the learn feature of Scribe.

What is it about the english.vdf file that makes you seek something else?

Scott


On 7/13/2010 1:24:27 PM, peter rejto wrote:
>Thanks Scott,
>
>I would be interested in a
>related question:
>
>To formulate this related
>question, let me go back to
>Ian's comment on Firefox. My
>understanding is that Firefox
>is using Hunspell. So, my
>specific question is,
>
>Can I use a Hunspell
>dictionary with Scribe ?
>
>
>-peter
>
>

 


Topic: Vedit.vdf - dictionary of Vedit macro commands (4 of 13), Read 50 times
Conf: VEDIT User Applications
From: Peter Rejto
Date: Wednesday, July 14, 2010 11:46 AM

On 7/13/2010 2:02:15 PM, Scott Lambert wrote:
>
>Hi Peter,

>What is it about the
>english.vdf file that makes
>you seek something else?
>
>Scott


Scott,


I would like to give you a progress report on your dictionary versus the Hunspell dictionary. That is to say, a progress report on english.vdf versus en-US.dic


In short they are pretty comparable. The precise comparison
of these two ASCII files is beyond my programming capabilities.


Since Firefox and Thunderbird already have en-US.dic in their Dictionaries sub directory, I do prefer en-US.dic to english.vdf


I have also downloaded en-US.dic from

wiki.services.openoffice.org/wiki/Dictionaries

There the readme.txt gives credit to Kevin Atkinson. Specifically, to one of the versions of his aspell programm, I no longer remember which one. The time stamp is 2006, so it is more recent than the time stamp of your english.vdf, which is 1999.

I have a hunch that there are only very few public domain dictionaries around. So, I also have a hunch that your english.vdf also goes back to Kevin Atkinson.



-peter

 


Topic: Vedit.vdf - dictionary of Vedit macro commands (5 of 13), Read 54 times, 1 File Attachment
Conf: VEDIT User Applications
From: Scott Lambert
Date: Wednesday, July 14, 2010 07:01 PM


Hi Peter,

Thank you for this message, as a writer of a spell checker, I have great interest in this subject.

I take it that you want a dictionary that will flag non-US spellings of words (colour vs color, honor vs honour, etc). I see that the english.vdf file has both spellings.

I regret I cannot remember where I got the list of words that make up english.vdf.

Have you considered making your own word list to serve in place of english.vdf. Experts say that the average person only has about 10 to 15 thousand words in his/her vocabulary, so why does one need a spell checker with 100,000+ words. I mean if you never use the word zeugma, your spell checker does not need to know about it either.

It is not as time consuming as one might think. If you look at what we write, most of it consists of words such as:

the pronouns

various tenses of the verbs to be, to do, to have, to go

the question words: what, where, how, etc

words like: to, from, in, out, on, off, if, yes, no, of, with, etc

names of days of week, months, numbers, seasons, colors, etc

Take a look at the attached custom.vdf, it contains most of the these words.

So you start with custom.vdf as your main vdf file, then do the learn new words process on about 10 documents written by you, and you have a totally custom spell checker of "peter-ese" with no word you don't use or want.

I think that is one of the top advantages of Scribe, you can have totally custom dictionaries, you are not locked into anyone else's word list.

Scott


On 7/14/2010 11:46:09 AM, peter rejto wrote:
>I would like to give you a progress
>report on your dictionary versus the
>Hunspell dictionary. That is to say, a
>progress report on english.vdf versus
>en-US.dic
>
>
>In short they are pretty comparable. The
>precise comparison
>of these two ASCII files is beyond my
>programming capabilities.
>
>
>Since Firefox and Thunderbird already
>have en-US.dic in their Dictionaries sub
>directory, I do prefer en-US.dic to
>english.vdf
>
>
>I have also downloaded en-US.dic from
>
>wiki.services.openoffice.org/wiki/Dictio
>naries
>
>There the readme.txt gives credit to
>Kevin Atkinson. Specifically, to one of
>the versions of his aspell programm, I
>no longer remember which one. The time
>stamp is 2006, so it is more recent than
>the time stamp of your english.vdf,
>which is 1999.
>
>I have a hunch that there are only very
>few public domain dictionaries around.
>So, I also have a hunch that your
>english.vdf also goes back to Kevin
>Atkinson.
>
>
>
>-peter

 
CUSTOM.VDF (3KB)

 


Topic: Vedit.vdf - dictionary of Vedit macro commands (6 of 13), Read 59 times
Conf: VEDIT User Applications
From: Ian Binnie
Date: Wednesday, July 14, 2010 09:06 PM

On 7/14/2010 7:01:32 PM, Scott Lambert wrote:
>
>Hi Peter,
>
>Thank you for this message, as
>a writer of a spell checker, I
>have great interest in this
>subject.
>
>I take it that you want a
>dictionary that will flag
>non-US spellings of words
>(colour vs color, honor vs
>honour, etc). I see that the
>english.vdf file has both
>spellings.

Just for the record, Firefox and Thunderbird have many dictionaries. I use en-GB.dic

This has color, but only as part of colorimeter etc. and flags color as an error - which suits me fine.

 


Topic: Vedit.vdf - dictionary of Vedit macro commands (7 of 13), Read 57 times
Conf: VEDIT User Applications
From: Peter Rejto
Date: Wednesday, July 14, 2010 09:53 PM

On 7/14/2010 9:06:04 PM, Ian Binnie wrote:
>On 7/14/2010 7:01:32 PM, Scott Lambert
>wrote:
>Just for the record, Firefox and
>Thunderbird have many dictionaries. I
>use en-GB.dic

Thanks Ian,

I would like to download en_GB.dic. I visited the mozilla add ins website and there I could not find the option of downloading the dictionary alone.

So, I would appreciate getting the exact url.

Also, I tried to use Firefox to spell check an ASCI file that I have opened in Vedit. I did not succeed. Am I missing something ?


-peter

 


Topic: Vedit.vdf - dictionary of Vedit macro commands (8 of 13), Read 48 times
Conf: VEDIT User Applications
From: Peter Rejto
Date: Wednesday, July 14, 2010 10:09 PM

On 7/14/2010 7:01:32 PM, Scott Lambert wrote:
>
>Hi Peter,
>
>Thank you for this message, as
>a writer of a spell checker, I
>have great interest in this
>subject.
>
>I take it that you want a
>dictionary that will flag
>non-US spellings of words
>(colour vs color, honor vs
>honour, etc). I see that the
>english.vdf file has both
>spellings.
>
>I regret I cannot remember
>where I got the list of words
>that make up english.vdf.
>
>Have you considered making
>your own word list to serve in
>place of english.vdf. Experts
>say that the average person
>only has about 10 to 15
>thousand words in his/her
>vocabulary, so why does one
>need a spell checker with
>100,000+ words. I mean if you
>never use the word zeugma,
>your spell checker does not
>need to know about it either.
>
>It is not as time consuming as
>one might think. If you look
>at what we write, most of it
>consists of words such as:
>
>the pronouns
>
>various tenses of the verbs to
>be, to do, to have, to go
>
>the question words: what,
>where, how, etc
>
>words like: to, from, in, out,
>on, off, if, yes, no, of,
>with, etc
>
>names of days of week, months,
>numbers, seasons, colors, etc
>
>Take a look at the attached
>custom.vdf, it contains most
>of the these words.
>
>So you start with custom.vdf
>as your main vdf file, then do
>the learn new words process on
>about 10 documents written by
>you, and you have a totally
>custom spell checker of
>"peter-ese" with no word you
>don't use or want.
>
>I think that is one of the top
>advantages of Scribe, you can
>have totally custom
>dictionaries, you are not
>locked into anyone else's word
>list.
>
>Scott
>

Hi Scott,

No, I have not considered making my own word list to
serve in place of english.vdf In fact, since english.vdf
and en-US.dic/en-GB.dic are comparable I am glad to leave the task of upgrading dictionaries to Firefox.

In fact, I have a question: Would it possible to rename the .vdf extension to .dic.

-peter

 


Topic: Vedit.vdf - dictionary of Vedit macro commands (9 of 13), Read 36 times
Conf: VEDIT User Applications
From: Scott Lambert
Date: Thursday, July 15, 2010 05:03 PM

On 7/14/2010 10:09:22 PM, peter rejto wrote:
>In fact, I have a question: Would it
>possible to rename the .vdf extension to
>.dic.

Hi Peter,

There is nothing in the code to prevent you using .dic (or anything else) as the extension for your dictionaries. Just make sure the dictionary filenames in the scribe.ini file match what is in the scribe folder.

I just thought .vdf sounded better then .dic

Scott

 


Topic: Vedit.vdf - dictionary of Vedit macro commands (11 of 13), Read 30 times
Conf: VEDIT User Applications
From: Pauli Lindgren
Date: Friday, July 16, 2010 10:41 AM

On 7/14/2010 10:09:22 PM, peter rejto wrote:
>
>In fact, I have a question: Would it
>possible to rename the .vdf extension to
>.dic.

It is not good idea to change the filename extension.
The extension reflects the file type. The extension .vdf indicates that the file is scribe dictionary. If you rename it to .dic, someone may think it is Hunspell dictionary.

--
Pauli

 


Topic: Vedit.vdf - dictionary of Vedit macro commands (12 of 13), Read 27 times
Conf: VEDIT User Applications
From: Peter Rejto
Date: Saturday, July 17, 2010 12:57 PM

On 7/16/2010 10:41:16 AM, Pauli Lindgren wrote:


>It is not good idea to change the
>filename extension.
>The extension reflects the file type.
>The extension .vdf indicates that the
>file is scribe dictionary. If you rename
>it to .dic, someone may think it is
>Hunspell dictionary.

>Pauli



Thanks Pauli,

Sorry to have missed that point! In fact, I do need all the help that I can get to try to keep the various dictionary file apart!


So, I am asking Scott to add an "identifying" label to his .vdf files.

I would also appreciate it, if I could add my own labels.
Specifically, I would appreciate it if scribe would honor the usual Vedit comment sign, which I believe is //.


-peter

 


Topic: Vedit.vdf - dictionary of Vedit macro commands (10 of 13), Read 31 times
Conf: VEDIT User Applications
From: Pauli Lindgren
Date: Friday, July 16, 2010 10:36 AM

On 7/14/2010 11:46:09 AM, peter rejto wrote:
>
>Since Firefox and Thunderbird already
>have en-US.dic in their Dictionaries sub
>directory, I do prefer en-US.dic to
>english.vdf

I don't think you can use the .dic files directly with scribe. Even though the .dic file does list the words in plain ASCII, one word on each line, it has some additional information after each word. You could remove the additional information with Vedit (remove anything from '/' to end of line).

>
>I have a hunch that there are only very
>few public domain dictionaries around.
>So, I also have a hunch that your
>english.vdf also goes back to Kevin
>Atkinson.

There are many dictionaries for Aspell available at
http://aspell.net/

Of course some of them may have same origin as the dictionaries used by Hunspell.

Aspell dictionaries list each word in ASCII, separated by NULL. You could easily replace NULLs with Newlines. In addition, there is some binary data at the beginning and lots of binary data at the end of the file, so you would need to remove those. Then sort the dictionary to alphabetic order, and you have Scribe compatible dictionary.

--
Pauli

 


Topic: Vedit.vdf - dictionary of Vedit macro commands (13 of 13), Read 18 times
Conf: VEDIT User Applications
From: Peter Rejto
Date: Sunday, July 18, 2010 04:40 PM

On 7/16/2010 10:36:43 AM, Pauli Lindgren wrote:
>On 7/14/2010 11:46:09 AM, peter rejto
>wrote:
>>
>>Since Firefox and Thunderbird already
>>have en-US.dic in their Dictionaries sub
>>directory, I do prefer en-US.dic to
>>english.vdf
>
>I don't think you can use the .dic files
>directly with scribe. Even though the
>.dic file does list the words in plain
>ASCII, one word on each line, it has
>some additional information after each
>word. You could remove the additional
>information with Vedit (remove anything
>from '/' to end of line).
>
...
>Pauli


Pauli,


Here I stand corrected. I just did the following experiment confirming your comment:

I edited scribe.ini as per Scott's instructions so that en_US.dic was my main dictionary. Then the word " the "
was a mispelled word. In other words, scribe did not find the word " the " in the main dictionary. I checked it and this statement was correct, inasmuch as en_US.dic has only " the/JG "

So, a big thank you for your suggestion saying that

"You could remove the additional
>information with Vedit (remove anything
>from '/' to end of line)"


I have also downloaded the Hunspell manuals from sourceforge.net . They confirmed your comment that in
the en-US.dic file some words have
'>some additional information' after the '/' character.
I certainly had the feeling that this additional information is relevant from the linguistic point of view.
Although, I did not have time to study the specifics of this additional information.

I do hope that the issue of ignoring characters
" from '/' to end of line' in a dictionary file, is similar to the issue of ignoring TeX commands in a file to be checked. {So, a big thank you to Fritz for writing the Tex filter for scribe.)


In other words, I do hope that Scott will give me the option of ignoring things after the '/' character in a dictionary file !


Thanks to all who made this possible.

-peter.