Top Banner
US 20030145285A1 (12) Patent Application Publication (10) Pub. No.: US 2003/0145285 A1 (19) United States Miyahira et al. (43) Pub. Date: Jul. 31, 2003 (54) METHOD OF DISPLAYING CORRECT (30) Foreign Application Priority Data WORD CANDIDATES, SPELL CHECKING METHOD, COMPUTER APPARATUS, AND Jan. 29, 2002 (JP) .................................... .. 2002-020789 PROGRAM Publication Classi?cation 7 (75> whimMayahirwmawishim; E23 313%. 1111111111111111111111111111111111111111111111111111.???22732 Marl Kurokl, Sagamihara-shi (JP) (57) ABSTRACT Correspondence Address? Acompound Word dictionary is provided in a dictionary data IBM CORPORATION storing section. If a Word sequence composed of a candidate 3039 CORNWALLIS RD- for a correct Word relative to a Word subjected to spell DEPT. T81 / B503, PO BOX 12195 checking and a Word/Words before or after the Word sub REASEARCH TRIANGLE PARK, NC 27709 jected to spell checking or on each side of the Word (US) subjected to spell checking is registered as a compound Word in the compound Word dictionary, the correct Word (73) Assignee: International Business Machines Cor- candidate forming the compound Word is placed in a pref poration, Armonk, NY (US) (US) erential position on a list, While, if a subject area of the content of a sentence can be speci?ed based on the original (21) Appl. No.: 10/351,022 teXt before and after a Word subjected to spell checking, a correct Word candidate that belongs to such a subject area is (22) Filed: Jan. 24, 2003 placed in a preferential position on the list. IN 10\ 20 [so TRANSLATION INPUT ISPELL-CHECK OUTPUT SECTION PROCESSING sECTION sECTION [-40 SUBJECT AREA JUDGING SECTION [50 DICTIONARY DATA STORING SECTION BASE _ USER DICTIONARY DOMAIN DICTIONARY W 0RD DICTIONARY WORD DICT5IONARY WORD DICTISCZNARY J DICTIONARY —’ COMPOUND 53a COMPOUND DlCvilIgRiRY COMPOUND DI ViIORD RY 5,: WORD 222A DICTIONARY —— 53b 51/ \52 53
17

(75> whimMayahirwmawishim; E23 313 ...

Feb 14, 2017

Download

Documents

vodien
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: (75> whimMayahirwmawishim; E23 313 ...

US 20030145285A1

(12) Patent Application Publication (10) Pub. No.: US 2003/0145285 A1 (19) United States

Miyahira et al. (43) Pub. Date: Jul. 31, 2003

(54) METHOD OF DISPLAYING CORRECT (30) Foreign Application Priority Data WORD CANDIDATES, SPELL CHECKING METHOD, COMPUTER APPARATUS, AND Jan. 29, 2002 (JP) .................................... .. 2002-020789 PROGRAM

Publication Classi?cation

7

(75> whimMayahirwmawishim; E23 313%. 1111111111111111111111111111111111111111111111111111.???22732 Marl Kurokl, Sagamihara-shi (JP)

(57) ABSTRACT Correspondence Address? Acompound Word dictionary is provided in a dictionary data IBM CORPORATION storing section. If a Word sequence composed of a candidate 3039 CORNWALLIS RD- for a correct Word relative to a Word subjected to spell DEPT. T81 / B503, PO BOX 12195 checking and a Word/Words before or after the Word sub REASEARCH TRIANGLE PARK, NC 27709 jected to spell checking or on each side of the Word (US) subjected to spell checking is registered as a compound

Word in the compound Word dictionary, the correct Word (73) Assignee: International Business Machines Cor- candidate forming the compound Word is placed in a pref

poration, Armonk, NY (US) (US) erential position on a list, While, if a subject area of the content of a sentence can be speci?ed based on the original

(21) Appl. No.: 10/351,022 teXt before and after a Word subjected to spell checking, a correct Word candidate that belongs to such a subject area is

(22) Filed: Jan. 24, 2003 placed in a preferential position on the list.

IN

10\ 20 [so TRANSLATION

INPUT ISPELL-CHECK OUTPUT SECTION PROCESSING sECTION

sECTION

[-40 SUBJECT AREA

JUDGING SECTION

[50 DICTIONARY DATA STORING SECTION

BASE _ USER

DICTIONARY DOMAIN DICTIONARY

W 0RD DICTIONARY WORD

DICT5IONARY WORD DICTISCZNARY J DICTIONARY —’

COMPOUND 53a COMPOUND

DlCvilIgRiRY COMPOUND DI ViIORD RY 5,: WORD 222A — DICTIONARY ——

53b 51/ — \52

53

Page 2: (75> whimMayahirwmawishim; E23 313 ...

Patent Application Publication Jul. 31, 2003 Sheet 1 0f 10 US 2003/0145285 A1

FIG. 1 1 /

2

CONTROL SECTION

5

DISPLAY CONTROL 3

BLOCK DISPLAY UNIT

POINTER CONTROL 4 BLOCK POINTING

DEVICE 7

TRANSLATION PROCESSING

BLOCK

Page 3: (75> whimMayahirwmawishim; E23 313 ...

Patent Application Publication US 2003/0145285 A1

53

Jul. 31, 2003 Sheet 2 0f 10

10\ / 2o [30 TRANSLATION

INPUT ISPELL-CHECK _ OUTPUT

SECTION PROCESSING SECTION SECTION

r 40

SUBJECT AREA JUDGING SECTION

II J50

DICTIONARY DATA STORING SECTION

BASE USER DlCTlONARY DOMAIN DICTIONARY

W 0RD DICTIONARY WORD

DICTSIONARY WORD DlCTlsgNARY ——§ DICTIONARY J

COMPOUND 53a COMPOUND

WORD COMPOUND WORD DICTIONARY DICTIONARY

51b WORD 52b — DICTIONARY —

53b 51/ Q T‘ \52

Page 4: (75> whimMayahirwmawishim; E23 313 ...

Patent Application Publication Jul. 31, 2003 Sheet 3 0f 10 US 2003/0145285 A1

I 5101 INPUT ORIGINAL /

TEXT DATA

I S102 SPELL-CHECK _/ PROCESSING

5103 NO

YES $104 EXTRACT CORRECT WORD / , CANDIDATE / CANDIDATES

S105 ONE

CORRECT WORD CANDIDATE

REGISTERED AS COMPOUND

WORD 12

YES

S107 SUBJECT

AREA SPECIFIED ‘.7

YES 5

NO

SORT CORRECT WORD j'oa CANDIDATES

I $109 OUTPUT CORRECT WORD /

CANDIDATE LIST

@E)

Page 5: (75> whimMayahirwmawishim; E23 313 ...

Patent Application Publication Jul. 31, 2003 Sheet 4 0f 10 US 2003/0145285 A1

FIG. 4 IN CASE OF "Jave" ALONE

FIG. 4 A Jade [VERB] — L1 Jake [ADJECTIVE] ——-——- / Jane [NOUN] ————

Jape [VERB] Jive [VERB] Jove [NOUN] Javy [NOUN] Jace [NOUN] Jayne [NOUN] Jaye [NOUN] Java [NOUN]

IN CASE OF "Jave IS A PROGRAMMING LANGUAGE."

FIG. 413 Java [NOUN] ————— L2 Jade [VERB] —— / \

Jake [ADJECTIVE] Jane [NOUN] Jape [VERB] Jive [VERB] Jove [NOUN] Javy [NOUN] Jace [NOUN] Jayne [NOUN] Jaye [NOUN]

IN CASE OF "Jave LANGUAGE IS POPULAR."

FIG. 4C Java [NOUN] ———— L3 Jade [VERB] — / Jake [ADJECTIVE] Jane [NOUN] Jape [VERB] Jive [VERB] Jove [NOU N] Javy [NOUN] Jace [NOUN] Jayne [NOUN] Jaye [NOU N]

Page 6: (75> whimMayahirwmawishim; E23 313 ...

Patent Application Publication Jul. 31, 2003 Sheet 5 0f 10 US 2003/0145285 A1

FIG. 5

IN CASE OF "Chiplet" ALONE

FIG 5A Chaplet [NOUN] ———— L4 ' Chipset [NOUN] ————— /

ChiCIe [NOUN] ————

Chilled [ADJECTIVE] Chiller [NOUN] Chipper [ADJECTIVE] Chile [NOUN]

IN CASE OF "Chiplet on the mother board"

FIG. 5B Chipset [NOUN] ————— L5 Chaplet [NOUN] ——-— / Chicle [NOUN] ——

Chilled [ADJECTIVE] Chiller [NOUN] Chipper [ADJECTIVE] Chile [NOUN]

Page 7: (75> whimMayahirwmawishim; E23 313 ...

Patent Application Publication Jul. 31, 2003 Sheet 6 0f 10 US 2003/0145285 A1

FIG. 6

IN CASE OF "brwser" ALONE

FIG_ 6A bowser [NOUN] ____ )6 browser [NOUN] bower [NOUN] bowler [NOUN] brawler [NOUN] brewer [NOUN] browse [NOUN]

IN CASE OF "web brwser"

browser [NOUN] —-———— L7

' bowser [NOUN] ———— / bower [NOUN] —————

bowler [NOUN] brawler [NOUN] brewer [NOUN] browse [NOUN]

Page 8: (75> whimMayahirwmawishim; E23 313 ...

Patent Application Publication

FIG. 7A

FIG. 7B

Jul. 31, 2003 Sheet 7 0f 10 US 2003/0145285 A1

FIG. 7

Name ADJECTIVE "8 Node NOUN / Nope ADVERB Nose VERB Note VERB None ADVERB Nomo NOUN

Nomo NOUN L9

Name ADJECTIVE / Node NOUN Nope ADVERB Nose VERB Note VERB None ADVERB

Page 9: (75> whimMayahirwmawishim; E23 313 ...

Patent Application Publication Jul. 31, 2003 Sheet 8 0f 10 US 2003/0145285 A1

f 3(1) NO TITLE - NOTEPAD

File Edit Format Help

@struck out Sammy Sosa.

MOUSE DICTIONARY AND TRANSLATION Q?l?

D'CT‘Lg'QARY KENKYUSHA's E WORD [2| .: TRANSLATION DICTIONARY BOOK ‘- E

—I <1:

Nome g 5 * The word was not found in the dictionary.* % 8 %

Nomo [NOUN] U Name [ADJECTIVE] 2 Node [NOUN] g Nope [ADVERB] I g Nose [VERB] 5 Note [VERB] 1 None [ADVERB] L Noma [NOUN] \ v

Page 10: (75> whimMayahirwmawishim; E23 313 ...

Patent Application Publication Jul. 31, 2003 Sheet 9 0f 10 US 2003/0145285 A1

FIG. 9

/- 3(1) NO TITLE - NOTEPAD

File Edit Format Help

HideoINWgéEIs a pitcher.

MOUSE DICTIONARY AND TRANSLATION

I I KENKYUSHA's WORD DICTIONARY I33 BOOK

@

DICTIONARY ::: FOR

' TRANSLATION _|_J

Nome a Z

* The word was not found in the dictionary.* 8

Nomo [NOU N] Name [ADJECTIVE] ’ Node [N 0U N] Nope [ADVERB] Nose [VERB] Note [VERB] ——————

None [ADVERB] -_— L

Noma [NOU N] \ —

IE1

II P DICTIONARY ERANSLATION El

Page 11: (75> whimMayahirwmawishim; E23 313 ...

Patent Application Publication Jul. 31, 2003 Sheet 10 0f 10 US 2003/0145285 A1

FIG. 10

' ' I interesting! ' '

P

k f w MOUSE DICTIONARY AND TRANSLATION

DICTIONARY FOR

TRANSLATION

interesting

interesting [ADJ ECTIVE]

interest

[VERB]

Page 12: (75> whimMayahirwmawishim; E23 313 ...

US 2003/0145285 A1

METHOD OF DISPLAYING CORRECT WORD CANDIDATES, SPELL CHECKING METHOD, COMPUTER APPARATUS, AND PROGRAM

BACKGROUND OF THE INVENTION

[0001] The present invention relates to a method of dis playing candidates for a correct Word, suitable for use in a program or the like With a dictionary function.

[0002] Translation programs have been used in personal computers and so forth. Among the translation programs, that have been available such a program that has, in addition to a function of mechanically translating a sentence dis played on a monitor, a function of consulting a dictionary for the meaning of a Word in the sentence and displaying it.

[0003] Hitherto, upon executing such a dictionary consult ing function, a Word, of Which a user Wishes to knoW the meaning, in a document has been designated by performing a drag operation of a mouse, or the like. In recent years, hoWever, there has also been offered such a program that has a function, Wherein as shoWn in FIG. 10, only by placing a mouse pointer P in the neighborhood of a Word the meaning of Which is Wished to be knoWn, i.e. Without performing the particular operation for designating the Word as noted above, the program side automatically identi?es the Word near the mouse pointer P based on spaces or the like in the document. In the example of FIG. 10, the Word positioned near the mouse pointer P is identi?ed (highlighted), and the result of consulting a dictionary for the meaning of the Word is displayed in a WindoW W.

[0004] On the other hand, upon execution of the dictionary consulting function to display the meaning of a Word as described above, if spelling of the Word is Wrong, the correct meaning cannot be displayed. In vieW of this, there has also been such a program that has a function of displaying a list of candidates for a correct Word if the spelling appears to be Wrong.

[0005] There are, for example, the folloWing factors that cause misspelling:

[0006] “insertion or addition” of a Wrong character like

[0007] WhereQWheWre

[0008] the—>thhe

[0009] “omission” of a character like

[0010] SOfIWaI‘6—>SOfIWaI‘

[0011] con?dential—>co?dential

[0012] “substitution” like

[0013] private—>privite

[0014] theQtha

[0015] “transposition” in order of characters like

[0016] foreigner—>foerigner

[0017] the—>teh

[0018] The foregoing candidate list displaying function searches for Words that can be created by combination of those factors and displays them as candidates on a list.

Jul. 31, 2003

[0019] In the foregoing conventional technique, hoWever, there exists the folloWing problem, speci?cally, When dis playing a list of a plurality of candidates for a correct Word relative to a Word that seems misspelled, it is generally considered to display the correct Word candidates in alpha betical order or the like. HoWever, in particular, if there are many Words as candidates, there are those instances Where the correct Word is displayed in a loWer position on the list. In this event, a user has to scroll the list for ?nding the correct Word, Which Would be bothersome.

[0020] As means for solving such a problem, there has been proposed a technique of analyZing the context to identify a part of speech of a Word that appears misspelled, and displaying a Word corresponding to the identi?ed part of speech in a preferential position on a list.

[0021] Even using such a technique, hoWever, it does not necessarily folloW that the best Word is displayed in the uppermost position on the list, and accordingly, there still remains room for further improvement in this regard.

SUMMARY OF THE INVENTION

[0022] The present invention has been made in vieW of the foregoing technical problem and has an object to provide a method of displaying candidates for a correct Word, Which can display the correct Word With higher accuracy, and so forth.

[0023] For accomplishing the foregoing object, a method of displaying candidates for a correct Word according to the present invention is characteriZed in that When there is misspelling in an original text input into a computer appa ratus, if a candidate for a correct Word relative to a mis spelled Word and one or more Words before or after said misspelled Word or on each side of said misspelled Word in the original text are included in a Word sequence registered in advance, the computer apparatus displays the correct Word candidate included in the Word sequence in a prefer ential position. Here, “one or more Words before or after said misspelled Word or on each side of said misspelled Word” represents one or more Words before or after the misspelled Word, or one or more Words on each side of the misspelled Word. Further, “a Word sequence registered in advance” includes a so-called compound Word, idiom, proper noun and so forth and, in this speci?cation, these are each suitably referred to as a Word sequence or a compound Word. Further, “in a preferential position” represents that priority of a corresponding candidate in terms of display order on a list or the like is increased.

[0024] It may be further arranged that it is judged Whether or not a subject area of content of the original text including the misspelled Word can be speci?ed based on such an original text and, When the subject area is speci?ed such as sports or art, a correct Word candidate belonging to the speci?ed subject area is displayed in a preferential position. Preferably, the foregoing processing is executed When per forming spell-check processing of the original text inputted into the computer apparatus.

[0025] The present invention can also be understood as a spell checking method comprising a step of checking Whether or not there is misspelling in a Word forming an original text input into a computer apparatus, and a step of displaying Words as candidates for a correct Word relative to

Page 13: (75> whimMayahirwmawishim; E23 313 ...

US 2003/0145285 Al

the misspelled Word. In this case, at the displaying step, (a) if a subject area of content of the original text can be speci?ed based on the original text before and after the misspelled Word, and a correct Word candidate is included in the speci?ed subject area, or (b) if a Word sequence regis tered in advance is formed by one or more Words before or after said misspelled Word or on each side of said misspelled Word in the original text and a correct Word candidate, a list of Words as correct Word candidates is displayed such that a Word satisfying (a) or (b) is placed in a preferential position on the list. That is, the correct Word candidate that agrees With the speci?ed content subject area, or the correct Word candidate forming the Word sequence registered in advance is placed in a preferential position as compared With other correct Word candidates.

[0026] If the present invention is understood as a computer apparatus, the computer apparatus has a list output section that extracts candidates for a correct Word relative to a misspelled Word in an original text input from an input section, and outputs a list of the extracted correct Word candidates. In this case, the list output section is character iZed by determining priority of the correct Word candidates on the list based on a Word stored in a domain database included in a database.

[0027] When the database has a compound Word database storing compound Words each composed of a plurality of Words, the list output section can also be understood as a computer apparatus characteriZed by determining priority of the correct Word candidates on the list based on a compound Word stored in the compound Word database.

[0028] In the list output section, the correct Word candi dates on the list can be sorted such that the correct Word candidate With high priority is placed in a preferential position on the list as compared With other candidates. In addition, for distinguishing the correct Word candidate With high priority from other candidates, it may be displayed in a highlighted fashion.

[0029] The present invention can also be understood as a program for causing a computer apparatus to execute a step of specifying a misspelled Word based on an original text input into the computer apparatus, a step of extracting candidates for a correct Word relative to the misspelled Word, and a step of, if the extracted correct Word candidate and one or more Words before or after said misspelled Word or on each side of said misspelled Word in the original text form a compound Word registered in advance as dictionary data, outputting the correct Word candidates such that the correct Word candidate included in the compound Word is placed in a preferential position.

[0030] It may be arranged that the correct Word candidate outputting step refers to a compound Word database storing compound Word data, and judges Whether or not a com pound Word stored in the compound Word database is formed by one or more Words before or after said misspelled Word or on each side of said misspelled Word in the original text and the extracted correct Word candidate for the mis spelled Word.

[0031] Further, it is effective to add a function to the program, thereby to further make the computer apparatus execute processing of spell checking of a Word included in the original text. In addition, by combining this program

Jul. 31, 2003

With a machine translation program, it is also possible to further make the computer apparatus execute the process of translating the original text. With this arrangement, it is possible to provide a program having both the translation function and the spell-check function.

[0032] Further, this program can also make the computer apparatus execute the process of, When a subject area can be speci?ed based on the original text before and after the misspelled Word, outputting the correct Word candidates such that the correct Word candidate that belongs to the speci?ed subject area is placed in a preferential position.

BRIEF DESCRIPTION OF THE DRAWINGS

[0033] FIG. 1 is a diagram shoWing a schematic con?gu ration of a computer apparatus in a preferred embodiment of the present invention;

[0034] FIG. 2 is a diagram shoWing a con?guration of a translation processing block;

[0035] FIG. 3 is a diagram shoWing the processing ?oW upon executing spell-check processing;

[0036] FIG. 4 is a diagram shoWing display examples of correct Word candidate lists;

[0037] FIG. 5 is a diagram shoWing another display examples of correct Word candidate lists;

[0038] FIG. 6 is a diagram shoWing still another display examples of correct Word candidate lists;

[0039] FIG. 7 is a diagram shoWing still another display examples of correct Word candidate lists;

[0040] FIG. 8 is a diagram shoWing the display image Where the correct Word candidate list shoWn in FIG. 7(b) is displayed on a screen of the computer apparatus;

[0041] FIG. 9 is a diagram shoWing another example of a correct Word candidate list displayed on the screen of the computer apparatus; and

[0042] FIG. 10 is a diagram shoWing an example of displaying a result of dictionary consultation according to the prior art.

PREFERRED EMBODIMENT OF THE PRESENT INVENTION

[0043] HereinbeloW, the present invention Will be described in detail based on a preferred embodiment shoWn in the accompanying draWings.

[0044] FIG. 1 is a diagram for explaining a schematic con?guration of a computer apparatus in this embodiment. As shoWn in FIG. 1, the computer apparatus 1 comprises a control section 2 having a CPU, a main memory, an HDD and so forth, a display unit 3 using a CRT, an LCD panel or the like, and a pointing device 4 such as a mouse for operating a pointer in the form of an arroW or the like displayed on a display screen of the display unit 3, and further comprises a keyboard and so forth.

[0045] The control section 2 has functions such as the display control block 5 for displaying an image on the display screen of the display unit 3 based on a draWing command from the CPU, the pointer control block 6 for

Page 14: (75> whimMayahirwmawishim; E23 313 ...

US 2003/0145285 A1

displaying the pointer on the display screen, and the trans lation processing block 7 as described below.

[0046] The display control block 5 is realiZed by a video driver, a video chip and so forth (not shoWn), and makes the display unit 3 display information based on image data transferred from the main memory or the like.

[0047] The pointer control block 6 is realiZed by a user interface driver that processes an event When a user operates the pointing device 4 or the keyboard (not shoWn) and, in particular, executes processing of changing position coordi nates of the pointer displayed on the display screen of the display unit 3, based on an operation of the pointing device 4 by the user.

[0048] The translation processing block 7 is realiZed through execution of processing based on a program stored in the HDD or the like by the CPU cooperatively With the main memory and so forth. FIG. 2 shoWs a functional con?guration of the translation processing block 7, Which comprises an input section 10 for inputting the original text to be translated, a translation/spell-check processing section 20 for executing translation processing and spell-check processing of the inputted original text, an output section (list output section) 30 for outputting data that makes the display unit 3 display a translation as a result of the executed translation processing, a correct Word list as a result of the executed spell-check processing and so forth, and a dictio nary data storing section (database) 50 storing dictionary data that is used upon performing the translation processing or the spell-check processing.

[0049] The translation processing block 7 receives infor mation about position coordinates of the pointer on the display screen transferred from the pointer control block 6, and so forth and, When a drag operation (operation that moves the pointer While clicking, then releases the click) is performed, the input section 10 inputs a sentence of the original text Whose range is designated by the drag opera tion, and the translation/spell-check processing section 20 executes the translation processing of the input original text. Upon executing the translation processing, the translation/ spell-check processing section 20 carries out processing of morphological analysis, syntactic analysis, syntactic genera tion, morphological generation and so forth in sequence. Further, the translation/spell-check processing section 20 implements the translation processing by referring to the dictionary data storing section 50 storing grammar rules, Word data and so forth.

[0050] It is to be noted that inasmuch as an outline of the translation processing itself belongs to the general tech nique, detailed explanation thereof is omitted. Here, there is no particular intention to limit a translation algorithm or an output form of the translation result.

[0051] On the other hand, even if the range designation is not performed by the drag operation, When, for example, a predetermined operation such as right button click of the mouse is carried out, or the pointer is continuously placed in the same position for a predetermined time, the input section 10 of the translation processing block 7 identi?es and inputs a Word positioned in the neighborhood of the pointer at that time instant, and the translation/spell-check processing sec tion 20 refers to the dictionary data storing section 50 to execute processing of dictionary consultation for the input

Jul. 31, 2003

Word. In this event, the translation/spell-check processing section 20 can offer not only the best translation based on the context, but also information such as a part of speech of the Word based on the Word data stored in the dictionary data storing section 50.

[0052] Further, the translation/spell-check processing sec tion 20 performs spell checking of the original text input at the input section 10 and, if there is misspelling, displays a list of candidates for a correct Word.

[0053] In this embodiment, upon displaying the correct Word candidate list, if a compound Word registered in advance is formed by a Word sequence composed of a candidate for a correct Word relative to a Word subjected to spell checking and a Word/Words before or after the Word subjected to spell checking or on each side of the Word subjected to spell checking, the translation/spell-check pro cessing section 20 displays the candidate forming the com pound Word in a preferential position on the list. A subject area judging section 40 is further provided attendantly to the translation/spell-check processing section 20. If a subject area of the content of the original text can be speci?ed based on the original text before and after a Word subjected to spell checking, the subject area judging section 40 displays in a preferential position on the list a Word that belongs to such a subject area.

[0054] Because the translation/spell-check processing sec tion 20 executes the foregoing list display control, the dictionary data storing section 50 includes a base dictionary 51 and a user dictionary 52, and further includes domain dictionaries (domain databases) 53 having data for each of subject areas such as sports, computers, art, entertainments, politics and economics, science, and home etc. Further, the base dictionary 51 storing data of grammar rules and Words, the user dictionary 52 storing data of Words and so forth registered by a user, and the foregoing domain dictionaries 53 are provided With not only Word dictionaries 51a, 52a, 53a, but also compound Word dictionaries (Word sequence databases) 51b, 52b, 53b, respectively. [0055] In the computer apparatus 1 having the foregoing con?guration, the translation/spell-check processing section 20 refers to the dictionary data storing section 50 based on the original text (sentences and Words) input at the input section 10, and performs the translation processing to pro duce a translation, Which is output from the output section 30. When the translation/spell-check processing section 20 executes the translation processing, if the subject area judg ing section 40 can specify a subject area from Words and compound Words composing the original text, the translation processing With high accuracy can be carried out ef?ciently by preferentially referring to the domain dictionary 53 corresponding to the speci?ed subject area. As a technique implemented by the subject area judging section 40 for judging a subject area, there has been available a method, for example, as described in JP-A-2001-101185.

[0056] FIG. 3 shoWs the processing ?oW upon executing the spell-check processing in the computer apparatus 1 having the foregoing con?guration. As shoWn in FIG. 3, upon executing the spell-check processing, a Word to be spell checked and the original text before and after such a Word are input from the input section 10 (step S101). Then, the translation/spell-check processing section 20 judges Whether there is a Word to be spell checked (eg a Word that

Page 15: (75> whimMayahirwmawishim; E23 313 ...

US 2003/0145285 A1

is not registered in the dictionaries in advance) (step S102). If it is judged that there is no misspelling, then the process ing is ?nished (step S103). On the other hand, if it is judged that there is misspelling, then the translation/spell-check processing section 20 refers to the dictionary data storing section 50 to extract a candidate/candidates for a correct Word (step S104). For this extraction of the correct Word candidate/candidates, the same algorithm as that of the knoWn spell-check processing can be used.

[0057] After extracting the correct Word candidate/candi dates, it is judged Whether there is only one correct Word candidate (step S105). If there is only one correct Word candidate, inasmuch as sorting is not necessary upon dis playing the correct Word candidate on a list, the processing jumps to step S109 Where a correct Word candidate list is output from the output section 30.

[0058] On the other hand, if there are tWo or more correct Word candidates, then the processing for sorting the correct Word candidates upon displaying them on a list is executed. For accomplishing this, the translation/spell-check process ing section 20 ?rst refers to the compound Word dictionaries 51b, 52b, 53b of the base dictionary 51, the user dictionary 52 and the domain dictionaries 53, thereby to con?rm Whether a Word sequence composed of a correct Word candidate for the Word subjected to spell checking and one or more Words immediately before or after such a Word or on each side of such a Word is registered as a compound Word (step S106). If the Word sequence is registered as a compound Word, sorting of the list is executed such that the correct Word candidate included in that compound Word is placed in a preferential position on the list of the correct Word candidates (step S108), and this sorted list of the correct Word candidates is output from the output section 30 (step S109). [0059] On the other hand, if the Word sequence is not registered as a compound Word at step S106, then the subject area judging section 40 attendant to the translation/spell check processing section 20 refers to the domain dictionaries 53 to con?rm Whether there is anything registered that can specify a subject area of the content of the original text, based on Words and Word sequences included in the original text before and after the Word subjected to spell checking (step S107). If a Word or a Word sequence that can specify a subject area is registered, sorting of the list is executed such that the Word included in the domain dictionary 53 corresponding to that subject area is placed in a preferential position on the list of the correct Word candidates (step S108), and then this sorted list of the correct Word candi dates is output from the output section 30 (step S109).

[0060] The order of executing the search for the com pound Word at step S106 and the search for specifying the subject area at step S107 is not limited to the foregoing order, but may be reversed. It may also be arranged that both searches are alWays executed and, depending on the result of judgment, correct Word candidates are sorted in predeter mined preferential order.

[0061] FIGS. 4 to 9 shoW examples of differences in content of correct Word candidate lists produced according to the foregoing processing.

[0062] FIG. 4(a) shoWs an example of a list L1 that is produced When a Word “Jave” is judged to be misspelled,

Jul. 31, 2003

and a compound Word or a subject area is not speci?ed (i.e. like prior art). On the list L1, correct Word candidates are displayed in order of ?nding of the candidates by a spell check logic.

[0063] On the other hand, in case the original text is a sentence as “Jave is a programming language.”, if the Word “Jave” is judged to be misspelled, a subject area is speci?ed as “computer-related” based on a Word sequence “program ming language”. Accordingly, as a list L2 shoWn in FIG. 4(b), the correct Word candidates are sorted for display such that a Word “Java” stored in the computer-related domain dictionary 53 is placed in a preferential position on the list, ie in the uppermost position on the list in the ?gure.

[0064] In a different situation, in case of a sentence as “Jave language is popular.”, if a Word sequence “Java language” is registered as a compound Word, as a list I3 shoWn in FIG. 4(c), the correct Word candidates are sorted for display such that the Word “Java” is placed in a prefer ential position on the list, ie in the uppermost position on the list in the ?gure.

[0065] FIG. 5(a) shoWs an example of a list L4 that is produced When a Word “chiplet” is judged to be misspelled, and a compound Word or a subject area is not speci?ed. On the list L4, correct Word candidates are displayed in order of ?nding of the candidates by a spell-check logic. On the other hand, if the Word “chiplet” is judged to be misspelled in a phrase as “chiplet on the mother board”, a subject area is speci?ed as “computer-related” based on a Word sequence “mother board”. Accordingly, as a list L5 shoWn in FIG. 5(b), the correct Word candidates are sorted for display such that a Word “chipset” is placed in a preferential position on the list, ie in the uppermost position on the list in the ?gure.

[0066] FIG. 6(a) shoWs an example of a list L6 that is produced When a Word “brWser” is judged to be misspelled, and a compound Word or a subject area is not speci?ed. On the list L6, correct Word candidates are displayed in order of ?nding of the candidates by a spell-check logic. On the other hand, in case the Word “brWser” is judged to be misspelled in a Word sequence as “Web brWser”, if a Word sequence “Web broWser” is registered as a compound Word, as a list L7 shoWn in FIG. 6(b), the correct Word candidates are sorted for display such that a Word “broWser” is placed in a preferential position on the list, ie in the uppermost position on the list in the ?gure.

[0067] FIGS. 7 and 8 shoW display examples of lists produced When a Word “Nome” is judged to be misspelled in a sentence as “Nome struck out Sammy Sosa.”. If correct Word candidates are displayed according to the conventional logic or algorithm, it Will be a list L8 as shoWn in FIG. 7(a). On the other hand, if a Word sequence “Sammy Sosa” is registered in the domain dictionary 53, it can be assumed that the sentence concerns baseball (sports), and the Word “Nome” represents a baseball player “Nomo”. FIG. 7(b) shoWs an example of a list L9.that is displayed in such an event. On the list L9, the correct Word candidates are sorted for display such that a Word “Nomo” is placed in a prefer ential position on the list, ie in the uppermost position on the list in the ?gure.

[0068] FIG. 8 shoWs a display example on the display screen of the display unit 3 of the computer apparatus 1 in that event.

Page 16: (75> whimMayahirwmawishim; E23 313 ...

US 2003/0145285 A1

[0069] FIG. 9 shows a display example of a list produced When a Word “Nome” is judged to be misspelled in a sentence as “Hideo Nome is a pitcher.”. If correct Word candidates are displayed according to the conventional logic or algorithm, it Will be the list L8 as shoWn in FIG. 7(a). On the other hand, if a Word sequence “Hideo Nomo” is registered in the compound Word dictionary 51b, 52b, 53b (not limited to the domain dictionary 53), it can be assumed that the Word “Nome” represents a baseball player “Nomo”. Also in this event, the list L9 as shoWn in FIG. 7(b) is displayed, Wherein the correct Word candidates are sorted for display such that the Word “Nomo” is placed in a preferential position on the list, ie in the uppermost position on the list in the ?gure.

[0070] FIG. 9 shoWs the display example on the display screen of the display unit 3 of the computer apparatus 1 in that event.

[0071] As described above, according to the present invention, if a Word sequence composed of a candidate for a correct Word relative to a Word subjected to spell checking and a Word/Words before or after the Word subjected to spell checking or on each side of the Word subjected to spell checking is registered as a compound Word in the compound Word dictionary 51b, 52b, 53b, the correct Word candidate forming the compound Word is placed in a preferential position on the list, While, if a subject area of the content of a sentence can be speci?ed based on the original text before and after a Word subjected to spell checking, a correct Word candidate that belongs to such a subject area is placed in a preferential position on the list. In this manner, When a Word sequence is registered as a compound Word or a subject area can be speci?ed, a correct Word candidate list With higher accuracy can be displayed.

[0072] In the foregoing preferred embodiment, the domain dictionaries 53 and the compound Word dictionaries 51b, 52b, 53b are both provided, but only one of them may be provided. Further, it may also be arranged that the foregoing spell-check processing is executed not only When a user carries out a particular operation, but also at predetermined timing during the translation processing. In addition, the foregoing processing of displaying a list of correct Word candidates upon spell checking is applicable to not only a translation program, but also any program that alloWs an input of a text and has a spell-check function. Further, the program that executes the processing as described in the foregoing preferred embodiment may also be in the form of a storage medium. Speci?cally, a program that makes a computer apparatus execute the foregoing processing may be stored in a storage medium such as a CD-ROM, DVD, memory or hard disk, in a computer readable manner.

[0073] Other than the foregoing, it is possible to selec tively use or properly modify the con?guration described in the foregoing preferred embodiment Without departing from the gist of the present invention.

[0074] As described above, according to the present invention, it is possible to display correct Word candidates With high accuracy.

What is claimed is: 1. A method of displaying candidates for a correct Word

When there is misspelling in an original text input into a computer apparatus, said method comprising the steps of:

Jul. 31, 2003

Determining Whether or not a candidate for a correct Word relative to a misspelled Word and one or more Words before or after said misspelled Word or on each side of said misspelled Word in the original text are included in a Word sequence registered in advance; and

causing, When included in said Word sequence, said computer apparatus to display the correct Word candi date included in said Word sequence in a preferential position.

2. A method according to claim 1, further comprising the steps of:

determining Whether or not a subject area of content of the original text including the misspelled Word is speci?ed based on a Word or a Word sequence forming said original text; and

causing, When the subject area is speci?ed, said computer apparatus to display a correct Word candidate belonging to said subject area in a preferential position.

3. A method according to claim 1, Wherein each of said steps is executed When performing spell-check processing of the original text input into said computer apparatus.

4. A spell checking method comprising the steps of:

checking Whether or not there is misspelling in a Word forming an original text input into a computer appara tus; and

displaying Words as candidates for a correct Word relative to the misspelled Word,

Wherein, in said displaying step, (a) if a subject area of content of the original text can be speci?ed based on the original text before and after the misspelled Word, and a Word in the correct Word candidates is included in said subject area, or (b) if a Word sequence registered in advance is formed by one or more Words before or after said misspelled Word or on each side of said misspelled Word in the original text and a Word in the correct Word candidates, a list of the Words as the correct Word candidates is displayed such that the Word satisfying said (a) or (b) is placed in a preferential position on the list.

5. A computer apparatus comprising:

an input section for inputting data of an original text;

a list output section for extracting candidates for a correct Word relative to a misspelled Word in the input original text, and outputting a list of the extracted correct Word candidates; and

a database storing dictionary data,

Wherein said database comprises domain databases stor ing Words for respective subject areas, and

said list output section determines priority of the correct Word candidates on the list based on a Word stored in said domain database.

6. A computer apparatus comprising:

an input section for inputting data of an original text;

a list output section for extracting candidates for a correct Word relative to a misspelled Word in the input original text, and outputting a list of the extracted correct Word candidates; and

Page 17: (75> whimMayahirwmawishim; E23 313 ...

US 2003/0145285 A1 Jul. 31, 2003 6

a database storing dictionary data, 7. A computer apparatus according to claim 5 or 6,

wherein Said database Comprises a Word Sequence data_ wherein said list output section sorts the correct Word base Storing Word Sequences each Composed of a Candidates On the Such the Correct Word Candidate plurality of Words, and With high priority is placed in a preferential position on the

said list output section deterrnines priority of the correct 11st as Compared Wlth other Candldates' Word candidates on the list based on a Word sequence stored in said Word sequence database. * * * * *