Kyaw Kyaw Zin | 1 Apr 2011 09:42
Picon

Light A Candle With SocialKonnekt And Pray For Japan

Hi  moses-support@... ,

I just light a candle for Japan Victims.
Join us to pray for those who have lost their lives and hope for the best for those who have survived.
It is time to light a candle and Pray...

Please Light a Candle Now at: http://www.socialkonnekt.com/Tsunami/

Warm Regards,
Kyaw Kyaw Zin 
Tomas Hudik | 1 Apr 2011 13:52
Favicon

-threads option and segmentation fail

Hi there,
When I run command :
moses -threads 10 -v 4 -f moses.ini < oo > aa
Defined parameters (per moses.ini or switch):
        config: moses.ini 
        distortion-file: 0-0 wbe-msd-bidirectional-fe-allff 6
/home/moses/reordering-table.wbe-msd-bidirectional-fe.gz 
        distortion-limit: 6 
        input-factors: 0 
        lmodel-file: 1 0 3 /home/moses/LM.qblm 
        mapping: 0 T 0 
        threads: 10 
        ttable-file: 0 0 0 5 /home/moses/phrase-table.gz 
        ttable-limit: 20 
        verbose: 4 
        weight-d: 0.3 0.3 0.3 0.3 0.3 0.3 0.3 
        weight-l: 0.5000 
        weight-t: 0.2 0.2 0.2 0.2 0.2 
        weight-w: -1
...
It loads LM and TM normally, however, during decoding :
...
[files ; 1-1]
Translating line 5  in thread id 1163602240
Translating: File Open Options 

Translation Option Collection

Translating line 4  in thread id 1153112384
Translating: Productstream item properties 
(Continue reading)

Barry Haddow | 1 Apr 2011 14:01
Picon
Picon
Favicon

Re: -threads option and segmentation fail

Hi Tomas

You're using irstlm, which isn't thread-safe. You need to either switch to a 
different LM (SRILM memory, kenlm and randlm are all ok), or use 
single-threaded moses,

best regards - Barry

On Friday 01 April 2011 12:52, Tomas Hudik wrote:
> Hi there,
> When I run command :
> moses -threads 10 -v 4 -f moses.ini < oo > aa
> Defined parameters (per moses.ini or switch):
>         config: moses.ini
>         distortion-file: 0-0 wbe-msd-bidirectional-fe-allff 6
> /home/moses/reordering-table.wbe-msd-bidirectional-fe.gz
>         distortion-limit: 6
>         input-factors: 0
>         lmodel-file: 1 0 3 /home/moses/LM.qblm
>         mapping: 0 T 0
>         threads: 10
>         ttable-file: 0 0 0 5 /home/moses/phrase-table.gz
>         ttable-limit: 20
>         verbose: 4
>         weight-d: 0.3 0.3 0.3 0.3 0.3 0.3 0.3
>         weight-l: 0.5000
>         weight-t: 0.2 0.2 0.2 0.2 0.2
>         weight-w: -1
> ...
> It loads LM and TM normally, however, during decoding :
(Continue reading)

Anita Gojun | 5 Apr 2011 08:00
Picon

Mert on Lattices using EMS




It worked. Including weight-i in moses.ini file and then removing
it from the decoder-settings worked !

I am also able to run it through EMS  now.

Thanks a lot.

Regards,
Sriram

On Thu, Mar 31, 2011 at 3:09 PM, Sriram V <sriram.venkatapathy-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org> wrote:
> I had mentioned that explicitly among the decoder flags. I shall check
> by including it in the moses.ini file too. Thanks.
>
> - Sriram
>
> On Thu, Mar 31, 2011 at 2:37 PM, Anita Gojun <anita.hr <at> gmail.com> wrote:
>> Sorry, I forgot to mention that in moses.ini, the following lines must be
>> included:
>>
>> [weight-i]
>> 0
>>
>>
>> Cheers,
>>
>> Anita
>>
>>
>> 2011/3/31 Sriram V <sriram.venkatapathy-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
>>>
>>> Thanks Anita. I tried the fix that you suggested but it didn't work for me
>>> for some reason. I am trying to see if there is any other way of getting
>>> around this.
>>>
>>> On Thu, Mar 31, 2011 at 11:14 AM, Anita Gojun <anita.hr <at> gmail.com> wrote:
>>> > Try to add the lines that I mentioned in my message to mert-moses.pl.
>>> > I added them to the subroutine "scan_config" after the lines:
>>> >
>>> >   push <at> {$used_triples{"d"}}, [1.0, 0.0, 2.0]
>>> >     if defined $config_weights->{"d"}
>>> >       && (!defined $used_triples{"d"}
>>> >          || scalar <at> {$config_weights->{"d"}}
>>> >             == scalar <at> {$used_triples{"d"}} +1);
>>> >
>>> > So, I got:
>>> >
>>> >   push <at> {$used_triples{"d"}}, [1.0, 0.0, 2.0]
>>> >     if defined $config_weights->{"d"}
>>> >       && (!defined $used_triples{"d"}
>>> >          || scalar <at> {$config_weights->{"d"}}
>>> >             == scalar <at> {$used_triples{"d"}} +1);
>>> >
>>> >  push <at> {$used_triples{"I"}}, [0.0, 0.0, 0.0]
>>> >     if defined $config_weights->{"I"}
>>> >       && (!defined $used_triples{"I"}
>>> >           || scalar <at> {$config_weights->{"I"}}
>>> >              == scalar <at> {$used_triples{"I"}} +1);
>>> >
>>> >   # check the weights provided in the ini file and plug them into the
>>> > triples
>>> >   # if --starting-weights-from_ini
>>> >   foreach my $weightname (keys %used_triples) {
>>> >     if (!defined $config_weights->{$weightname}) {
>>> >       print STDERR "$inishortname:Model requires weights '$weightname'
>>> > but
>>> > none were found in the ini file.\n";
>>> >       $error = 1;
>>> >       next;
>>> >     }
>>> > ...
>>> >
>>> > After I did that, the tuning did not abort any more with the message
>>> > about
>>> > an undefined array value. I'm not happy with this solution but it
>>> > works...
>>> >
>>> > Let me know if you were successfull!
>>> >
>>> >
>>> >
>>> > 2011/3/31 Sriram V <sriram.venkatapathy-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
>>> >>
>>> >> Thanks Anita. Yes, I had seen your earlier message, and as you had
>>> >> mentioned in that message, I had made sure I used the flag
>>> >> --no-filter-phrase-table
>>> >>
>>> >> And, yes,  I had changed the first number in moses.ini to 1 to specify
>>> >> that
>>> >> the binary phrase table should be used.
>>> >>
>>> >> The error that I got when I reran EMS with the new moses.ini is exactly
>>> >> the same as the one you got. Here are the last few lines in the file
>>> >> "TUNING_tune.2.STDERR".
>>> >>
>>> >> -------------
>>> >>
>>> >> Scoring the nbestlist.
>>> >> Executing: /home/svenkata/tools/mosesdecoder/trunk/mert/extractor
>>> >> --scconfig case:true --scfile run1.scores.dat --ffile
>>> >> run1.features.dat -r ....../tuning/reference.tc.1 -n
>>> >> run1.best100.out.gz > extract.out 2> extract.err
>>> >> Can't use an undefined value as an ARRAY reference at
>>> >>
>>> >>
>>> >> /home/svenkata/tools/mosesdecoder/trunk/scripts/scripts-20110317-1219/training/mert-moses.pl
>>> >> line 684.
>>> >> cp: cannot stat `......./working-dir-lattice/tuning/tmp.2/moses.ini':
>>> >> No such file or directory
>>> >>
>>> >> --------------
>>> >>
>>> >> Also, I see that there is no output in extract.out.
>>> >>
>>> >> - Sriram
>>> >>
>>> >> On Thu, Mar 31, 2011 at 8:32 AM, Anita Gojun <anita.hr <at> gmail.com>
>>> >> wrote:
>>> >> > Hello Sriram,
>>> >> >
>>> >> > I'm not familiar with tuning  in EMS but I experienced some other
>>> >> > problems
>>> >> > with tuning on lattices by using directly mert-moses.pl.
>>> >> > (See http://www.mail-archive.com/moses-support-3s7WtUTddSA@public.gmane.org/msg03707.html)
>>> >> >
>>> >> > When you change the moses configuration file moses.ini, do you
>>> >> > indicate
>>> >> > that
>>> >> > a binary phrase table is to be used (first number should be "1")?
>>> >> >
>>> >> > [ttable-file]
>>> >> > 1 0 0 5 phrase-table
>>> >> >
>>> >> >
>>> >> > Cheers,
>>> >> >
>>> >> > Anita
>>> >> >
>>> >> >
>>> >> >
>>> >> > 2011/3/30 Sriram V <sriram.venkatapathy-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
>>> >> >>
>>> >> >> Hi,
>>> >> >>
>>> >> >> I have been trying to do Mert on lattices. I set the appropriate
>>> >> >> flags,
>>> >> >> --------------
>>> >> >> tuning-settings = "-mertdir $moses-src-dir/mert
>>> >> >> --no-filter-phrase-table"
>>> >> >> decoder-settings = "-inputtype 2 -weight-i 0"
>>> >> >> --------------
>>> >> >>
>>> >> >> But the tuning step crashes as it does not find a binary phrase
>>> >> >> table.
>>> >> >> There
>>> >> >> is an appropriate binarization step indicated in
>>> >> >> $SCRIPTS_ROOTDIR/ems/experiment.meta
>>> >> >> which is :
>>> >> >>
>>> >> >> --------------
>>> >> >> binarize-config
>>> >> >>        in: config
>>> >> >>        out: bin-config
>>> >> >>        pass-unless: binarize-all
>>> >> >>        rerun-on-change: config
>>> >> >>        default-name: model/moses.bin.ini
>>> >> >>        template: $binarize-all $ttable-binarizer $rtable-binarizer
>>> >> >> OUT
>>> >> >> IN
>>> >> >> ---------------
>>> >> >>
>>> >> >> But, ems doesn't make use of it, most definitely because
>>> >> >> $binarize-all
>>> >> >> is
>>> >> >> not
>>> >> >> defined. Any idea about the script that's being referred by
>>> >> >> $binarize-all
>>> >> >> ?
>>> >> >>
>>> >> >> As an alternative, I tried binarizing seperately after the crash,
>>> >> >> changed
>>> >> >> the
>>> >> >> configuration file and then asked EMS to use the new configuration
>>> >> >> file
>>> >> >> instead
>>> >> >> of training again but that seemed to crash  also for some reason. I
>>> >> >> am
>>> >> >> trying
>>> >> >> to figure out why.
>>> >> >>
>>> >> >> I saw a similar question posted earlier last year on moses-support
>>> >> >> but
>>> >> >> that
>>> >> >> didn't help me much. Any suggestions on this ?
>>> >> >>
>>> >> >> Regards,
>>> >> >> Sriram
>>> >> >> _______________________________________________
>>> >> >> Moses-support mailing list
>>> >> >> Moses-support-3s7WtUTddSA@public.gmane.org
>>> >> >> http://mailman.mit.edu/mailman/listinfo/moses-support
>>> >> >
>>> >> >
>>> >> >
>>> >> > --
>>> >> > "Das Universum ist der Gott und wir sind seine Träume." - A.
>>> >> > Eschbach,
>>> >> > Quest
>>> >> >
>>> >
>>> >
>>> >
>>> > --
>>> > "Das Universum ist der Gott und wir sind seine Träume." - A. Eschbach,
>>> > Quest
>>> >
>>
>>
>>
>> --
>> "Das Universum ist der Gott und wir sind seine Träume." - A. Eschbach, Quest
>>
>



--
"Das Universum ist der Gott und wir sind seine Träume." - A. Eschbach, Quest
_______________________________________________
Moses-support mailing list
Moses-support@...
http://mailman.mit.edu/mailman/listinfo/moses-support
Cyrine NASRI | 5 Apr 2011 17:10
Picon

translation table based on words

Hello everyone,
Is there a way to extract the translation table based only on words. ie that the translation table contains only words.

Thank you

Bests
_______________________________________________
Moses-support mailing list
Moses-support@...
http://mailman.mit.edu/mailman/listinfo/moses-support
Philipp Koehn | 5 Apr 2011 18:15
Picon
Picon
Favicon

Re: translation table based on words

Hi,

yes, you could train with "-max-phrase-length 1".

-phi

On Tue, Apr 5, 2011 at 4:10 PM, Cyrine NASRI <cyrine.nasri@...> wrote:
> Hello everyone,
> Is there a way to extract the translation table based only on words. ie that
> the translation table contains only words.
>
> Thank you
>
> Bests
>
> _______________________________________________
> Moses-support mailing list
> Moses-support@...
> http://mailman.mit.edu/mailman/listinfo/moses-support
>
>
Philipp Koehn | 5 Apr 2011 19:37
Picon
Picon
Favicon

Re: translation table based on words

Hi,

it would be a rather pointless feature, but it does no harm.

Similarly, the lexical translation probabilities will be very
similar (but not identical, due to many-to-many alignments).

-phi

On Tue, Apr 5, 2011 at 5:58 PM, Cyrine NASRI <cyrine.nasri@...> wrote:
> Hello,
> Other question :
> in the translation table based on words, is that the value 2.718 (PENALITY
> sentence) is still valid?
>
> Thank you
>
> 2011/4/5 Cyrine NASRI <cyrine.nasri@...>
>>
>> Thank you very much
>>
>> 2011/4/5 Philipp Koehn <pkoehn@...>
>>>
>>> Hi,
>>>
>>> yes, you could train with "-max-phrase-length 1".
>>>
>>> -phi
>>>
>>> On Tue, Apr 5, 2011 at 4:10 PM, Cyrine NASRI <cyrine.nasri@...>
>>> wrote:
>>> > Hello everyone,
>>> > Is there a way to extract the translation table based only on words. ie
>>> > that
>>> > the translation table contains only words.
>>> >
>>> > Thank you
>>> >
>>> > Bests
>>> >
>>> > _______________________________________________
>>> > Moses-support mailing list
>>> > Moses-support@...
>>> > http://mailman.mit.edu/mailman/listinfo/moses-support
>>> >
>
>
nakul sharma | 6 Apr 2011 11:47
Picon

Mix of source and target languages in Moses translation

Hi All,
 
i am undertaking English to regional language translation.
i have tuned the corpus using mert-moses.pl using following command:-
 
 ./mert-moses.pl corpus/corpus.lowercased.en corpus/corpus.lowercased.hi  /home/nakul/moses/mosesdecoder/trunk/moses-cmd/src/moses model/moses.ini --working-dir corpus/tuning/mert  --rootdir /home/nakul/moses/mosesdecoder/trunk/scripts/training/moses-scripts/scripts-20110405-1055/ --decoder-flags "-v 0" >& mert.out &
 
the moses.ini, i got after training (at corpus/tuning/mert) has following contents :-
 
 
# MERT optimized configuration
# decoder /home/nakul/moses/mosesdecoder/trunk/moses-cmd/src/moses
# BLEU 0.612451 -> 0.612451 on dev /home/nakul/moses/mosesdecoder/trunk/scripts/training/moses-scripts/scripts-20110405-1055/training/corpus/corpus.lowercased.en
# We were before running iteration 8
# finished बुध  अप्रेल  6 14:40:28 IST 2011
### MOSES CONFIG FILE ###
#########################
 
# input factors
[input-factors]
0
 
# mapping steps
[mapping]
0 T 0
 
# translation tables: source-factors, target-factors, number of scores, file 
[ttable-file]
0 0 0 5 /home/nakul/moses/mosesdecoder/trunk/scripts/training/moses-scripts/scripts-20110405-1055/training/model/phrase-table.gz
 
# no generation models, no generation-file section
 
# language models: type(srilm/irstlm), factors, order, file
[lmodel-file]
0 0 3 /home/nakul/moses/mosesdecoder/trunk/scripts/training/lm_hin.lm
 
 
# limit on how many phrase translations e for each phrase f are loaded
# 0 = all elements loaded
[ttable-limit]
20
 
# distortion (reordering) files
[distortion-file]
0-0 wbe-msd-bidirectional-fe-allff 6 /home/nakul/moses/mosesdecoder/trunk/scripts/training/moses-scripts/scripts-20110405-1055/training/model/reordering-table.wbe-msd-bidirectional-fe.gz
 
# distortion (reordering) weight
[weight-d]
0.009645
0.021906
0.008725
0.032902
0.019846
0.002462
0.016001
 
# language model weights
[weight-l]
0.022668
 
 
# translation model weights
[weight-t]
-0.008632
0.019782
0.228404
-0.005989
-0.499862
 
# no generation models, no weight-generation section
 
# word penalty
[weight-w]
-0.103176
 
[distortion-limit]
6
 
[v]
0
 
 
the contents of initial moses.ini is as follows:-
 
#########################
### MOSES CONFIG FILE ###
#########################
 
# input factors
[input-factors]
0
 
# mapping steps
[mapping]
0 T 0
 
# translation tables: source-factors, target-factors, number of scores, file 
[ttable-file]
0 0 0 5 /home/nakul/moses/mosesdecoder/trunk/scripts/training/moses-scripts/scripts-20110405-1055/training/model/phrase-table.gz
 
# no generation models, no generation-file section
 
# language models: type(srilm/irstlm), factors, order, file
[lmodel-file]
0 0 3 /home/nakul/moses/mosesdecoder/trunk/scripts/training/lm_hin.lm
 
 
# limit on how many phrase translations e for each phrase f are loaded
# 0 = all elements loaded
[ttable-limit]
20
 
# distortion (reordering) files
[distortion-file]
0-0 wbe-msd-bidirectional-fe-allff 6 /home/nakul/moses/mosesdecoder/trunk/scripts/training/moses-scripts/scripts-20110405-1055/training/model/reordering-table.wbe-msd-bidirectional-fe.gz
 
# distortion (reordering) weight
[weight-d]
0.3
0.3
0.3
0.3
0.3
0.3
0.3
 
# language model weights
[weight-l]
0.5000
 
 
# translation model weights
[weight-t]
0.2
0.2
0.2
0.2
0.2
 
# no generation models, no weight-generation section
 
# word penalty
[weight-w]
-1
 
[distortion-limit]
6
 
upon undertaking the translation i get mix of regional lang and english words as follows:-
echo "Even amendments would require recommendation." | TMP=/tmp /home/nakul/moses/mosesdecoder/trunk/moses-cmd/src/moses -f /home/nakul/moses/mosesdecoder/trunk/scripts/training/moses-scripts/scripts-20110405-1055/training/corpus/tuning/mert/moses.ini  >output.txt
Defined parameters (per moses.ini or switch):
    config: /home/nakul/moses/mosesdecoder/trunk/scripts/training/moses-scripts/scripts-20110405-1055/training/corpus/tuning/mert/moses.ini 
    distortion-file: 0-0 wbe-msd-bidirectional-fe-allff 6 /home/nakul/moses/mosesdecoder/trunk/scripts/training/moses-scripts/scripts-20110405-1055/training/model/reordering-table.wbe-msd-bidirectional-fe.gz 
    distortion-limit: 6 
    input-factors: 0 
    lmodel-file: 0 0 3 /home/nakul/moses/mosesdecoder/trunk/scripts/training/lm_hin.lm 
    mapping: 0 T 0 
    ttable-file: 0 0 0 5 /home/nakul/moses/mosesdecoder/trunk/scripts/training/moses-scripts/scripts-20110405-1055/training/model/phrase-table.gz 
    ttable-limit: 20 
    v: 0 
    weight-d: 0.009645 0.021906 0.008725 0.032902 0.019846 0.002462 0.016001 
    weight-l: 0.022668 
    weight-t: -0.008632 0.019782 0.228404 -0.005989 -0.499862 
    weight-w: -0.103176 
Loading lexical distortion models...have 1 models
Creating lexical reordering...
weights: 0.022 0.009 0.033 0.020 0.002 0.016 
Loading table into memory...done.
Start loading LanguageModel /home/nakul/moses/mosesdecoder/trunk/scripts/training/lm_hin.lm : [1.000] seconds
/home/nakul/moses/mosesdecoder/trunk/scripts/training/lm_hin.lm: line 122: warning: non-zero probability for <unk> in closed-vocabulary LM
Finished loading LanguageModels : [1.000] seconds
Start loading PhraseTable /home/nakul/moses/mosesdecoder/trunk/scripts/training/moses-scripts/scripts-20110405-1055/training/model/phrase-table.gz : [1.000] seconds
filePath: /home/nakul/moses/mosesdecoder/trunk/scripts/training/moses-scripts/scripts-20110405-1055/training/model/phrase-table.gz
Finished loading phrase tables : [1.000] seconds
IO from STDOUT/STDIN
Created input-output object : [1.000] seconds
Translating: Even amendments would require recommendation. 
 
Collecting options took 0.000 seconds
Search took 0.000 seconds
BEST TRANSLATION: संशोधनों सहित Even|UNK|UNK|UNK प्रभाव require|UNK|UNK|UNK recommendation.|UNK|UNK|UNK [11111]  [total=-301.895] <<-4.000, -6.000, -300.000, 0.000, 0.000, -1.022, -0.511, -1.609, 0.000, -48.665, -1.792, -2.303, -1.386, -3.624, 2.000>>
Translation took 0.000 seconds
Finished translating
End. : [1.000] seconds
 
 
Please tell how to improve upon this translation, i have already tuned the system but problem presists.
 


--
Thanks & Regards,
nakul

_______________________________________________
Moses-support mailing list
Moses-support@...
http://mailman.mit.edu/mailman/listinfo/moses-support
Barry Haddow | 6 Apr 2011 12:55
Picon
Picon
Favicon

Re: Mix of source and target languages in Moses translation

Hi Nakul

Some of your source words are untranslated because there are no corresponding 
entries in your phrase table. If moses cannot find a translation in the 
phrase table, it cannot translate it, and by default will pass it straight 
through. If you don't want unknowns in the output, then use the -drop-unknown 
flag.

In particular, the following source tokens are unknown:

Even
require
recommendation.

You may want to lowercase and tokenise the source before passing it to moses,

best regards - Barry

On Wednesday 06 April 2011 10:47, nakul sharma wrote:
> Hi All,
>
> i am undertaking English to regional language translation.
> i have tuned the corpus using mert-moses.pl using following command:-
>
>  ./mert-moses.pl corpus/corpus.lowercased.en corpus/corpus.lowercased.hi
> /home/nakul/moses/mosesdecoder/trunk/moses-cmd/src/moses model/moses.ini
> --working-dir corpus/tuning/mert  --rootdir
> /home/nakul/moses/mosesdecoder/trunk/scripts/training/moses-scripts/scripts
>-20110405-1055/ --decoder-flags "-v 0" >& mert.out &
>
> the moses.ini, i got after training (at corpus/tuning/mert) has following
> contents :-
>
>
> # MERT optimized configuration
> # decoder /home/nakul/moses/mosesdecoder/trunk/moses-cmd/src/moses
> # BLEU 0.612451 -> 0.612451 on dev
> /home/nakul/moses/mosesdecoder/trunk/scripts/training/moses-scripts/scripts
>-20110405-1055/training/corpus/corpus.lowercased.en
>
> # We were before running iteration 8
> # finished बुध  अप्रेल  6 14:40:28 IST 2011
> ### MOSES CONFIG FILE ###
> #########################
>
> # input factors
> [input-factors]
> 0
>
> # mapping steps
> [mapping]
> 0 T 0
>
> # translation tables: source-factors, target-factors, number of scores,
> file
> [ttable-file]
> 0 0 0 5
> /home/nakul/moses/mosesdecoder/trunk/scripts/training/moses-scripts/scripts
>-20110405-1055/training/model/phrase-table.gz
>
>
> # no generation models, no generation-file section
>
> # language models: type(srilm/irstlm), factors, order, file
> [lmodel-file]
> 0 0 3 /home/nakul/moses/mosesdecoder/trunk/scripts/training/lm_hin.lm
>
>
> # limit on how many phrase translations e for each phrase f are loaded
> # 0 = all elements loaded
> [ttable-limit]
> 20
>
> # distortion (reordering) files
> [distortion-file]
> 0-0 wbe-msd-bidirectional-fe-allff 6
> /home/nakul/moses/mosesdecoder/trunk/scripts/training/moses-scripts/scripts
>-20110405-1055/training/model/reordering-table.wbe-msd-bidirectional-fe.gz
>
>
> # distortion (reordering) weight
> [weight-d]
> 0.009645
> 0.021906
> 0.008725
> 0.032902
> 0.019846
> 0.002462
> 0.016001
>
> # language model weights
> [weight-l]
> 0.022668
>
>
> # translation model weights
> [weight-t]
> -0.008632
> 0.019782
> 0.228404
> -0.005989
> -0.499862
>
> # no generation models, no weight-generation section
>
> # word penalty
> [weight-w]
> -0.103176
>
> [distortion-limit]
> 6
>
> [v]
> 0
>
>
> the contents of initial moses.ini is as follows:-
>
> #########################
> ### MOSES CONFIG FILE ###
> #########################
>
> # input factors
> [input-factors]
> 0
>
> # mapping steps
> [mapping]
> 0 T 0
>
> # translation tables: source-factors, target-factors, number of scores,
> file
> [ttable-file]
> 0 0 0 5
> /home/nakul/moses/mosesdecoder/trunk/scripts/training/moses-scripts/scripts
>-20110405-1055/training/model/phrase-table.gz
>
>
> # no generation models, no generation-file section
>
> # language models: type(srilm/irstlm), factors, order, file
> [lmodel-file]
> 0 0 3 /home/nakul/moses/mosesdecoder/trunk/scripts/training/lm_hin.lm
>
>
> # limit on how many phrase translations e for each phrase f are loaded
> # 0 = all elements loaded
> [ttable-limit]
> 20
>
> # distortion (reordering) files
> [distortion-file]
> 0-0 wbe-msd-bidirectional-fe-allff 6
> /home/nakul/moses/mosesdecoder/trunk/scripts/training/moses-scripts/scripts
>-20110405-1055/training/model/reordering-table.wbe-msd-bidirectional-fe.gz
>
>
> # distortion (reordering) weight
> [weight-d]
> 0.3
> 0.3
> 0.3
> 0.3
> 0.3
> 0.3
> 0.3
>
> # language model weights
> [weight-l]
> 0.5000
>
>
> # translation model weights
> [weight-t]
> 0.2
> 0.2
> 0.2
> 0.2
> 0.2
>
> # no generation models, no weight-generation section
>
> # word penalty
> [weight-w]
> -1
>
> [distortion-limit]
> 6
>
> upon undertaking the translation i get mix of regional lang and english
> words as follows:-
> echo "Even amendments would require recommendation." | TMP=/tmp
> /home/nakul/moses/mosesdecoder/trunk/moses-cmd/src/moses -f
> /home/nakul/moses/mosesdecoder/trunk/scripts/training/moses-scripts/scripts
>-20110405-1055/training/corpus/tuning/mert/moses.ini
>
> >output.txt
>
> Defined parameters (per moses.ini or switch):
>     config:
> /home/nakul/moses/mosesdecoder/trunk/scripts/training/moses-scripts/scripts
>-20110405-1055/training/corpus/tuning/mert/moses.ini
>
>     distortion-file: 0-0 wbe-msd-bidirectional-fe-allff 6
> /home/nakul/moses/mosesdecoder/trunk/scripts/training/moses-scripts/scripts
>-20110405-1055/training/model/reordering-table.wbe-msd-bidirectional-fe.gz
>
>     distortion-limit: 6
>     input-factors: 0
>     lmodel-file: 0 0 3
> /home/nakul/moses/mosesdecoder/trunk/scripts/training/lm_hin.lm
>     mapping: 0 T 0
>     ttable-file: 0 0 0 5
> /home/nakul/moses/mosesdecoder/trunk/scripts/training/moses-scripts/scripts
>-20110405-1055/training/model/phrase-table.gz
>
>     ttable-limit: 20
>     v: 0
>     weight-d: 0.009645 0.021906 0.008725 0.032902 0.019846 0.002462
> 0.016001
>     weight-l: 0.022668
>     weight-t: -0.008632 0.019782 0.228404 -0.005989 -0.499862
>     weight-w: -0.103176
> Loading lexical distortion models...have 1 models
> Creating lexical reordering...
> weights: 0.022 0.009 0.033 0.020 0.002 0.016
> Loading table into memory...done.
> Start loading LanguageModel
> /home/nakul/moses/mosesdecoder/trunk/scripts/training/lm_hin.lm : [1.000]
> seconds
> /home/nakul/moses/mosesdecoder/trunk/scripts/training/lm_hin.lm: line 122:
> warning: non-zero probability for <unk> in closed-vocabulary LM
> Finished loading LanguageModels : [1.000] seconds
> Start loading PhraseTable
> /home/nakul/moses/mosesdecoder/trunk/scripts/training/moses-scripts/scripts
>-20110405-1055/training/model/phrase-table.gz
>
> : [1.000] seconds
>
> filePath:
> /home/nakul/moses/mosesdecoder/trunk/scripts/training/moses-scripts/scripts
>-20110405-1055/training/model/phrase-table.gz
>
> Finished loading phrase tables : [1.000] seconds
> IO from STDOUT/STDIN
> Created input-output object : [1.000] seconds
> Translating: Even amendments would require recommendation.
>
> Collecting options took 0.000 seconds
> Search took 0.000 seconds
> BEST TRANSLATION: संशोधनों सहित Even|UNK|UNK|UNK प्रभाव require|UNK|UNK|UNK
> recommendation.|UNK|UNK|UNK [11111]  [total=-301.895] <<-4.000, -6.000,
> -300.000, 0.000, 0.000, -1.022, -0.511, -1.609, 0.000, -48.665, -1.792,
> -2.303, -1.386, -3.624, 2.000>>
> Translation took 0.000 seconds
> Finished translating
> End. : [1.000] seconds
>
>
> Please tell how to improve upon this translation, i have already tuned the
> system but problem presists.

--

-- 
The University of Edinburgh is a charitable body, registered in
Scotland, with registration number SC005336.

_______________________________________________
Moses-support mailing list
Moses-support <at> mit.edu
http://mailman.mit.edu/mailman/listinfo/moses-support
Soldatovs | 7 Apr 2011 12:11
Picon

(no subject)

Hello all,

I'm new to Moses, so I beg my pardon for a possibly stupid question.

My input is:
1. already translated bilingual documents
2. a glossary

What I want to do using Moses is:
1. Train the engine using the bilingual documents I already have.
2. Produce translations with entries from the glossary having higher priority over the engine's proposals.

Example.
Assume I translate from English to Norwegian. In my en-US - nb-NO glossary, I have a term
terminology processing - behandling av terminologi

So, if I feed Moses with a sentence beginning with 'The most important issues of terminology processing are...', I want to get something like 'De viktigste problemene av behandling av terminologi er...', but NOT something like 'De viktigste problemene av behandling av fagord er...'.

Of course, I understand that there's no problem in training Moses, but what I want to ask you is:
1. Is it possible to solve the step 2 of the problem with Moses?
2. If the solution already exists, could you please share any links to appropriate manuals?

Thanks for help in advance.

Best regards,
Anton 
 
_______________________________________________
Moses-support mailing list
Moses-support@...
http://mailman.mit.edu/mailman/listinfo/moses-support

Gmane