# Batched Seq2Seq Example

Based on the `seq2seq-translation-batched.ipynb` from practical-pytorch, but with extra features.
This example runs a grammatical error correction task, where the source sequence is a grammatically erroneous English sentence and the target sequence is a grammatically correct English sentence. The corpus and evaluation script can be downloaded at: https://github.com/keisks/jfleg.
## Extra features

- Cleaner codebase
- Very detailed comments for learners
- Implements a PyTorch-native dataset and dataloader for batching
- Correctly handles the hidden state from the bidirectional encoder and passes it to the decoder as the initial hidden state
- Fully batched attention-mechanism computation (only general attention is implemented, but it's sufficient; see the sketch after this list). Note: the original code still uses a for-loop to compute attention, which is very slow
- Supports LSTM instead of only GRU
- Shared embeddings (encoder's input embedding and decoder's input embedding)
- Pretrained GloVe embeddings
- Fixed embeddings
- Tied embeddings (decoder's input embedding and decoder's output embedding)
- Tensorboard visualization
- Load and save checkpoints
- Replaces unknown words by selecting the source token with the highest attention score (translation)
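Below is a minimal sketch (not this repo's exact code) of two of the bullets above: merging a bidirectional encoder's final hidden state into a decoder initial state, and Luong-style general attention computed for the whole batch with `torch.bmm` instead of a Python loop. All tensor names and shapes are illustrative assumptions.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

def encoder_final_to_decoder_init(h_n):
    """Merge a bidirectional encoder's final states into a decoder init state.

    h_n: (num_layers * 2, batch, hidden) -> (num_layers, batch, hidden),
    summing forward and backward directions (one common choice).
    """
    layers_x2, batch, hidden = h_n.size()
    return h_n.view(layers_x2 // 2, 2, batch, hidden).sum(dim=1)

class GeneralAttention(nn.Module):
    """Luong 'general' attention, fully batched (no per-position for-loop)."""
    def __init__(self, hidden_size):
        super().__init__()
        self.linear_in = nn.Linear(hidden_size, hidden_size, bias=False)

    def forward(self, decoder_state, encoder_outputs):
        # decoder_state:   (batch, 1, hidden)       current decoder hidden state
        # encoder_outputs: (batch, src_len, hidden)
        # score(h_t, h_s) = h_t^T W h_s, computed for all source positions at once
        scores = torch.bmm(self.linear_in(decoder_state),
                           encoder_outputs.transpose(1, 2))  # (batch, 1, src_len)
        attn_weights = F.softmax(scores, dim=-1)
        context = torch.bmm(attn_weights, encoder_outputs)   # (batch, 1, hidden)
        return context, attn_weights
```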
## Cons

Compared to the state-of-the-art seq2seq library OpenNMT-py, some things aren't optimized in this codebase:

- Use cuDNN when possible (always on the encoder; on the decoder when `input_feed=0`)
- Always avoid indexing/loops and use torch primitives
- When possible, batch softmax operations across time (this is the second most complicated part of the code; see the sketch after this list)
- Batched inference and beam search for translation (this is the most complicated part of the code)
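To illustrate the "batch softmax operations across time" item, one common trick (a sketch assuming the decoder states for all timesteps have been stacked into one tensor; not OpenNMT-py's exact code) is to merge the time and batch dimensions, so a single matmul and a single softmax cover every timestep:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

def project_all_timesteps(decoder_outputs, out_proj):
    # decoder_outputs: (seq_len, batch, hidden) stacked decoder hidden states
    # out_proj:        nn.Linear(hidden, vocab_size)
    seq_len, batch, hidden = decoder_outputs.size()
    flat = decoder_outputs.view(seq_len * batch, hidden)  # merge time and batch
    log_probs = F.log_softmax(out_proj(flat), dim=-1)     # one softmax for all steps
    return log_probs.view(seq_len, batch, -1)

# Toy usage: 10 timesteps, batch of 4, hidden size 8, vocab size 100.
out_proj = nn.Linear(8, 100)
log_probs = project_all_timesteps(torch.randn(10, 4, 8), out_proj)
```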
## How to speed up RNN training?

Several ways to speed up RNN training:

- Batching
- Static padding
- Dynamic padding (see the sketch below)
- Bucketing
- Truncated BPTT

See "Sequence Models and the RNN API (TensorFlow Dev Summit 2017)" for an explanation of these techniques.
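As a concrete example of dynamic padding (with a crude, within-batch form of bucketing via length sorting), a `collate_fn` can pad each mini-batch only to that batch's longest sequence. This is a sketch assuming a dataset of variable-length token-ID tensors; the names are made up:

```python
import torch
from torch.nn.utils.rnn import pad_sequence
from torch.utils.data import DataLoader

def collate_batch(examples, pad_id=0):
    """Pad each batch only to its own longest sequence (dynamic padding)."""
    # Sorting by length keeps similar lengths together and satisfies
    # pack_padded_sequence's sorted-input requirement on older PyTorch.
    examples = sorted(examples, key=len, reverse=True)
    lengths = torch.tensor([len(seq) for seq in examples])
    padded = pad_sequence(examples, batch_first=True, padding_value=pad_id)
    return padded, lengths

data = [torch.randint(1, 100, (n,)) for n in (12, 7, 9, 15)]
loader = DataLoader(data, batch_size=2, collate_fn=collate_batch)
for padded, lengths in loader:
    pass  # padded: (batch, max_len_in_batch); lengths: (batch,)
```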
You can use torchtext or OpenNMT's data iterator to speed up training. It can be 7x faster! (e.g., 7 hours per epoch -> 1 hour!)
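For reference, here is a hedged sketch of the bucketing approach using the legacy (0.x-era) torchtext API, where `BucketIterator` groups similar-length examples to minimize padding. The file `train.tsv` and the field names are hypothetical:

```python
from torchtext.legacy.data import Field, TabularDataset, BucketIterator
# On torchtext <= 0.8 these classes live under torchtext.data instead.

SRC = Field(tokenize=str.split, init_token="<s>", eos_token="</s>", lower=True)
TGT = Field(tokenize=str.split, init_token="<s>", eos_token="</s>", lower=True)

# Hypothetical file: two tab-separated columns (source sentence, target sentence).
train = TabularDataset(path="train.tsv", format="tsv",
                       fields=[("src", SRC), ("tgt", TGT)])
SRC.build_vocab(train, min_freq=2)
TGT.build_vocab(train, min_freq=2)

# Batches are drawn from buckets of similar-length examples to minimize padding.
train_iter = BucketIterator(train, batch_size=64,
                            sort_key=lambda ex: len(ex.src),
                            sort_within_batch=True)
```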
## Acknowledgement

Thanks to the author of OpenNMT-py, @srush, for answering my questions! See https://github.com/OpenNMT/OpenNMT-py/issues/552.