:py:mod:`rannet.pretrain` ========================= .. py:module:: rannet.pretrain .. autoapi-nested-parse:: Pretrain RanNet Module Contents --------------- Functions ~~~~~~~~~ .. autoapisummary:: rannet.pretrain.split_sentences rannet.pretrain.cli rannet.pretrain.corpus rannet.pretrain.single_corpus rannet.pretrain.pretrain rannet.pretrain.export_checkpoint rannet.pretrain.main Attributes ~~~~~~~~~~ .. autoapisummary:: rannet.pretrain.gpus .. py:data:: gpus .. py:function:: split_sentences(text: str, max_length: int = 512, lang: str = 'english') .. py:function:: cli() rannet client .. py:function:: corpus(vocab_path: str, workers: int, min_length: int, max_length: int, chunk_size: int, corpus_dir: str, save_dir: str, whole_word_tokenizer: str, cased: bool) .. py:function:: single_corpus(vocab_path: str, max_length: int, chunk_size: int, corpus_path: str, save_path: str) .. py:function:: pretrain(corpus_path: str, config_path: str, log_path: str, base_ckpt_path: str, save_dir: str, record_info_path: str, batch_size: int, learning_rate: float, weight_decay: float, sequence_length: int, num_warmup_steps: int, num_train_steps: int, ckpt_save_freq: int, gradient_accumulation_steps: int, distributed: bool, distributed_strategy: str, verbose: int) .. py:function:: export_checkpoint(config_path: str, ckpt_path: str, target_path: str) .. py:function:: main()