Releases: ThilinaRajapakse/simpletransformers
Multiprocessing Support for `QuestionAnsweringModel`
Added
- Added multiprocessing support for Question Answering tasks, giving a substantial performance boost for CPU-bound tasks (e.g. prediction, especially with long contexts)
- Added `multiprocessing_chunksize` (default 500) to `global_args` for finer control over chunking. Usually, the optimal value will be (roughly) `number of examples / process count`.
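The chunksize heuristic above can be sketched in plain Python. Note that `suggested_chunksize` is a hypothetical helper used for illustration, not part of the library:

```python
import multiprocessing

def suggested_chunksize(num_examples, process_count=None):
    """Hypothetical helper: the optimal chunksize is roughly
    number of examples / process count, with a floor of 1."""
    if process_count is None:
        process_count = multiprocessing.cpu_count()
    return max(1, num_examples // process_count)

# e.g. 4000 prediction examples split across 8 worker processes
print(suggested_chunksize(4000, 8))  # 500
```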
Fixed
- Fixed bug in `NERModel.predict()` method when `split_on_space=False`. @alexysdussier
Option to disable model saving
Added
- Added `no_save` option to model `args`. Setting this to `True` will prevent models from being saved to disk.
- Added minimal training script for `Seq2Seq` models in the examples directory.
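A minimal sketch of wiring `no_save` into a model's `args` dict. The model construction line is commented out because it downloads pretrained weights; the second key is illustrative:

```python
# from simpletransformers.classification import ClassificationModel

model_args = {
    "no_save": True,  # keep the model in memory only; nothing is written to disk
    "overwrite_output_dir": True,
}

# model = ClassificationModel("bert", "bert-base-cased", args=model_args)
```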
Docs update and Bug fixes
Fixed
- Fixed potential bugs in loading weights when fine-tuning an ELECTRA language model. Fine-tuning an ELECTRA language model now requires both `model_name` and `model_type` to be set to `electra`.
- Bug fix for generic `Seq2SeqModel`
- Bug fix when training language models from scratch
Changed
- Updated `Seq2SeqModel` to use `MarianTokenizer` with MarianMT models. @flozi00
Sequence-to-Sequence task support added
Added
- Sequence-to-Sequence task support added. This includes the following models:
- BART
- Marian
- Generic Encoder-Decoder
- The `args` dict of a task-specific Simple Transformers model is now saved along with the model. When loading the model, these values will be read and used. Any new `args` passed into the model initialization will override the loaded values.
Improvements and Bug Fixes
Added
- Support for `AutoModel` in NER, QA, and Language Modeling. @flozi00
Fixed
- The `NERModel.predict()` method now also returns `model_outputs`: a Python list of lists, with dicts mapping each word to its raw model output. @flaviussn
- Fixed T5 `lm_labels` not being masked properly
- Fixed issue with custom evaluation metrics not being handled correctly in `MultiLabelClassificationModel`. @galtay
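The nested shape of the new NER `model_outputs` return value can be illustrated as follows; the words and scores here are made up for the sake of the example:

```python
# One inner list per input sequence; each dict maps a word to the
# raw model output (one score per label) for that word.
model_outputs = [
    [
        {"Simple": [0.1, 0.9, 0.0]},
        {"Transformers": [0.8, 0.1, 0.1]},
    ],
]

print(len(model_outputs[0]))  # 2 words in the first (and only) sequence
```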
Changed
- Torchvision import is now optional. It only needs to be installed if MultiModal models are used.
- Pillow import is now optional. It only needs to be installed if MultiModal models are used.
T5 Model Added
Added
- Added support for T5 Model.
- Added `do_sample` arg to language generation.
- `NERModel.predict()` now accepts an optional `split_on_space` argument. If set to `False`, `to_predict` must be a list of lists, with each inner list being a list of strings consisting of the split sequences. The outer list is the list of sequences to predict on.
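The `split_on_space=False` input format described above, sketched with made-up sequences. The `model.predict` call is commented out since it requires a trained `NERModel`:

```python
# Each sequence is pre-split into its tokens, so to_predict is a
# list of lists of strings rather than a list of strings.
to_predict = [
    ["Simple", "Transformers", "supports", "NER"],
    ["ELECTRA", "models", "were", "added", "recently"],
]

# predictions, raw_outputs = model.predict(to_predict, split_on_space=False)
```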
Changed
- `eval_df` argument in `NERModel.train_model()` renamed to `eval_data` to better reflect the input format. Added a deprecation warning.
ELECTRA model support for Classification and Question Answering tasks
Added
- Added Electra model support for sequence classification (binary, multiclass, multilabel)
- Added Electra model support for question answering
- Added Roberta model support for question answering
Changed
- Reduced logger messages during question answering evaluation
Language Generation
Language Generation is now supported!
Supported model types:
- GPT-2
- CTRL
- OpenAI-GPT
- XLNet
- Transformer-XL
- XLM
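A minimal sketch of configuring language generation with one of the supported model types. The construction and generation calls are commented out since they download pretrained weights, and the arg names follow the `do_sample` addition noted in a later release:

```python
# from simpletransformers.language_generation import LanguageGenerationModel

generation_args = {
    "do_sample": True,  # sample from the distribution instead of greedy decoding
    "max_length": 50,
}

# model = LanguageGenerationModel("gpt2", "gpt2", args=generation_args)
# model.generate("Despite recent progress in NLP,")
```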
Custom Metrics for Question Answering
Added
- Added support for custom metrics with `QuestionAnsweringModel`.
Fixed
- Fixed issue with passing proxies to ConvAI models. @Pradhy729
Easier configuration for models and support for getting hidden layer outputs
Added
- Added option to get hidden layer outputs and embedding outputs with the `ClassificationModel.predict()` method.
- Setting `config: {"output_hidden_states": True}` will automatically return all embedding outputs and hidden layer outputs.
Changed
- `global_args` now has a `config` dictionary which can be used to override default values in the config class.
- This can be used with `ClassificationModel`, `MultiLabelClassificationModel`, `NERModel`, `QuestionAnsweringModel`, and `LanguageModelingModel`
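Putting the two pieces together, a sketch of overriding config values through a model's `args` dict. Model construction is commented out since it downloads pretrained weights:

```python
# from simpletransformers.classification import ClassificationModel

model_args = {
    # entries here are forwarded to the underlying Hugging Face config class
    "config": {"output_hidden_states": True},
}

# model = ClassificationModel("roberta", "roberta-base", args=model_args)
# model.predict(...) will now also return embedding outputs and
# hidden layer outputs alongside the usual predictions.
```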