
Why expect Z in Adapter? #8

Open
niedakh opened this issue Jun 25, 2024 · 1 comment
niedakh commented Jun 25, 2024

The Adapter class expects Z in its constructor:

import torch
import transformers

class Adapter(transformers.PreTrainedModel):
    config_class = transformers.PretrainedConfig

    def __init__(self, config, classifiers=None, Z=None, labels_list=[]):
        super().__init__(config)
        # Shared task-embedding matrix: one row per task
        self.Z = torch.nn.Embedding(
            len(config.classifiers_size), config.hidden_size, max_norm=1.0
        ).weight if Z is None else Z
        # One classification head per task
        self.classifiers = torch.nn.ModuleList(
            [torch.nn.Linear(config.hidden_size, size) for size in config.classifiers_size]
        ) if classifiers is None else classifiers
        self.config = self.config.from_dict(
            {**self.config.to_dict(), 'labels_list': labels_list}
        )

    def adapt_model_to_task(self, model, task_name):
        task_index = self.config.tasks.index(task_name)
        # setattr(model, search_module(model, 'linear', mode='class')[-1], self.classifiers[task_index])
        model.classifier = self.classifiers[task_index]
        return model

    def _init_weights(*args):
        pass

but doesn't use it at all when adapting the model to a task?

sileod (Owner) commented Jun 25, 2024

Hi, great question

It is used here:

if adapt_task_embedding:

That said, it would be cleaner to handle it in adapt_model_to_task; I'll try to do that for the next release.

The general idea is to have a shared encoder, one classifier per task (unless some tasks share all their labels), and one task embedding per task.
The task embedding is randomly dropped at a 10% rate so that the model also learns to work without it, but it lets the model "see" which task it should perform, and that improves results, so it is best to add it alongside the classifier. It is actually the core of the Adapter.
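For illustration, here is a minimal sketch of what moving the task-embedding step into adapt_model_to_task might look like. The injection point (adding the task's row of Z to the input embeddings via a forward hook) and the add_task_embedding helper are assumptions for this sketch, not the library's actual implementation:

    import torch

    def adapt_model_to_task(self, model, task_name):
        # Hypothetical variant: attach both the task classifier and
        # the task embedding (one row of self.Z) to the model.
        task_index = self.config.tasks.index(task_name)
        model.classifier = self.classifiers[task_index]

        task_embedding = self.Z[task_index]
        embeddings = model.get_input_embeddings()

        def add_task_embedding(module, inputs, output):
            # Randomly drop the task embedding ~10% of the time during
            # training so the model also learns to work without it.
            if module.training and torch.rand(1).item() < 0.1:
                return output
            # Broadcast the (hidden_size,) embedding over (batch, seq, hidden)
            return output + task_embedding

        embeddings.register_forward_hook(add_task_embedding)
        return model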
