https://arxiv.org/abs/2102.11005

Last update: Dec 19, 2022

Related tags

Overview

LogME

LogME: Practical Assessment of Pre-trained Models for Transfer Learning

How to use

Just feed the features f and labels y to the function, and you can get a nice score which well correlates with the transfer learning performance.

from LogME import LogME
score = LogME(f, y)

Then you can use the score to quickly select a good pre-trained model. The larger the score is, the better transfer performance you get.

Experimental results

We extensively validate the generality and superior performance of LogME on 14 pre-trained models and 17 downstream tasks, covering various pre-trained models (supervised pre-trained and unsupervised pre-trained), downstream tasks (classification and regression), and modalities (vision and language). Check the paper for all the results.

Computer vision

9 datasets and 10 pre-trained models. LogME is a reasonably good indicator for transfer performance.

NLP

7 tasks and 4 pre-trained models. LogME is a good indicator for transfer performance.

Speedup

LogME provides a dramatic speedup for assessing pre-trained models. The speedup comes from two aspects:

LogME does not need hyper-parameter tuning whereas vanilla fine-tuning requires extensive hyper-parameter tuning.
We designed a fast algorithm to further speedup the computation of LogME.

Citation

If you find it useful, please cite the following paper:

@article{you_logme:_2021,
	title = {LogME: Practical Assessment of Pre-trained Models for Transfer Learning},
	author = {You, Kaichao and Liu, Yong and Long, Mingsheng and Wang, Jianmin},
	journal = {arxiv},
	volume = {abs/2102.11005},
	year = {2021},
	url = {https://arxiv.org/abs/2102.11005},
}

Contact

If you have any question or want to use the code, please contact [email protected] .

https://arxiv.org/abs/2102.11005

Related tags

Overview

LogME

How to use

Experimental results

Computer vision

NLP

Speedup

Citation

Contact

Owner

THUML: Machine Learning Group @ THSS

Learning Synthetic Environments and Reward Networks for Reinforcement Learning

Keras Image Embeddings using Contrastive Loss

Forecasting for knowable future events using Bayesian informative priors (forecasting with judgmental-adjustment).

A 1.3B text-to-image generation model trained on 14 million image-text pairs

Machine Learning toolbox for Humans

Code for the RA-L (ICRA) 2021 paper "SeqNet: Learning Descriptors for Sequence-Based Hierarchical Place Recognition"

Pixel-wise segmentation on VOC2012 dataset using pytorch.

Tensorflow Repo for "DeepGCNs: Can GCNs Go as Deep as CNNs?"

Model serving at scale

Code for Ditto: Building Digital Twins of Articulated Objects from Interaction

Node Editor Plug for Blender

CSE-519---Project - Job Title Analysis (Project for CSE 519 - Data Science Fundamentals)

PyTorch implementation of MuseMorphose, a Transformer-based model for music style transfer.

Code for the paper "On the Power of Edge Independent Graph Models"

Official implementation of the paper Vision Transformer with Progressive Sampling, ICCV 2021.

[ACM MM 2021] TSA-Net: Tube Self-Attention Network for Action Quality Assessment

Repo for code associated with Modeling the Mitral Valve.

InterFaceGAN - Interpreting the Latent Space of GANs for Semantic Face Editing

Multi-task Learning of Order-Consistent Causal Graphs (NeuRIPs 2021)

Official implementation of "Not only Look, but also Listen: Learning Multimodal Violence Detection under Weak Supervision" ECCV2020