ENHANCING FEW-SHOT LEARNING IN LIGHTWEIGHT MODELS VIA DUAL-FACETED KNOWLEDGE DISTILLATION