Statistics for Keyphrase Identification Using Minimal Labeled Data with Hierarchical Context and Transfer Learning