zibajoon
/

20231109_layoutlm2_5k_20ep_Doc_NA

Document Question Answering

Generated from Trainer

Model card Files Files and versions

zibajoon commited on Nov 14, 2023

Commit

9486f73

·

1 Parent(s): 99c3486

Update README.md

Files changed (1) hide show

README.md +9 -1

README.md CHANGED Viewed

@@ -19,7 +19,15 @@ It achieves the following results on the evaluation set:
 ## Model description
-More information needed
 ## Intended uses & limitations

 ## Model description
+This DocVQA model, built on the Layout LM v2 framework, represents an initial step in a series of
+experimental models aimed at document visual question answering. It's the "medium" version in a planned series,
+trained on a mid-sized dataset of 5k samples (split between training and test) over 20 epochs.
+The training setup was modest, employing mixed precision (fp16), with manageable batch sizes and a
+focused approach to learning rate adjustment (warmup steps and weight decay). Notably, this model was
+trained without external reporting tools, emphasizing internal evaluation. As the first iteration in a
+progressive series that will later include medium (5k samples) and large (50k samples) models, this
+version serves as a foundational experiment, setting the stage for more extensive and complex models in the
+future.
 ## Intended uses & limitations