Browse
Showing repositories 76-90 of 161. Each entry lists the repository name, whether its latest ModelKit is signed, and the contents shown on its card (Model, Datasets, Codebases, Docs, and Configuration tabs), with counts where the listing displays them.
- mmlu-dataset: Unsigned latest ModelKit; 1 model, 2 datasets
- mixtral-8x7b: Signed latest ModelKit; 6 datasets
- mixtral-8x22b: Unsigned latest ModelKit
- mistral_v0.3-7b: Signed latest ModelKit; 7 datasets
- mistral_v0.1-7b: Signed latest ModelKit; 6 datasets
- mistral-7b: Unsigned latest ModelKit; 6 datasets
- microsoft_phi-4: Unsigned latest ModelKit
  phi-4 is a state-of-the-art open model built upon a blend of synthetic datasets, data from filtered public domain websites, and acquired academic books and Q&A datasets. The goal of this approach was to ensure that small, capable models were trained with data focused on high quality and advanced reasoning. phi-4 underwent a rigorous enhancement and alignment process, incorporating both supervised fine-tuning and direct preference optimization to ensure precise instruction adherence and robust safety measures.
- microsoft-phi-2: Unsigned latest ModelKit (a loading sketch follows the list below)
  Phi-2 is a Transformer with 2.7 billion parameters. It was trained using the same data sources as Phi-1.5, augmented with a new data source that consists of various NLP synthetic texts and filtered websites (for safety and educational value). When assessed against benchmarks testing common sense, language understanding, and logical reasoning, Phi-2 showcased nearly state-of-the-art performance among models with fewer than 13 billion parameters.
- microsoft_phi-2: Unsigned latest ModelKit; same description as microsoft-phi-2 above
- medical-qa-shared-task-v1-toy: Unsigned latest ModelKit; 1 model
- llm_repo: Unsigned latest ModelKit
- llm_repo: Unsigned latest ModelKit; 1 dataset
- llm_local: Unsigned latest ModelKit
- llama3-githubactions: Unsigned latest ModelKit; 3 datasets
- llama3-githubactions: Unsigned latest ModelKit; 3 datasets
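The two Phi-2 entries above describe a 2.7-billion-parameter causal language model. As a rough illustration of how such a model is typically loaded and prompted, here is a minimal Python sketch using the Hugging Face transformers library. It assumes the packaged weights correspond to the publicly released microsoft/phi-2 checkpoint; the model ID, dtype, and generation settings are illustrative assumptions rather than details taken from this listing.

```python
# Minimal sketch: load and prompt a Phi-2-style model via Hugging Face transformers.
# Assumption: the weights match the public "microsoft/phi-2" checkpoint; the
# ModelKits in this listing may package the model differently.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "microsoft/phi-2"  # assumed public checkpoint, not this registry's artifact
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.float16,  # convenient default for a single GPU; use float32 on CPU
    device_map="auto",          # requires the `accelerate` package
)

prompt = "Explain, in one sentence, what a ModelKit packages."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```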