Physics of Language Models: Part 4.2 facebook/PhysicsLM4.2__LlamaCanon-8B-Nemo-1T-lr0.003 Updated Jul 29 • 1 • 5 facebook/PhysicsLM4.2__LlamaCanon-1B-Nemo-1T-lr0.002 Updated Jul 29 • 1 • 2 facebook/PhysicsLM4.2__LlamaCanon-1B-Nemo-1T-lr0.003 Updated Jul 29 • 1 • 1 facebook/PhysicsLM4.2__LlamaCanon-1B-Nemo-2T-lr0.003 Updated Jul 29 • 1 • 2
"Physics of Language Models" series Physics of Language Models: Part 1, Context-Free Grammar Paper • 2305.13673 • Published May 23, 2023 • 7 Physics of Language Models: Part 3.2, Knowledge Manipulation Paper • 2309.14402 • Published Sep 25, 2023 • 7 Physics of Language Models: Part 3.3, Knowledge Capacity Scaling Laws Paper • 2404.05405 • Published Apr 8, 2024 • 10 Physics of Language Models: Part 3.1, Knowledge Storage and Extraction Paper • 2309.14316 • Published Sep 25, 2023 • 8
Physics of Language Models: Part 1, Context-Free Grammar Paper • 2305.13673 • Published May 23, 2023 • 7
Physics of Language Models: Part 3.2, Knowledge Manipulation Paper • 2309.14402 • Published Sep 25, 2023 • 7
Physics of Language Models: Part 3.3, Knowledge Capacity Scaling Laws Paper • 2404.05405 • Published Apr 8, 2024 • 10
Physics of Language Models: Part 3.1, Knowledge Storage and Extraction Paper • 2309.14316 • Published Sep 25, 2023 • 8
Physics of Language Models: Part 4.2 facebook/PhysicsLM4.2__LlamaCanon-8B-Nemo-1T-lr0.003 Updated Jul 29 • 1 • 5 facebook/PhysicsLM4.2__LlamaCanon-1B-Nemo-1T-lr0.002 Updated Jul 29 • 1 • 2 facebook/PhysicsLM4.2__LlamaCanon-1B-Nemo-1T-lr0.003 Updated Jul 29 • 1 • 1 facebook/PhysicsLM4.2__LlamaCanon-1B-Nemo-2T-lr0.003 Updated Jul 29 • 1 • 2
"Physics of Language Models" series Physics of Language Models: Part 1, Context-Free Grammar Paper • 2305.13673 • Published May 23, 2023 • 7 Physics of Language Models: Part 3.2, Knowledge Manipulation Paper • 2309.14402 • Published Sep 25, 2023 • 7 Physics of Language Models: Part 3.3, Knowledge Capacity Scaling Laws Paper • 2404.05405 • Published Apr 8, 2024 • 10 Physics of Language Models: Part 3.1, Knowledge Storage and Extraction Paper • 2309.14316 • Published Sep 25, 2023 • 8
Physics of Language Models: Part 1, Context-Free Grammar Paper • 2305.13673 • Published May 23, 2023 • 7
Physics of Language Models: Part 3.2, Knowledge Manipulation Paper • 2309.14402 • Published Sep 25, 2023 • 7
Physics of Language Models: Part 3.3, Knowledge Capacity Scaling Laws Paper • 2404.05405 • Published Apr 8, 2024 • 10
Physics of Language Models: Part 3.1, Knowledge Storage and Extraction Paper • 2309.14316 • Published Sep 25, 2023 • 8