prithivMLmods committed on
Commit 1b9ed7b · verified · 1 Parent(s): 3e04b20

Update README.md

Files changed (1)
  1. README.md +96 -1
README.md CHANGED
@@ -9,4 +9,99 @@ library_name: transformers
  tags:
  - Non-Reasoning
  - text-generation-inference
- ---
+ datasets:
+ - prithivMLmods/Nemotron-Safety-30K
+ ---
+
+ ![89.png](https://cdn-uploads.huggingface.co/production/uploads/65bb837dbfb878f46c77de4c/HVL6VpZK94O7cHg-u66ao.png)
+
+ # **Computron-Bots-1.7B-R1**
+
+ > **Computron-Bots-1.7B-R1** is a **general-purpose safe question-answering model** fine-tuned from **Qwen3-1.7B**, specifically designed for **direct and efficient factual responses** without complex reasoning chains. It provides straightforward, accurate answers across diverse topics, making it ideal for knowledge retrieval, information systems, and applications requiring quick, reliable responses.
+
+ > \[!note]
+ > GGUF: [https://huggingface.co/prithivMLmods/Computron-Bots-1.7B-R1-GGUF](https://huggingface.co/prithivMLmods/Computron-Bots-1.7B-R1-GGUF)
+
+
+ ## **Key Features**
+ 1. **Direct Question Answering Excellence**
+ Trained to give clear, concise, and accurate answers to factual questions across a wide range of topics, without unnecessary elaboration or complex reasoning steps.
+
+ 2. **General-Purpose Knowledge Base**
+ Handles diverse question types, including factual queries, definitions, explanations, and general knowledge questions, with consistent reliability.
+
+ 3. **Efficient Non-Reasoning Architecture**
+ Optimized for fast, direct responses without step-by-step reasoning, which suits applications that need immediate answers and high throughput.
+
+ 4. **Compact yet Knowledgeable**
+ Despite its 1.7B parameters, it delivers strong factual accuracy and knowledge retrieval with low computational overhead.
+
+ ## **Quickstart with Transformers**
+ ```python
+ from transformers import AutoModelForCausalLM, AutoTokenizer
+
+ model_name = "prithivMLmods/Computron-Bots-1.7B-R1"
+
+ model = AutoModelForCausalLM.from_pretrained(
+     model_name,
+     torch_dtype="auto",  # use the dtype stored in the checkpoint
+     device_map="auto"  # place weights on the available device(s)
+ )
+ tokenizer = AutoTokenizer.from_pretrained(model_name)
+
+ prompt = "What is the capital of France?"
+
+ messages = [
+     {"role": "system", "content": "You are a knowledgeable assistant that provides direct, accurate answers to questions."},
+     {"role": "user", "content": prompt}
+ ]
+
+ text = tokenizer.apply_chat_template(
+     messages,
+     tokenize=False,
+     add_generation_prompt=True  # append the assistant turn marker
+ )
+
+ model_inputs = tokenizer([text], return_tensors="pt").to(model.device)
+
+ generated_ids = model.generate(
+     **model_inputs,
+     max_new_tokens=256,
+     temperature=0.7,
+     do_sample=True
+ )
+
+ generated_ids = [  # strip the prompt tokens, keeping only the new ones
+     output_ids[len(input_ids):] for input_ids, output_ids in zip(model_inputs.input_ids, generated_ids)
+ ]
+
+ response = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0]
+ print(response)
+ ```
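
The Quickstart steps above can be wrapped in a small reusable helper, which may be convenient when many questions are answered with one loaded model. This is a minimal sketch and not part of the committed README: `build_messages` and `answer` are hypothetical names introduced here for illustration, and the sampling settings simply mirror the Quickstart rather than reflecting tuned values.

```python
# Hypothetical helpers (not from the original README) that wrap the Quickstart.
DEFAULT_SYSTEM_PROMPT = (
    "You are a knowledgeable assistant that provides direct, "
    "accurate answers to questions."
)


def build_messages(question, system_prompt=DEFAULT_SYSTEM_PROMPT):
    """Return the chat messages for a single question."""
    return [
        {"role": "system", "content": system_prompt},
        {"role": "user", "content": question.strip()},
    ]


def answer(question, model, tokenizer, max_new_tokens=256):
    """Generate one answer with an already loaded model and tokenizer."""
    text = tokenizer.apply_chat_template(
        build_messages(question),
        tokenize=False,
        add_generation_prompt=True,
    )
    inputs = tokenizer([text], return_tensors="pt").to(model.device)
    output_ids = model.generate(
        **inputs,
        max_new_tokens=max_new_tokens,
        temperature=0.7,  # same sampling settings as the Quickstart
        do_sample=True,
    )
    # Drop the prompt tokens so only the newly generated answer is decoded.
    new_tokens = output_ids[0][inputs.input_ids.shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True).strip()
```

With `model` and `tokenizer` loaded as in the Quickstart, a call would look like `answer("What is the capital of France?", model, tokenizer)`. Because sampling is enabled, the returned text can vary between calls.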
+
+ ## **Intended Use**
+ - **Knowledge Base Systems**: Quick factual retrieval for databases and information systems.
+ - **Educational Tools**: Direct answers for students and learners seeking factual information.
+ - **Customer Support Bots**: Efficient responses to common questions and inquiries.
+ - **Search Enhancement**: Improving search results with direct, relevant answers.
+ - **API Integration**: Lightweight question-answering service for applications and websites.
+ - **Research Assistance**: Quick fact-checking and information gathering for researchers.
+
+ ## **Limitations**
+ 1. **Non-Reasoning Architecture**:
+ Designed for direct answers rather than complex reasoning, problem-solving, or multi-step analysis.
+
+ 2. **Limited Creative Tasks**:
+ Not optimized for creative writing, storytelling, or other tasks requiring imagination and artistic expression.
+
+ 3. **Context Dependency**:
+ May struggle with questions that require extensive context or a nuanced understanding of complex scenarios.
+
+ 4. **Parameter Scale Constraints**:
+ The 1.7B parameter size may limit performance in highly specialized or technical domains compared to larger models.
+
+ 5. **Base Model Limitations**:
+ Inherits any limitations of Qwen3-1.7B's training data and may reflect biases present in the base model.
+
+ 6. **Conversational Depth**:
+ Although well suited to Q&A, it may not provide the depth of engagement expected in extended conversations.