Sunbelt Computer Software

Quickstart

1) Environment Setup

python3 -m venv .venv
source .venv/bin/activate
pip install -U pip
pip install -r requirements.txt

2) Script Overview

`run_benchmark_acceptance_lengths.sh`

Runs multiple datasets across multiple block sizes and writes acceptance-length JSON files.

Output: utils/{dataset}_0T0.0/{block_size}.json
Hidden states are disabled in this script via --no-save-hidden-states

Run:

bash run_benchmark_acceptance_lengths.sh

`run_dataset_txt.sh`

Wrapper script for dataset.py using a TXT file (one prompt per line).

Run:

bash run_dataset_txt.sh --input-file TXT/copa.txt

Example with common options:

bash run_dataset_txt.sh \
  --input-file TXT/copa.txt \
  --block-sizes "1 2 4 8 14 15 16 17 18" \
  --max-samples 32 \
  --seed 0 \
  --no-save-hidden-states

`utils/predict.py`

Loads:

utils/dataset.pt
utils/model.pt

Predicts block ids (pred + 14) and writes:

utils/{task}_0T0.0/1.json

Run:

python3 utils/predict.py

`map_selected_block_values.py`

Maps selected block ids in 1.json to actual values from corresponding block JSON files.

Example logic:

If 1.json["0"] = 18, output value becomes 18.json["0"].

Run:

python3 map_selected_block_values.py utils/gsm8k_0T0.0

Custom output filename:

python3 map_selected_block_values.py \
  utils/gsm8k_0T0.0 \
  --output-file mapped_from_1.json

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
TXT		TXT
__pycache__		__pycache__
data_analysis		data_analysis
model		model
utils		utils
.gitattributes		.gitattributes
.gitignore		.gitignore
README.md		README.md
benchmark.py		benchmark.py
benchmark_TXT.py		benchmark_TXT.py
benchmark_hidden_states.py		benchmark_hidden_states.py
configuration.json		configuration.json
dataset.py		dataset.py
distributed.py		distributed.py
map_selected_block_values.py		map_selected_block_values.py
requirements.txt		requirements.txt
run_auto_device.py		run_auto_device.py
run_benchmark.sh		run_benchmark.sh
run_benchmark_TXT.sh		run_benchmark_TXT.sh
run_benchmark_acceptance_lengths.sh		run_benchmark_acceptance_lengths.sh
run_benchmark_block_size.sh		run_benchmark_block_size.sh
run_dataset_txt.sh		run_dataset_txt.sh

Sunbelt Computer Software

PL/B Language Development and Support

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Quickstart

1) Environment Setup

2) Script Overview

`run_benchmark_acceptance_lengths.sh`

`run_dataset_txt.sh`

`utils/predict.py`

`map_selected_block_values.py`

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Sunbelt Computer Software

PL/B Language Development and Support

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

Quickstart

1) Environment Setup

2) Script Overview

run_benchmark_acceptance_lengths.sh

run_dataset_txt.sh

utils/predict.py

map_selected_block_values.py

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

`run_benchmark_acceptance_lengths.sh`

`run_dataset_txt.sh`

`utils/predict.py`

`map_selected_block_values.py`

Packages