LoRA for Sentiment Analysis#

📗 You can find an interactive Colab version of this tutorial here.

Low Rank Adaptation (LoRA) is a technique used to modify and fine tune large language models in a more efficient way. Rather than modifying all of the model weights, LoRAs find two low dimensional matrices that have the lowest rank. It then multiplies the two matrices to find the fine tuned weight matrix. This fine tuned weight matrix will be the same size as the original pre trained weight matrix. Once the fine tuned matrix has been found it can then be applied to the model’s layers.

TRAIN FIGURE

Fine tuning with a LoRA is a part of the Parameter Efficient Fine Tuning (PEFT) family because it keeps the original model unchanged and introduces a small number of layers or parameters instead. Once the fine tuned matrix has been calculated, it is applied to the last Multilayer Perceptron (MLP) layer of the model. Once the LoRA has been applied, the model is fine tuned based on a knowledge base or domain specific dataset.

TEST FIGURE

Setup#

Make sure you have obtained your NDIF API key and configured your workspace for remote execution.

The following packages need to be installed for this tutorial:

!pip install nnsight
!pip install pyarrow==15.0.2
!pip install datasets
!pip install datasets torch
[ ]:
from IPython.display import clear_output
from nnsight import CONFIG

CONFIG.set_default_api_key('YOUR API KEY HERE')

!huggingface-cli login --token YOUR_HF_TOKEN_HERE # <- Copy your hugging face token here
clear_output()
The token has not been saved to the git credentials helper. Pass `add_to_git_credential=True` in this function directly or `--add-to-git-credential` if using via `huggingface-cli` if you want to set the git credential as well.
Token is valid (permission: read).
Your token has been saved to /root/.cache/huggingface/token
Login successful

Here are the imports needed for this tutorial.

[ ]:
import torch
import torch.nn as nn
import pandas as pd
from nnsight import LanguageModel
from transformers import AutoModelForSequenceClassification, AutoTokenizer, AutoModelForCausalLM
from transformers import TrainingArguments, Trainer
from torch.utils.data import DataLoader, Subset
from datasets import load_dataset

Prepare Data#

For this tutorial we will be using the The Stanford Sentiment Treebank (SST2). It consists of sentences from movie reviews and human annotations of their sentiment. The task is to predict the sentiment of a given sentence as being either positive or negative. In the dataset, the positive/negative labels of each phrase are represented by a 0 for each negative statement and a 1 for each positive statement.

[ ]:
# GLUE is a standard Natural Language Processing (NLP) benchmark which is commonly used for sentiment analysis tasks.
# It is responisble for assessing the effectiveness of language models across various NLP tasks.
# It serves as a standard for evaluating a model's ability to understand and process language.
dataset = load_dataset("glue", "sst2")

# 0 = neg, 1 = pos
def label_to_str(example):
    example['label'] = 'positive' if example['label'] == 1 else 'negative'
    return example

train_data = [(dataset['sentence'], 'positive' if dataset['label'] == 1 else 'negative') for dataset in dataset['train']]
validation_data = [(dataset['sentence'], 'positive' if dataset['label'] == 1 else 'negative') for dataset in dataset['validation']]
/usr/local/lib/python3.10/dist-packages/huggingface_hub/utils/_token.py:89: UserWarning:
The secret `HF_TOKEN` does not exist in your Colab secrets.
To authenticate with the Hugging Face Hub, create a token in your settings tab (https://huggingface.co/settings/tokens), set it as secret in your Google Colab and restart your session.
You will be able to reuse this secret in all of your notebooks.
Please note that authentication is recommended but still optional to access public models or datasets.
  warnings.warn(

Next, we need to tokenize our data. Tokenizing involves converting text into a numerical representation. It is a popular technique in NLP because it helps the models better understand the text and output a more accurate result.

[ ]:
tokenizer = AutoTokenizer.from_pretrained('openai-community/gpt2', add_prefix_space=True)
tokenizer.pad_token = tokenizer.eos_token

# Uses the tokenizer from the model to tokenize a given sentence with padding and truncation
def tokenize_function(text):
  return tokenizer(text['sentence'], padding='max_length', truncation=True, max_length=10, return_tensors='pt')

# We use .map() in order to apply the tokenization function to all the training data.
#tokenized_train = map(tokenize_function, train_data)
tokenized_train_dataset = dataset['train'].map(tokenize_function, batched=True, batch_size=10)
tokenized_train_dataset = tokenized_train_dataset.map(lambda x: {'input_ids': x['input_ids'], 'attention_mask': x['attention_mask'], 'labels': x['label']})

Prepare our Model#

For this tutorial we will be using the Llama-70B language model.

[ ]:
# Use the LanguageModel wrapper class to load in the Llama model
model_name = "meta-llama/Meta-Llama-3.1-70B"
model = LanguageModel(model_name, device_map='auto')

This is the model architechure before the LoRA has been applied. After the model has been fine tuned with the LoRA, the last MLP layer of the model will be replaced with the LoRA.

We’re going to train a very simple LORA that, when applied, will make our model determine whether a sentence is displaying a positive sentiment or a negative sentiment.

[ ]:
from nnsight.envoy import Envoy

# We will define a LORA class.
# The LORA class call method operations are simply traced like you would normally do in a .trace.
class LORA(nn.Module):
    def __init__(self, module: Envoy, dim: int, r: int) -> None:
        """Init.

        Args:
            module (Envoy): Which model Module we are adding the LORA to.
            dim (int): Dimension of the layer we are adding to (This could potentially be auto populated if the user scanned first so we know the shape)
            r (int): Inner dimension of the LORA
        """
        super(LORA, self).__init__()
        self.r = r
        self.module = module
        self.WA = torch.nn.Parameter(torch.randn(dim, self.r), requires_grad=True).save()
        self.WB = torch.nn.Parameter(torch.zeros(self.r, dim), requires_grad=True).save()

    # The Call method defines how to actually apply the LORA.
    # happens after the forward pass
    def __call__(self, alpha: float = 1.0):
        """Call.

        Args:
            alpha (float, optional): How much to apply the LORA. Can be altered after training for inference. Defaults to 1.0.
        """

        # We apply WA to the first positional arg (the hidden states)
        A_x = torch.matmul(self.module.input, self.WA)
        BA_x = torch.matmul(A_x, self.WB)

        # LORA is additive
        h = BA_x + self.module.output

        # Replace the output with our new one * alpha
        # Could also have been self.module.output[:] = h * alpha, for in-place
        self.module.output = h * alpha

    def parameters(self):
        # Some way to get all the parameters.
        return [self.WA, self.WB]

LLM Fine Tuning#

[ ]:
# Inner LORA dimension
lora_dim = 4

# Module to train LORA on
# Accesses the last mlp layer of the model
module = model.model.layers[-1].mlp

We can use the .scan() method to get the shape of the module without having to fully run the model.

[ ]:
with model.scan(" "):
    dim = module.output.shape[-1]

print(dim)
Starting from v4.46, the `logits` model output will have the same type as the model (except at train time, where it will always be FP32)
8192
[ ]:
# The LORA object itself isn't transmitted to the server. Only the forward / call method.
# The parameters are created remotely and never sent only retrieved
with model.session(remote=True) as session:

    dataset = tokenized_train_dataset

    # Smaller chunks to run faster, feel free to increase
    indices = list(range(0, 5000))
    subset = Subset(dataset, indices)


    # Create a dataloader from it.
    dataloader = DataLoader(subset, batch_size=10)

    # Create our LORA on the last mlp and apply it to the model
    lora = LORA(module, dim, lora_dim)

    # Create an optimizer. Use the parameters from LORA
    optimizer = torch.optim.AdamW(lora.parameters(), lr=3)

    # Iterate over dataloader using .iter.
    with session.iter(dataloader, return_context=True) as (batch, iterator):

        # Accesses the phrase that contains either a positive/negative sentiment
        prompt = batch['sentence']

        # Determines whether the phrase is positive/negative
        correct_token = batch['label']


        # Run .trace with prompt
        with model.trace(prompt) as tracer:


            # Apply LORA to intervention graph just by calling it with .trace
            # This is invoke the __call__() method of the LORA class defined above
            lora()


            # Get logits
            # Logits are the output of the neural network before the
            # activation function has been applied.
            logits = model.lm_head.output


            # Do cross entropy on last predicted token and correct_token
            loss = torch.nn.functional.cross_entropy(logits[:, -1], batch['label'])

            # Call backward
            loss.backward()


        # Call methods on optimizer. Graphs that arent from .trace (so in this case session and iterator both have their own graph) are executed sequentially.
        # The Graph of Iterator here will be:
        # 1.) Index batch at 0 for prompt
        # 2.) Index batch at 1 for correct_token
        # 3.) Execute the .trace using the prompt
        # 4.) Call .step() on optimizer
        optimizer.step()
        # 5.) Call .zero_grad() in optimizer
        optimizer.zero_grad()
        # 6.) Print out the lora WA weights to show they are indeed changing
        iterator.log(lora.WA)


Streaming output truncated to the last 5000 lines.
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.4609,  0.8828,  0.3320,  0.0106],
        [ 3.3281, -0.1050,  1.3281,  2.9062],
        [ 1.9844, -0.1611,  0.3496,  1.0938],
        ...,
        [-1.7109,  0.3262, -1.0625, -2.0469],
        [ 1.5391,  0.9219,  0.8750,  1.9531],
        [ 2.0156,  1.1953,  1.9453,  2.1562]], requires_grad=True)
2024-10-08 15:05:50,269 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.4688,  0.9023,  0.3398,  0.0243],
        [ 3.2969, -0.0067,  1.3750,  2.9219],
        [ 1.9453, -0.1621,  0.3398,  1.0781],
        ...,
        [-1.6562,  0.2432, -1.1016, -2.0469],
        [ 1.4844,  0.9180,  0.8398,  1.9453],
        [ 1.9766,  1.0859,  1.8203,  2.1406]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.4688,  0.9023,  0.3398,  0.0243],
        [ 3.2969, -0.0067,  1.3750,  2.9219],
        [ 1.9453, -0.1621,  0.3398,  1.0781],
        ...,
        [-1.6562,  0.2432, -1.1016, -2.0469],
        [ 1.4844,  0.9180,  0.8398,  1.9453],
        [ 1.9766,  1.0859,  1.8203,  2.1406]], requires_grad=True)
2024-10-08 15:05:50,423 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.4766,  0.9258,  0.3555,  0.0378],
        [ 3.2500,  0.0400,  1.3750,  2.9219],
        [ 1.9062, -0.1650,  0.3281,  1.0625],
        ...,
        [-1.6094,  0.1748, -1.1250, -2.0312],
        [ 1.4297,  0.9180,  0.8164,  1.9375],
        [ 1.9375,  0.9531,  1.6719,  2.1094]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.4766,  0.9258,  0.3555,  0.0378],
        [ 3.2500,  0.0400,  1.3750,  2.9219],
        [ 1.9062, -0.1650,  0.3281,  1.0625],
        ...,
        [-1.6094,  0.1748, -1.1250, -2.0312],
        [ 1.4297,  0.9180,  0.8164,  1.9375],
        [ 1.9375,  0.9531,  1.6719,  2.1094]], requires_grad=True)
2024-10-08 15:05:50,687 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.4688,  0.9297,  0.3535,  0.0493],
        [ 3.2188,  0.0359,  1.3203,  2.9062],
        [ 1.8828, -0.1895,  0.2949,  1.0469],
        ...,
        [-1.5859,  0.1699, -1.0859, -2.0156],
        [ 1.4062,  0.8555,  0.7383,  1.9297],
        [ 1.9219,  0.7812,  1.4922,  2.0781]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.4688,  0.9297,  0.3535,  0.0493],
        [ 3.2188,  0.0359,  1.3203,  2.9062],
        [ 1.8828, -0.1895,  0.2949,  1.0469],
        ...,
        [-1.5859,  0.1699, -1.0859, -2.0156],
        [ 1.4062,  0.8555,  0.7383,  1.9297],
        [ 1.9219,  0.7812,  1.4922,  2.0781]], requires_grad=True)
2024-10-08 15:05:50,951 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.4609,  0.9336,  0.3516,  0.0598],
        [ 3.1719,  0.0742,  1.3125,  2.8750],
        [ 1.8516, -0.2021,  0.2715,  1.0234],
        ...,
        [-1.5547,  0.1484, -1.0703, -2.0000],
        [ 1.3828,  0.8008,  0.6680,  1.9141],
        [ 1.8984,  0.6836,  1.3750,  2.0469]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.4609,  0.9336,  0.3516,  0.0598],
        [ 3.1719,  0.0742,  1.3125,  2.8750],
        [ 1.8516, -0.2021,  0.2715,  1.0234],
        ...,
        [-1.5547,  0.1484, -1.0703, -2.0000],
        [ 1.3828,  0.8008,  0.6680,  1.9141],
        [ 1.8984,  0.6836,  1.3750,  2.0469]], requires_grad=True)
2024-10-08 15:05:51,220 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.4609,  0.9492,  0.3652,  0.0708],
        [ 3.1094,  0.1582,  1.3594,  2.8438],
        [ 1.8203, -0.2051,  0.2617,  1.0078],
        ...,
        [-1.4766,  0.0583, -1.1250, -1.9688],
        [ 1.3281,  0.8281,  0.6797,  1.8984],
        [ 1.8750,  0.6328,  1.3125,  2.0312]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.4609,  0.9492,  0.3652,  0.0708],
        [ 3.1094,  0.1582,  1.3594,  2.8438],
        [ 1.8203, -0.2051,  0.2617,  1.0078],
        ...,
        [-1.4766,  0.0583, -1.1250, -1.9688],
        [ 1.3281,  0.8281,  0.6797,  1.8984],
        [ 1.8750,  0.6328,  1.3125,  2.0312]], requires_grad=True)
2024-10-08 15:05:51,478 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.4453,  0.8867,  0.3008,  0.0605],
        [ 3.0156,  0.3359,  1.5078,  2.8281],
        [ 1.7734, -0.1865,  0.2695,  0.9883],
        ...,
        [-1.3594, -0.2021, -1.3672, -1.9609],
        [ 1.2500,  0.9844,  0.8164,  1.8984],
        [ 1.8438,  0.6016,  1.2656,  2.0156]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.4453,  0.8867,  0.3008,  0.0605],
        [ 3.0156,  0.3359,  1.5078,  2.8281],
        [ 1.7734, -0.1865,  0.2695,  0.9883],
        ...,
        [-1.3594, -0.2021, -1.3672, -1.9609],
        [ 1.2500,  0.9844,  0.8164,  1.8984],
        [ 1.8438,  0.6016,  1.2656,  2.0156]], requires_grad=True)
2024-10-08 15:05:51,732 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.4219,  0.8633,  0.2773,  0.0598],
        [ 2.9219,  0.5039,  1.6406,  2.8125],
        [ 1.7266, -0.1934,  0.2559,  0.9648],
        ...,
        [-1.2734, -0.3379, -1.4844, -1.9375],
        [ 1.1875,  1.0312,  0.8594,  1.8906],
        [ 1.8125,  0.5078,  1.1719,  1.9844]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.4219,  0.8633,  0.2773,  0.0598],
        [ 2.9219,  0.5039,  1.6406,  2.8125],
        [ 1.7266, -0.1934,  0.2559,  0.9648],
        ...,
        [-1.2734, -0.3379, -1.4844, -1.9375],
        [ 1.1875,  1.0312,  0.8594,  1.8906],
        [ 1.8125,  0.5078,  1.1719,  1.9844]], requires_grad=True)
2024-10-08 15:05:51,890 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.4062,  0.8672,  0.2793,  0.0610],
        [ 2.8438,  0.5938,  1.7109,  2.7969],
        [ 1.6875, -0.2178,  0.2256,  0.9453],
        ...,
        [-1.1953, -0.4453, -1.5703, -1.9219],
        [ 1.1250,  1.0234,  0.8516,  1.8750],
        [ 1.7891,  0.3496,  1.0312,  1.9531]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.4062,  0.8672,  0.2793,  0.0610],
        [ 2.8438,  0.5938,  1.7109,  2.7969],
        [ 1.6875, -0.2178,  0.2256,  0.9453],
        ...,
        [-1.1953, -0.4453, -1.5703, -1.9219],
        [ 1.1250,  1.0234,  0.8516,  1.8750],
        [ 1.7891,  0.3496,  1.0312,  1.9531]], requires_grad=True)
2024-10-08 15:05:52,142 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.3750,  0.8789,  0.2891,  0.0698],
        [ 2.7812,  0.5703,  1.6719,  2.7656],
        [ 1.6562, -0.2637,  0.1787,  0.9219],
        ...,
        [-1.1172, -0.5195, -1.6250, -1.8984],
        [ 1.0859,  0.9453,  0.7930,  1.8516],
        [ 1.7734,  0.0913,  0.8164,  1.9062]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.3750,  0.8789,  0.2891,  0.0698],
        [ 2.7812,  0.5703,  1.6719,  2.7656],
        [ 1.6562, -0.2637,  0.1787,  0.9219],
        ...,
        [-1.1172, -0.5195, -1.6250, -1.8984],
        [ 1.0859,  0.9453,  0.7930,  1.8516],
        [ 1.7734,  0.0913,  0.8164,  1.9062]], requires_grad=True)
2024-10-08 15:05:52,303 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.3438,  0.8711,  0.2871,  0.0762],
        [ 2.7031,  0.6953,  1.7734,  2.7500],
        [ 1.6172, -0.2695,  0.1660,  0.9023],
        ...,
        [-1.0469, -0.6094, -1.6953, -1.8750],
        [ 1.0312,  0.9453,  0.7969,  1.8281],
        [ 1.7500, -0.0811,  0.6680,  1.8750]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.3438,  0.8711,  0.2871,  0.0762],
        [ 2.7031,  0.6953,  1.7734,  2.7500],
        [ 1.6172, -0.2695,  0.1660,  0.9023],
        ...,
        [-1.0469, -0.6094, -1.6953, -1.8750],
        [ 1.0312,  0.9453,  0.7969,  1.8281],
        [ 1.7500, -0.0811,  0.6680,  1.8750]], requires_grad=True)
2024-10-08 15:05:52,456 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.2734,  0.8281,  0.2578,  0.0884],
        [ 2.5938,  0.8555,  1.8984,  2.7188],
        [ 1.5703, -0.2559,  0.1699,  0.8828],
        ...,
        [-0.9414, -0.7656, -1.8125, -1.8438],
        [ 0.9492,  1.0078,  0.8477,  1.7969],
        [ 1.7109, -0.1777,  0.5781,  1.8438]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.2734,  0.8281,  0.2578,  0.0884],
        [ 2.5938,  0.8555,  1.8984,  2.7188],
        [ 1.5703, -0.2559,  0.1699,  0.8828],
        ...,
        [-0.9414, -0.7656, -1.8125, -1.8438],
        [ 0.9492,  1.0078,  0.8477,  1.7969],
        [ 1.7109, -0.1777,  0.5781,  1.8438]], requires_grad=True)
2024-10-08 15:05:52,614 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.2031,  0.7539,  0.2012,  0.0972],
        [ 2.5156,  0.9805,  2.0000,  2.6875],
        [ 1.5234, -0.2432,  0.1729,  0.8633],
        ...,
        [-0.8555, -0.9102, -1.9219, -1.8125],
        [ 0.9062,  1.0391,  0.8828,  1.7812],
        [ 1.6797, -0.2500,  0.5078,  1.8125]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.2031,  0.7539,  0.2012,  0.0972],
        [ 2.5156,  0.9805,  2.0000,  2.6875],
        [ 1.5234, -0.2432,  0.1729,  0.8633],
        ...,
        [-0.8555, -0.9102, -1.9219, -1.8125],
        [ 0.9062,  1.0391,  0.8828,  1.7812],
        [ 1.6797, -0.2500,  0.5078,  1.8125]], requires_grad=True)
2024-10-08 15:05:52,870 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.1328,  0.6836,  0.1533,  0.1050],
        [ 2.4375,  1.1016,  2.0938,  2.6562],
        [ 1.4766, -0.2295,  0.1768,  0.8438],
        ...,
        [-0.7773, -1.0469, -2.0312, -1.7812],
        [ 0.8672,  1.0703,  0.9102,  1.7578],
        [ 1.6562, -0.2988,  0.4531,  1.7812]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.1328,  0.6836,  0.1533,  0.1050],
        [ 2.4375,  1.1016,  2.0938,  2.6562],
        [ 1.4766, -0.2295,  0.1768,  0.8438],
        ...,
        [-0.7773, -1.0469, -2.0312, -1.7812],
        [ 0.8672,  1.0703,  0.9102,  1.7578],
        [ 1.6562, -0.2988,  0.4531,  1.7812]], requires_grad=True)
2024-10-08 15:05:53,031 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.0703,  0.6406,  0.1250,  0.1152],
        [ 2.3594,  1.1094,  2.0781,  2.6094],
        [ 1.4375, -0.2305,  0.1689,  0.8281],
        ...,
        [-0.7383, -1.1094, -2.0625, -1.7578],
        [ 0.8398,  1.0234,  0.8750,  1.7266],
        [ 1.6250, -0.4004,  0.3594,  1.7422]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.0703,  0.6406,  0.1250,  0.1152],
        [ 2.3594,  1.1094,  2.0781,  2.6094],
        [ 1.4375, -0.2305,  0.1689,  0.8281],
        ...,
        [-0.7383, -1.1094, -2.0625, -1.7578],
        [ 0.8398,  1.0234,  0.8750,  1.7266],
        [ 1.6250, -0.4004,  0.3594,  1.7422]], requires_grad=True)
2024-10-08 15:05:53,291 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.0156,  0.6094,  0.1069,  0.1226],
        [ 2.2812,  1.0625,  2.0156,  2.5469],
        [ 1.4219, -0.2930,  0.1079,  0.8086],
        ...,
        [-0.7109, -1.1250, -2.0625, -1.7266],
        [ 0.8281,  0.9492,  0.8164,  1.6953],
        [ 1.6016, -0.5156,  0.2559,  1.6953]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.0156,  0.6094,  0.1069,  0.1226],
        [ 2.2812,  1.0625,  2.0156,  2.5469],
        [ 1.4219, -0.2930,  0.1079,  0.8086],
        ...,
        [-0.7109, -1.1250, -2.0625, -1.7266],
        [ 0.8281,  0.9492,  0.8164,  1.6953],
        [ 1.6016, -0.5156,  0.2559,  1.6953]], requires_grad=True)
2024-10-08 15:05:53,538 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.9570,  0.5469,  0.0649,  0.1245],
        [ 2.2500,  0.9414,  1.8828,  2.4844],
        [ 1.4062, -0.3457,  0.0549,  0.7852],
        ...,
        [-0.7266, -1.0781, -2.0000, -1.7031],
        [ 0.8438,  0.8398,  0.7344,  1.6641],
        [ 1.5547, -0.6016,  0.1729,  1.6406]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.9570,  0.5469,  0.0649,  0.1245],
        [ 2.2500,  0.9414,  1.8828,  2.4844],
        [ 1.4062, -0.3457,  0.0549,  0.7852],
        ...,
        [-0.7266, -1.0781, -2.0000, -1.7031],
        [ 0.8438,  0.8398,  0.7344,  1.6641],
        [ 1.5547, -0.6016,  0.1729,  1.6406]], requires_grad=True)
2024-10-08 15:05:53,692 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.8906,  0.4844,  0.0227,  0.1299],
        [ 2.1875,  0.9648,  1.8906,  2.4375],
        [ 1.3594, -0.3301,  0.0613,  0.7734],
        ...,
        [-0.6914, -1.1875, -2.0781, -1.6953],
        [ 0.8203,  0.8984,  0.7891,  1.6719],
        [ 1.4688, -0.5195,  0.2139,  1.6094]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.8906,  0.4844,  0.0227,  0.1299],
        [ 2.1875,  0.9648,  1.8906,  2.4375],
        [ 1.3594, -0.3301,  0.0613,  0.7734],
        ...,
        [-0.6914, -1.1875, -2.0781, -1.6953],
        [ 0.8203,  0.8984,  0.7891,  1.6719],
        [ 1.4688, -0.5195,  0.2139,  1.6094]], requires_grad=True)
2024-10-08 15:05:53,958 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.8281,  0.4004, -0.0356,  0.1216],
        [ 2.0938,  1.1094,  1.9922,  2.4219],
        [ 1.3203, -0.2910,  0.0859,  0.7695],
        ...,
        [-0.6367, -1.3672, -2.2188, -1.7031],
        [ 0.7695,  1.0078,  0.8828,  1.6797],
        [ 1.3594, -0.3242,  0.3301,  1.6016]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.8281,  0.4004, -0.0356,  0.1216],
        [ 2.0938,  1.1094,  1.9922,  2.4219],
        [ 1.3203, -0.2910,  0.0859,  0.7695],
        ...,
        [-0.6367, -1.3672, -2.2188, -1.7031],
        [ 0.7695,  1.0078,  0.8828,  1.6797],
        [ 1.3594, -0.3242,  0.3301,  1.6016]], requires_grad=True)
2024-10-08 15:05:54,222 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.7695,  0.3418, -0.0767,  0.1250],
        [ 2.0312,  1.1562,  2.0156,  2.3750],
        [ 1.2969, -0.2891,  0.0879,  0.7578],
        ...,
        [-0.6094, -1.4531, -2.2969, -1.6875],
        [ 0.7305,  1.0547,  0.9336,  1.6641],
        [ 1.2656, -0.1729,  0.4199,  1.5781]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.7695,  0.3418, -0.0767,  0.1250],
        [ 2.0312,  1.1562,  2.0156,  2.3750],
        [ 1.2969, -0.2891,  0.0879,  0.7578],
        ...,
        [-0.6094, -1.4531, -2.2969, -1.6875],
        [ 0.7305,  1.0547,  0.9336,  1.6641],
        [ 1.2656, -0.1729,  0.4199,  1.5781]], requires_grad=True)
2024-10-08 15:05:54,382 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.7812,  0.3457, -0.0913,  0.1270],
        [ 2.0000,  1.0703,  1.9688,  2.2969],
        [ 1.3047, -0.3418,  0.0640,  0.7305],
        ...,
        [-0.6445, -1.4531, -2.3281, -1.6797],
        [ 0.7305,  1.0156,  0.9414,  1.6328],
        [ 1.2109, -0.1089,  0.4727,  1.5469]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.7812,  0.3457, -0.0913,  0.1270],
        [ 2.0000,  1.0703,  1.9688,  2.2969],
        [ 1.3047, -0.3418,  0.0640,  0.7305],
        ...,
        [-0.6445, -1.4531, -2.3281, -1.6797],
        [ 0.7305,  1.0156,  0.9414,  1.6328],
        [ 1.2109, -0.1089,  0.4727,  1.5469]], requires_grad=True)
2024-10-08 15:05:54,650 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.8477,  0.3691, -0.0996,  0.1147],
        [ 1.9922,  1.0312,  1.9531,  2.2656],
        [ 1.2969, -0.3652,  0.0520,  0.7109],
        ...,
        [-0.6172, -1.5078, -2.3750, -1.6562],
        [ 0.7031,  1.0078,  0.9570,  1.5938],
        [ 1.1719,  0.0067,  0.5430,  1.5469]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.8477,  0.3691, -0.0996,  0.1147],
        [ 1.9922,  1.0312,  1.9531,  2.2656],
        [ 1.2969, -0.3652,  0.0520,  0.7109],
        ...,
        [-0.6172, -1.5078, -2.3750, -1.6562],
        [ 0.7031,  1.0078,  0.9570,  1.5938],
        [ 1.1719,  0.0067,  0.5430,  1.5469]], requires_grad=True)
2024-10-08 15:05:54,916 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.8594,  0.3340, -0.1260,  0.0908],
        [ 1.9062,  1.1953,  2.0156,  2.2656],
        [ 1.2656, -0.3418,  0.0571,  0.7031],
        ...,
        [-0.5352, -1.6328, -2.4219, -1.6406],
        [ 0.6641,  1.0391,  0.9805,  1.5625],
        [ 1.1094,  0.2188,  0.6406,  1.5625]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.8594,  0.3340, -0.1260,  0.0908],
        [ 1.9062,  1.1953,  2.0156,  2.2656],
        [ 1.2656, -0.3418,  0.0571,  0.7031],
        ...,
        [-0.5352, -1.6328, -2.4219, -1.6406],
        [ 0.6641,  1.0391,  0.9805,  1.5625],
        [ 1.1094,  0.2188,  0.6406,  1.5625]], requires_grad=True)
2024-10-08 15:05:55,068 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.8633,  0.3008, -0.1484,  0.0752],
        [ 1.8047,  1.3672,  2.0781,  2.2500],
        [ 1.2188, -0.3145,  0.0635,  0.6914],
        ...,
        [-0.4512, -1.7578, -2.4688, -1.6172],
        [ 0.6172,  1.0703,  0.9961,  1.5234],
        [ 1.0312,  0.4375,  0.7344,  1.5703]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.8633,  0.3008, -0.1484,  0.0752],
        [ 1.8047,  1.3672,  2.0781,  2.2500],
        [ 1.2188, -0.3145,  0.0635,  0.6914],
        ...,
        [-0.4512, -1.7578, -2.4688, -1.6172],
        [ 0.6172,  1.0703,  0.9961,  1.5234],
        [ 1.0312,  0.4375,  0.7344,  1.5703]], requires_grad=True)
2024-10-08 15:05:55,214 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.8711,  0.2930, -0.1602,  0.0674],
        [ 1.7109,  1.4766,  2.1094,  2.2188],
        [ 1.1797, -0.3203,  0.0574,  0.6719],
        ...,
        [-0.3848, -1.8047, -2.4844, -1.5781],
        [ 0.5781,  1.0625,  0.9961,  1.4766],
        [ 0.9570,  0.6016,  0.8047,  1.5625]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.8711,  0.2930, -0.1602,  0.0674],
        [ 1.7109,  1.4766,  2.1094,  2.2188],
        [ 1.1797, -0.3203,  0.0574,  0.6719],
        ...,
        [-0.3848, -1.8047, -2.4844, -1.5781],
        [ 0.5781,  1.0625,  0.9961,  1.4766],
        [ 0.9570,  0.6016,  0.8047,  1.5625]], requires_grad=True)
2024-10-08 15:05:55,373 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.8789,  0.2871, -0.1709,  0.0608],
        [ 1.6484,  1.4688,  2.0938,  2.1875],
        [ 1.1562, -0.3613,  0.0391,  0.6445],
        ...,
        [-0.3516, -1.7969, -2.4688, -1.5391],
        [ 0.5508,  0.9805,  0.9688,  1.4141],
        [ 0.8945,  0.7422,  0.8672,  1.5547]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.8789,  0.2871, -0.1709,  0.0608],
        [ 1.6484,  1.4688,  2.0938,  2.1875],
        [ 1.1562, -0.3613,  0.0391,  0.6445],
        ...,
        [-0.3516, -1.7969, -2.4688, -1.5391],
        [ 0.5508,  0.9805,  0.9688,  1.4141],
        [ 0.8945,  0.7422,  0.8672,  1.5547]], requires_grad=True)
2024-10-08 15:05:55,636 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.8906,  0.2949, -0.1758,  0.0586],
        [ 1.5938,  1.4609,  2.0781,  2.1406],
        [ 1.1406, -0.4043,  0.0199,  0.6133],
        ...,
        [-0.3125, -1.7891, -2.4531, -1.4922],
        [ 0.5156,  0.9102,  0.9453,  1.3516],
        [ 0.8359,  0.8555,  0.9141,  1.5312]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.8906,  0.2949, -0.1758,  0.0586],
        [ 1.5938,  1.4609,  2.0781,  2.1406],
        [ 1.1406, -0.4043,  0.0199,  0.6133],
        ...,
        [-0.3125, -1.7891, -2.4531, -1.4922],
        [ 0.5156,  0.9102,  0.9453,  1.3516],
        [ 0.8359,  0.8555,  0.9141,  1.5312]], requires_grad=True)
2024-10-08 15:05:55,904 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.8984,  0.2344, -0.2031,  0.0383],
        [ 1.5156,  1.5547,  2.0938,  2.0938],
        [ 1.0859, -0.3711,  0.0262,  0.5820],
        ...,
        [-0.1582, -1.9297, -2.4688, -1.4141],
        [ 0.3867,  0.9766,  0.9570,  1.2656],
        [ 0.7617,  0.9961,  0.9648,  1.5000]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.8984,  0.2344, -0.2031,  0.0383],
        [ 1.5156,  1.5547,  2.0938,  2.0938],
        [ 1.0859, -0.3711,  0.0262,  0.5820],
        ...,
        [-0.1582, -1.9297, -2.4688, -1.4141],
        [ 0.3867,  0.9766,  0.9570,  1.2656],
        [ 0.7617,  0.9961,  0.9648,  1.5000]], requires_grad=True)
2024-10-08 15:05:56,163 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.8750,  0.1328, -0.2480,  0.0193],
        [ 1.3672,  1.9453,  2.2656,  2.0625],
        [ 1.0156, -0.3008,  0.0510,  0.5586],
        ...,
        [ 0.0035, -2.1562, -2.5312, -1.3594],
        [ 0.2402,  1.1094,  1.0000,  1.1875],
        [ 0.6641,  1.2109,  1.0469,  1.4688]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.8750,  0.1328, -0.2480,  0.0193],
        [ 1.3672,  1.9453,  2.2656,  2.0625],
        [ 1.0156, -0.3008,  0.0510,  0.5586],
        ...,
        [ 0.0035, -2.1562, -2.5312, -1.3594],
        [ 0.2402,  1.1094,  1.0000,  1.1875],
        [ 0.6641,  1.2109,  1.0469,  1.4688]], requires_grad=True)
2024-10-08 15:05:56,428 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.9492,  0.1318, -0.2373, -0.0164],
        [ 1.3359,  2.1719,  2.3438,  2.0625],
        [ 0.9883, -0.2754,  0.0503,  0.5430],
        ...,
        [ 0.0535, -2.2500, -2.5156, -1.3281],
        [ 0.1660,  1.1797,  1.0078,  1.1328],
        [ 0.5938,  1.3594,  1.0938,  1.4375]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.9492,  0.1318, -0.2373, -0.0164],
        [ 1.3359,  2.1719,  2.3438,  2.0625],
        [ 0.9883, -0.2754,  0.0503,  0.5430],
        ...,
        [ 0.0535, -2.2500, -2.5156, -1.3281],
        [ 0.1660,  1.1797,  1.0078,  1.1328],
        [ 0.5938,  1.3594,  1.0938,  1.4375]], requires_grad=True)
2024-10-08 15:05:56,685 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.0234,  0.1709, -0.1973, -0.0396],
        [ 1.2109,  2.4375,  2.4531,  2.0156],
        [ 1.0156, -0.3164,  0.0028,  0.5312],
        ...,
        [-0.0825, -2.2031, -2.4219, -1.3594],
        [ 0.3965,  0.9336,  0.8164,  1.1797],
        [ 0.6328,  1.3516,  1.0547,  1.4453]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.0234,  0.1709, -0.1973, -0.0396],
        [ 1.2109,  2.4375,  2.4531,  2.0156],
        [ 1.0156, -0.3164,  0.0028,  0.5312],
        ...,
        [-0.0825, -2.2031, -2.4219, -1.3594],
        [ 0.3965,  0.9336,  0.8164,  1.1797],
        [ 0.6328,  1.3516,  1.0547,  1.4453]], requires_grad=True)
2024-10-08 15:05:56,847 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.0625,  0.2715, -0.1064, -0.0430],
        [ 1.1250,  2.3438,  2.2500,  1.9688],
        [ 1.0312, -0.3965, -0.0776,  0.5195],
        ...,
        [-0.2266, -2.0156, -2.2031, -1.3906],
        [ 0.5859,  0.7109,  0.6406,  1.2031],
        [ 0.6992,  1.1406,  0.8672,  1.4609]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.0625,  0.2715, -0.1064, -0.0430],
        [ 1.1250,  2.3438,  2.2500,  1.9688],
        [ 1.0312, -0.3965, -0.0776,  0.5195],
        ...,
        [-0.2266, -2.0156, -2.2031, -1.3906],
        [ 0.5859,  0.7109,  0.6406,  1.2031],
        [ 0.6992,  1.1406,  0.8672,  1.4609]], requires_grad=True)
2024-10-08 15:05:57,114 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.1328,  0.3125, -0.0669, -0.0574],
        [ 1.1406,  2.3750,  2.1875,  1.9609],
        [ 1.1172, -0.3984, -0.0820,  0.5273],
        ...,
        [-0.4141, -1.9531, -2.1250, -1.4297],
        [ 0.8242,  0.6328,  0.5898,  1.2422],
        [ 0.7773,  1.0156,  0.7500,  1.4766]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.1328,  0.3125, -0.0669, -0.0574],
        [ 1.1406,  2.3750,  2.1875,  1.9609],
        [ 1.1172, -0.3984, -0.0820,  0.5273],
        ...,
        [-0.4141, -1.9531, -2.1250, -1.4297],
        [ 0.8242,  0.6328,  0.5898,  1.2422],
        [ 0.7773,  1.0156,  0.7500,  1.4766]], requires_grad=True)
2024-10-08 15:05:57,272 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.1562,  0.3418, -0.0327, -0.0491],
        [ 1.1172,  2.3125,  2.0625,  1.9531],
        [ 1.2500, -0.3594, -0.0464,  0.5391],
        ...,
        [-0.6641, -1.9688, -2.1406, -1.4766],
        [ 1.1094,  0.6562,  0.6289,  1.2812],
        [ 0.8008,  0.8789,  0.6289,  1.4609]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.1562,  0.3418, -0.0327, -0.0491],
        [ 1.1172,  2.3125,  2.0625,  1.9531],
        [ 1.2500, -0.3594, -0.0464,  0.5391],
        ...,
        [-0.6641, -1.9688, -2.1406, -1.4766],
        [ 1.1094,  0.6562,  0.6289,  1.2812],
        [ 0.8008,  0.8789,  0.6289,  1.4609]], requires_grad=True)
2024-10-08 15:05:57,534 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.2812,  0.3223, -0.0369, -0.0435],
        [ 1.3984,  2.4531,  2.1250,  1.9766],
        [ 1.4844, -0.2734,  0.0264,  0.5625],
        ...,
        [-1.0703, -2.0781, -2.2344, -1.5234],
        [ 1.5156,  0.7656,  0.7344,  1.3281],
        [ 0.9453,  0.8516,  0.5859,  1.4531]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.2812,  0.3223, -0.0369, -0.0435],
        [ 1.3984,  2.4531,  2.1250,  1.9766],
        [ 1.4844, -0.2734,  0.0264,  0.5625],
        ...,
        [-1.0703, -2.0781, -2.2344, -1.5234],
        [ 1.5156,  0.7656,  0.7344,  1.3281],
        [ 0.9453,  0.8516,  0.5859,  1.4531]], requires_grad=True)
2024-10-08 15:05:57,800 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.1562,  0.3496, -0.0496, -0.1123],
        [ 1.5469,  2.5312,  2.1875,  2.0312],
        [ 1.6406, -0.2090,  0.0918,  0.5938],
        ...,
        [-1.2344, -2.1250, -2.3281, -1.6250],
        [ 1.7578,  0.8281,  0.8320,  1.4062],
        [ 0.8867,  0.7539,  0.5625,  1.5156]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.1562,  0.3496, -0.0496, -0.1123],
        [ 1.5469,  2.5312,  2.1875,  2.0312],
        [ 1.6406, -0.2090,  0.0918,  0.5938],
        ...,
        [-1.2344, -2.1250, -2.3281, -1.6250],
        [ 1.7578,  0.8281,  0.8320,  1.4062],
        [ 0.8867,  0.7539,  0.5625,  1.5156]], requires_grad=True)
2024-10-08 15:05:58,065 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.9531,  0.3867, -0.0791, -0.2119],
        [ 1.4375,  2.5312,  2.3281,  2.2031],
        [ 1.7188, -0.1611,  0.1621,  0.6406],
        ...,
        [-1.2266, -2.1250, -2.4531, -1.7812],
        [ 1.8672,  0.8633,  0.9453,  1.5234],
        [ 0.6719,  0.6250,  0.5938,  1.6719]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.9531,  0.3867, -0.0791, -0.2119],
        [ 1.4375,  2.5312,  2.3281,  2.2031],
        [ 1.7188, -0.1611,  0.1621,  0.6406],
        ...,
        [-1.2266, -2.1250, -2.4531, -1.7812],
        [ 1.8672,  0.8633,  0.9453,  1.5234],
        [ 0.6719,  0.6250,  0.5938,  1.6719]], requires_grad=True)
2024-10-08 15:05:58,329 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.7148,  0.4316, -0.1177, -0.3105],
        [ 1.2031,  2.4531,  2.5156,  2.3906],
        [ 1.7109, -0.1367,  0.2422,  0.6992],
        ...,
        [-1.1562, -2.0938, -2.5625, -1.9219],
        [ 1.9141,  0.8672,  1.0625,  1.6328],
        [ 0.4434,  0.4902,  0.6367,  1.8203]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.7148,  0.4316, -0.1177, -0.3105],
        [ 1.2031,  2.4531,  2.5156,  2.3906],
        [ 1.7109, -0.1367,  0.2422,  0.6992],
        ...,
        [-1.1562, -2.0938, -2.5625, -1.9219],
        [ 1.9141,  0.8672,  1.0625,  1.6328],
        [ 0.4434,  0.4902,  0.6367,  1.8203]], requires_grad=True)
2024-10-08 15:05:58,488 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.4961,  0.4629, -0.1494, -0.3965],
        [ 0.9883,  2.3906,  2.6719,  2.5469],
        [ 1.6875, -0.1064,  0.3105,  0.7539],
        ...,
        [-1.0938, -2.0781, -2.6562, -2.0469],
        [ 1.9453,  0.8828,  1.1562,  1.7344],
        [ 0.2432,  0.3730,  0.6719,  1.9453]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.4961,  0.4629, -0.1494, -0.3965],
        [ 0.9883,  2.3906,  2.6719,  2.5469],
        [ 1.6875, -0.1064,  0.3105,  0.7539],
        ...,
        [-1.0938, -2.0781, -2.6562, -2.0469],
        [ 1.9453,  0.8828,  1.1562,  1.7344],
        [ 0.2432,  0.3730,  0.6719,  1.9453]], requires_grad=True)
2024-10-08 15:05:58,643 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.2832,  0.4805, -0.1777, -0.4824],
        [ 0.7656,  2.3594,  2.7969,  2.7031],
        [ 1.6562, -0.0659,  0.3691,  0.8125],
        ...,
        [-0.9570, -2.1094, -2.7344, -2.1875],
        [ 1.9219,  0.9297,  1.2422,  1.8438],
        [ 0.0254,  0.3086,  0.7070,  2.0781]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.2832,  0.4805, -0.1777, -0.4824],
        [ 0.7656,  2.3594,  2.7969,  2.7031],
        [ 1.6562, -0.0659,  0.3691,  0.8125],
        ...,
        [-0.9570, -2.1094, -2.7344, -2.1875],
        [ 1.9219,  0.9297,  1.2422,  1.8438],
        [ 0.0254,  0.3086,  0.7070,  2.0781]], requires_grad=True)
2024-10-08 15:05:58,901 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.0393,  0.4707, -0.2119, -0.5859],
        [ 0.4453,  2.4062,  2.9375,  2.8750],
        [ 1.5234,  0.0032,  0.4316,  0.8906],
        ...,
        [-0.6523, -2.2031, -2.8281, -2.3594],
        [ 1.7344,  1.0391,  1.3359,  1.9844],
        [-0.2402,  0.3027,  0.7539,  2.2344]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.0393,  0.4707, -0.2119, -0.5859],
        [ 0.4453,  2.4062,  2.9375,  2.8750],
        [ 1.5234,  0.0032,  0.4316,  0.8906],
        ...,
        [-0.6523, -2.2031, -2.8281, -2.3594],
        [ 1.7344,  1.0391,  1.3359,  1.9844],
        [-0.2402,  0.3027,  0.7539,  2.2344]], requires_grad=True)
2024-10-08 15:05:59,053 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 0.3398,  0.4238, -0.2656, -0.6719],
        [ 0.1226,  2.4531,  3.0469,  3.0000],
        [ 1.3594,  0.0737,  0.4941,  0.9570],
        ...,
        [-0.3008, -2.3125, -2.9375, -2.5000],
        [ 1.4453,  1.1719,  1.4453,  2.1094],
        [-0.4922,  0.3125,  0.8047,  2.3750]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 0.3398,  0.4238, -0.2656, -0.6719],
        [ 0.1226,  2.4531,  3.0469,  3.0000],
        [ 1.3594,  0.0737,  0.4941,  0.9570],
        ...,
        [-0.3008, -2.3125, -2.9375, -2.5000],
        [ 1.4453,  1.1719,  1.4453,  2.1094],
        [-0.4922,  0.3125,  0.8047,  2.3750]], requires_grad=True)
2024-10-08 15:05:59,212 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 0.8828,  0.3145, -0.3867, -0.7500],
        [-0.2139,  2.5312,  3.2031,  3.1250],
        [ 1.1562,  0.1602,  0.5781,  1.0234],
        ...,
        [ 0.0471, -2.4375, -3.0469, -2.6250],
        [ 1.2188,  1.2891,  1.5391,  2.2188],
        [-0.7852,  0.3652,  0.8945,  2.4844]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 0.8828,  0.3145, -0.3867, -0.7500],
        [-0.2139,  2.5312,  3.2031,  3.1250],
        [ 1.1562,  0.1602,  0.5781,  1.0234],
        ...,
        [ 0.0471, -2.4375, -3.0469, -2.6250],
        [ 1.2188,  1.2891,  1.5391,  2.2188],
        [-0.7852,  0.3652,  0.8945,  2.4844]], requires_grad=True)
2024-10-08 15:05:59,371 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 1.3359,  0.2295, -0.4727, -0.8008],
        [-0.3379,  2.5000,  3.2031,  3.2344],
        [ 1.0859,  0.2061,  0.6133,  1.0781],
        ...,
        [ 0.0374, -2.4219, -2.9844, -2.7500],
        [ 1.1484,  1.3281,  1.5391,  2.3125],
        [-0.7656,  0.2715,  0.8008,  2.5781]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 1.3359,  0.2295, -0.4727, -0.8008],
        [-0.3379,  2.5000,  3.2031,  3.2344],
        [ 1.0859,  0.2061,  0.6133,  1.0781],
        ...,
        [ 0.0374, -2.4219, -2.9844, -2.7500],
        [ 1.1484,  1.3281,  1.5391,  2.3125],
        [-0.7656,  0.2715,  0.8008,  2.5781]], requires_grad=True)
2024-10-08 15:05:59,622 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 1.8203,  0.1357, -0.5703, -0.8281],
        [-0.3750,  2.4219,  3.1250,  3.3125],
        [ 1.1875,  0.2080,  0.5898,  1.1562],
        ...,
        [-0.1250, -2.3438, -2.8594, -2.8906],
        [ 1.1953,  1.3281,  1.5000,  2.4062],
        [-0.7852,  0.1865,  0.7109,  2.6250]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 1.8203,  0.1357, -0.5703, -0.8281],
        [-0.3750,  2.4219,  3.1250,  3.3125],
        [ 1.1875,  0.2080,  0.5898,  1.1562],
        ...,
        [-0.1250, -2.3438, -2.8594, -2.8906],
        [ 1.1953,  1.3281,  1.5000,  2.4062],
        [-0.7852,  0.1865,  0.7109,  2.6250]], requires_grad=True)
2024-10-08 15:05:59,885 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 2.1562,  0.0845, -0.6172, -0.8477],
        [-0.3887,  2.3594,  3.0781,  3.3750],
        [ 1.2812,  0.2070,  0.5664,  1.2188],
        ...,
        [-0.2695, -2.2969, -2.7812, -3.0000],
        [ 1.2812,  1.2969,  1.4219,  2.4688],
        [-0.8047,  0.1270,  0.6562,  2.6719]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 2.1562,  0.0845, -0.6172, -0.8477],
        [-0.3887,  2.3594,  3.0781,  3.3750],
        [ 1.2812,  0.2070,  0.5664,  1.2188],
        ...,
        [-0.2695, -2.2969, -2.7812, -3.0000],
        [ 1.2812,  1.2969,  1.4219,  2.4688],
        [-0.8047,  0.1270,  0.6562,  2.6719]], requires_grad=True)
2024-10-08 15:06:00,148 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 2.4375,  0.0344, -0.6641, -0.8633],
        [-0.3809,  2.2969,  3.0469,  3.4219],
        [ 1.3906,  0.1982,  0.5391,  1.2656],
        ...,
        [-0.4766, -2.2188, -2.6562, -3.0938],
        [ 1.4141,  1.2344,  1.3281,  2.5156],
        [-0.8359,  0.0894,  0.6211,  2.6875]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 2.4375,  0.0344, -0.6641, -0.8633],
        [-0.3809,  2.2969,  3.0469,  3.4219],
        [ 1.3906,  0.1982,  0.5391,  1.2656],
        ...,
        [-0.4766, -2.2188, -2.6562, -3.0938],
        [ 1.4141,  1.2344,  1.3281,  2.5156],
        [-0.8359,  0.0894,  0.6211,  2.6875]], requires_grad=True)
2024-10-08 15:06:00,410 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 2.6875, -0.0109, -0.7070, -0.8672],
        [-0.3184,  2.2188,  2.9844,  3.4531],
        [ 1.5469,  0.1689,  0.4922,  1.3125],
        ...,
        [-0.7539, -2.0938, -2.5000, -3.1875],
        [ 1.5938,  1.1484,  1.2109,  2.5625],
        [-0.8750,  0.0698,  0.6055,  2.6875]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 2.6875, -0.0109, -0.7070, -0.8672],
        [-0.3184,  2.2188,  2.9844,  3.4531],
        [ 1.5469,  0.1689,  0.4922,  1.3125],
        ...,
        [-0.7539, -2.0938, -2.5000, -3.1875],
        [ 1.5938,  1.1484,  1.2109,  2.5625],
        [-0.8750,  0.0698,  0.6055,  2.6875]], requires_grad=True)
2024-10-08 15:06:00,682 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 2.9219, -0.0742, -0.7695, -0.8711],
        [-0.3203,  2.2656,  3.0625,  3.4844],
        [ 1.6562,  0.1729,  0.4863,  1.3594],
        ...,
        [-0.9766, -2.0000, -2.3750, -3.2656],
        [ 1.7109,  1.1094,  1.1328,  2.5938],
        [-0.9492,  0.1455,  0.6875,  2.7031]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 2.9219, -0.0742, -0.7695, -0.8711],
        [-0.3203,  2.2656,  3.0625,  3.4844],
        [ 1.6562,  0.1729,  0.4863,  1.3594],
        ...,
        [-0.9766, -2.0000, -2.3750, -3.2656],
        [ 1.7109,  1.1094,  1.1328,  2.5938],
        [-0.9492,  0.1455,  0.6875,  2.7031]], requires_grad=True)
2024-10-08 15:06:00,941 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 3.1406, -0.1768, -0.8750, -0.8828],
        [-0.3438,  2.3594,  3.2188,  3.5000],
        [ 1.7344,  0.1982,  0.5078,  1.3984],
        ...,
        [-1.1484, -1.9219, -2.2656, -3.3125],
        [ 1.7891,  1.1016,  1.1094,  2.6094],
        [-1.0312,  0.2793,  0.8359,  2.7188]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 3.1406, -0.1768, -0.8750, -0.8828],
        [-0.3438,  2.3594,  3.2188,  3.5000],
        [ 1.7344,  0.1982,  0.5078,  1.3984],
        ...,
        [-1.1484, -1.9219, -2.2656, -3.3125],
        [ 1.7891,  1.1016,  1.1094,  2.6094],
        [-1.0312,  0.2793,  0.8359,  2.7188]], requires_grad=True)
2024-10-08 15:06:01,196 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 3.3125, -0.2520, -0.9492, -0.8906],
        [-0.3691,  2.4375,  3.3594,  3.5000],
        [ 1.7891,  0.2305,  0.5391,  1.4297],
        ...,
        [-1.2969, -1.8672, -2.1875, -3.3438],
        [ 1.8438,  1.1094,  1.1094,  2.6250],
        [-1.1016,  0.4160,  0.9844,  2.7188]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 3.3125, -0.2520, -0.9492, -0.8906],
        [-0.3691,  2.4375,  3.3594,  3.5000],
        [ 1.7891,  0.2305,  0.5391,  1.4297],
        ...,
        [-1.2969, -1.8672, -2.1875, -3.3438],
        [ 1.8438,  1.1094,  1.1094,  2.6250],
        [-1.1016,  0.4160,  0.9844,  2.7188]], requires_grad=True)
2024-10-08 15:06:01,450 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 3.4844, -0.3398, -1.0391, -0.8867],
        [-0.3945,  2.5000,  3.4844,  3.4844],
        [ 1.8438,  0.2422,  0.5469,  1.4609],
        ...,
        [-1.4531, -1.7734, -2.0625, -3.3750],
        [ 1.9141,  1.0781,  1.0703,  2.6406],
        [-1.1641,  0.5039,  1.0781,  2.7031]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 3.4844, -0.3398, -1.0391, -0.8867],
        [-0.3945,  2.5000,  3.4844,  3.4844],
        [ 1.8438,  0.2422,  0.5469,  1.4609],
        ...,
        [-1.4531, -1.7734, -2.0625, -3.3750],
        [ 1.9141,  1.0781,  1.0703,  2.6406],
        [-1.1641,  0.5039,  1.0781,  2.7031]], requires_grad=True)
2024-10-08 15:06:01,603 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 3.6250, -0.3926, -1.0859, -0.8789],
        [-0.4062,  2.4844,  3.4688,  3.4688],
        [ 1.8828,  0.2471,  0.5469,  1.4688],
        ...,
        [-1.6016, -1.6250, -1.8750, -3.4062],
        [ 1.9844,  0.9961,  0.9609,  2.6562],
        [-1.2188,  0.5586,  1.1328,  2.6875]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 3.6250, -0.3926, -1.0859, -0.8789],
        [-0.4062,  2.4844,  3.4688,  3.4688],
        [ 1.8828,  0.2471,  0.5469,  1.4688],
        ...,
        [-1.6016, -1.6250, -1.8750, -3.4062],
        [ 1.9844,  0.9961,  0.9609,  2.6562],
        [-1.2188,  0.5586,  1.1328,  2.6875]], requires_grad=True)
2024-10-08 15:06:01,857 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 3.7344, -0.4395, -1.1250, -0.8711],
        [-0.4141,  2.4531,  3.4375,  3.4531],
        [ 1.9141,  0.2402,  0.5312,  1.4766],
        ...,
        [-1.7344, -1.5000, -1.7109, -3.4062],
        [ 2.0312,  0.8984,  0.8359,  2.6562],
        [-1.2578,  0.6094,  1.1875,  2.6719]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 3.7344, -0.4395, -1.1250, -0.8711],
        [-0.4141,  2.4531,  3.4375,  3.4531],
        [ 1.9141,  0.2402,  0.5312,  1.4766],
        ...,
        [-1.7344, -1.5000, -1.7109, -3.4062],
        [ 2.0312,  0.8984,  0.8359,  2.6562],
        [-1.2578,  0.6094,  1.1875,  2.6719]], requires_grad=True)
2024-10-08 15:06:02,116 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 3.8281, -0.5039, -1.1953, -0.8594],
        [-0.4258,  2.4219,  3.4219,  3.4219],
        [ 1.9297,  0.2441,  0.5273,  1.4766],
        ...,
        [-1.8359, -1.4062, -1.5938, -3.3906],
        [ 2.0625,  0.8477,  0.7773,  2.6406],
        [-1.2891,  0.6797,  1.2578,  2.6406]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 3.8281, -0.5039, -1.1953, -0.8594],
        [-0.4258,  2.4219,  3.4219,  3.4219],
        [ 1.9297,  0.2441,  0.5273,  1.4766],
        ...,
        [-1.8359, -1.4062, -1.5938, -3.3906],
        [ 2.0625,  0.8477,  0.7773,  2.6406],
        [-1.2891,  0.6797,  1.2578,  2.6406]], requires_grad=True)
2024-10-08 15:06:02,378 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 3.9062, -0.5742, -1.2656, -0.8398],
        [-0.4551,  2.4688,  3.5000,  3.3750],
        [ 1.9297,  0.2656,  0.5547,  1.4609],
        ...,
        [-1.9062, -1.3438, -1.5234, -3.3594],
        [ 2.0625,  0.8242,  0.7578,  2.6094],
        [-1.3281,  0.7578,  1.3438,  2.5938]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 3.9062, -0.5742, -1.2656, -0.8398],
        [-0.4551,  2.4688,  3.5000,  3.3750],
        [ 1.9297,  0.2656,  0.5547,  1.4609],
        ...,
        [-1.9062, -1.3438, -1.5234, -3.3594],
        [ 2.0625,  0.8242,  0.7578,  2.6094],
        [-1.3281,  0.7578,  1.3438,  2.5938]], requires_grad=True)
2024-10-08 15:06:02,536 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 3.9531, -0.6680, -1.3750, -0.8203],
        [-0.4805,  2.5312,  3.6250,  3.3281],
        [ 1.9219,  0.3066,  0.6133,  1.4453],
        ...,
        [-1.9688, -1.3281, -1.5391, -3.3281],
        [ 2.0625,  0.8281,  0.7773,  2.5781],
        [-1.3516,  0.8672,  1.4766,  2.5469]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 3.9531, -0.6680, -1.3750, -0.8203],
        [-0.4805,  2.5312,  3.6250,  3.3281],
        [ 1.9219,  0.3066,  0.6133,  1.4453],
        ...,
        [-1.9688, -1.3281, -1.5391, -3.3281],
        [ 2.0625,  0.8281,  0.7773,  2.5781],
        [-1.3516,  0.8672,  1.4766,  2.5469]], requires_grad=True)
2024-10-08 15:06:02,700 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 3.9688, -0.7539, -1.4844, -0.8008],
        [-0.5117,  2.5625,  3.6875,  3.2812],
        [ 1.9062,  0.3359,  0.6523,  1.4219],
        ...,
        [-2.0000, -1.2969, -1.5234, -3.2969],
        [ 2.0469,  0.8281,  0.7930,  2.5469],
        [-1.3750,  0.9570,  1.5859,  2.5000]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 3.9688, -0.7539, -1.4844, -0.8008],
        [-0.5117,  2.5625,  3.6875,  3.2812],
        [ 1.9062,  0.3359,  0.6523,  1.4219],
        ...,
        [-2.0000, -1.2969, -1.5234, -3.2969],
        [ 2.0469,  0.8281,  0.7930,  2.5469],
        [-1.3750,  0.9570,  1.5859,  2.5000]], requires_grad=True)
2024-10-08 15:06:02,857 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 3.9531, -0.8359, -1.5859, -0.7852],
        [-0.5391,  2.5625,  3.7031,  3.2344],
        [ 1.8672,  0.3398,  0.6484,  1.4062],
        ...,
        [-1.9922, -1.2188, -1.4141, -3.2656],
        [ 2.0156,  0.8047,  0.7656,  2.5156],
        [-1.4062,  1.0000,  1.6250,  2.4531]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 3.9531, -0.8359, -1.5859, -0.7852],
        [-0.5391,  2.5625,  3.7031,  3.2344],
        [ 1.8672,  0.3398,  0.6484,  1.4062],
        ...,
        [-1.9922, -1.2188, -1.4141, -3.2656],
        [ 2.0156,  0.8047,  0.7656,  2.5156],
        [-1.4062,  1.0000,  1.6250,  2.4531]], requires_grad=True)
2024-10-08 15:06:03,114 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 3.9219, -0.8867, -1.6406, -0.7773],
        [-0.5547,  2.5625,  3.7188,  3.1875],
        [ 1.8281,  0.3281,  0.6172,  1.3984],
        ...,
        [-1.9766, -1.1094, -1.2656, -3.2344],
        [ 1.9766,  0.7617,  0.7109,  2.4688],
        [-1.4297,  1.0234,  1.6484,  2.4062]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 3.9219, -0.8867, -1.6406, -0.7773],
        [-0.5547,  2.5625,  3.7188,  3.1875],
        [ 1.8281,  0.3281,  0.6172,  1.3984],
        ...,
        [-1.9766, -1.1094, -1.2656, -3.2344],
        [ 1.9766,  0.7617,  0.7109,  2.4688],
        [-1.4297,  1.0234,  1.6484,  2.4062]], requires_grad=True)
2024-10-08 15:06:03,377 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 3.8750, -0.8945, -1.6328, -0.7695],
        [-0.5547,  2.5156,  3.6562,  3.1406],
        [ 1.7969,  0.3027,  0.5664,  1.3906],
        ...,
        [-1.9531, -1.0000, -1.0938, -3.1875],
        [ 1.9375,  0.7031,  0.6328,  2.4219],
        [-1.4375,  1.0312,  1.6406,  2.3594]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 3.8750, -0.8945, -1.6328, -0.7695],
        [-0.5547,  2.5156,  3.6562,  3.1406],
        [ 1.7969,  0.3027,  0.5664,  1.3906],
        ...,
        [-1.9531, -1.0000, -1.0938, -3.1875],
        [ 1.9375,  0.7031,  0.6328,  2.4219],
        [-1.4375,  1.0312,  1.6406,  2.3594]], requires_grad=True)
2024-10-08 15:06:03,639 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 3.8281, -0.8984, -1.6250, -0.7617],
        [-0.5586,  2.4375,  3.5312,  3.0781],
        [ 1.7578,  0.2832,  0.5273,  1.3750],
        ...,
        [-1.9219, -0.9141, -0.9766, -3.1406],
        [ 1.9062,  0.6680,  0.5859,  2.3750],
        [-1.4375,  1.0547,  1.6641,  2.3125]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 3.8281, -0.8984, -1.6250, -0.7617],
        [-0.5586,  2.4375,  3.5312,  3.0781],
        [ 1.7578,  0.2832,  0.5273,  1.3750],
        ...,
        [-1.9219, -0.9141, -0.9766, -3.1406],
        [ 1.9062,  0.6680,  0.5859,  2.3750],
        [-1.4375,  1.0547,  1.6641,  2.3125]], requires_grad=True)
2024-10-08 15:06:03,791 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 3.8125, -0.9102, -1.6250, -0.7461],
        [-0.5664,  2.3594,  3.4219,  3.0000],
        [ 1.6953,  0.2930,  0.5352,  1.3672],
        ...,
        [-1.8594, -0.8984, -0.9766, -3.1094],
        [ 1.8281,  0.6914,  0.6406,  2.3438],
        [-1.4609,  1.1250,  1.7578,  2.2969]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 3.8125, -0.9102, -1.6250, -0.7461],
        [-0.5664,  2.3594,  3.4219,  3.0000],
        [ 1.6953,  0.2930,  0.5352,  1.3672],
        ...,
        [-1.8594, -0.8984, -0.9766, -3.1094],
        [ 1.8281,  0.6914,  0.6406,  2.3438],
        [-1.4609,  1.1250,  1.7578,  2.2969]], requires_grad=True)
2024-10-08 15:06:03,931 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 3.7969, -0.9258, -1.6250, -0.7266],
        [-0.5781,  2.2812,  3.2969,  2.9219],
        [ 1.6250,  0.3086,  0.5508,  1.3594],
        ...,
        [-1.7500, -0.9297, -1.0391, -3.0781],
        [ 1.7188,  0.7461,  0.7305,  2.3281],
        [-1.4844,  1.1875,  1.8281,  2.2656]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 3.7969, -0.9258, -1.6250, -0.7266],
        [-0.5781,  2.2812,  3.2969,  2.9219],
        [ 1.6250,  0.3086,  0.5508,  1.3594],
        ...,
        [-1.7500, -0.9297, -1.0391, -3.0781],
        [ 1.7188,  0.7461,  0.7305,  2.3281],
        [-1.4844,  1.1875,  1.8281,  2.2656]], requires_grad=True)
2024-10-08 15:06:04,089 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 3.7656, -0.9258, -1.6172, -0.7070],
        [-0.5820,  2.2188,  3.2031,  2.8438],
        [ 1.5703,  0.3086,  0.5547,  1.3438],
        ...,
        [-1.6406, -0.9570, -1.0938, -3.0312],
        [ 1.6250,  0.7812,  0.7969,  2.3125],
        [-1.4844,  1.2031,  1.8672,  2.2344]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 3.7656, -0.9258, -1.6172, -0.7070],
        [-0.5820,  2.2188,  3.2031,  2.8438],
        [ 1.5703,  0.3086,  0.5547,  1.3438],
        ...,
        [-1.6406, -0.9570, -1.0938, -3.0312],
        [ 1.6250,  0.7812,  0.7969,  2.3125],
        [-1.4844,  1.2031,  1.8672,  2.2344]], requires_grad=True)
2024-10-08 15:06:04,354 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 3.7188, -0.9219, -1.6016, -0.6992],
        [-0.5703,  2.1406,  3.0938,  2.7812],
        [ 1.5234,  0.2910,  0.5430,  1.3281],
        ...,
        [-1.5547, -0.9336, -1.1094, -2.9844],
        [ 1.5469,  0.7734,  0.8320,  2.2812],
        [-1.4766,  1.1797,  1.8750,  2.1875]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 3.7188, -0.9219, -1.6016, -0.6992],
        [-0.5703,  2.1406,  3.0938,  2.7812],
        [ 1.5234,  0.2910,  0.5430,  1.3281],
        ...,
        [-1.5547, -0.9336, -1.1094, -2.9844],
        [ 1.5469,  0.7734,  0.8320,  2.2812],
        [-1.4766,  1.1797,  1.8750,  2.1875]], requires_grad=True)
2024-10-08 15:06:04,512 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 3.6562, -0.8633, -1.5625, -0.6797],
        [-0.5391,  2.0156,  2.9688,  2.7188],
        [ 1.4766,  0.2754,  0.5312,  1.3047],
        ...,
        [-1.4766, -0.8789, -1.1016, -2.9219],
        [ 1.4922,  0.7148,  0.8398,  2.2500],
        [-1.4531,  1.1172,  1.8594,  2.1406]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 3.6562, -0.8633, -1.5625, -0.6797],
        [-0.5391,  2.0156,  2.9688,  2.7188],
        [ 1.4766,  0.2754,  0.5312,  1.3047],
        ...,
        [-1.4766, -0.8789, -1.1016, -2.9219],
        [ 1.4922,  0.7148,  0.8398,  2.2500],
        [-1.4531,  1.1172,  1.8594,  2.1406]], requires_grad=True)
2024-10-08 15:06:04,668 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 3.5938, -0.8047, -1.5234, -0.6562],
        [-0.5156,  1.9141,  2.8438,  2.6562],
        [ 1.4297,  0.2715,  0.5234,  1.2812],
        ...,
        [-1.3828, -0.8789, -1.1094, -2.8594],
        [ 1.4297,  0.6875,  0.8516,  2.2188],
        [-1.4297,  1.0703,  1.8438,  2.0938]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 3.5938, -0.8047, -1.5234, -0.6562],
        [-0.5156,  1.9141,  2.8438,  2.6562],
        [ 1.4297,  0.2715,  0.5234,  1.2812],
        ...,
        [-1.3828, -0.8789, -1.1094, -2.8594],
        [ 1.4297,  0.6875,  0.8516,  2.2188],
        [-1.4297,  1.0703,  1.8438,  2.0938]], requires_grad=True)
2024-10-08 15:06:04,935 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 3.5312, -0.7734, -1.4844, -0.6328],
        [-0.4883,  1.8594,  2.7500,  2.5938],
        [ 1.3828,  0.2871,  0.5156,  1.2578],
        ...,
        [-1.2891, -0.9297, -1.1172, -2.8125],
        [ 1.3594,  0.7070,  0.8594,  2.1719],
        [-1.4219,  1.0781,  1.8281,  2.0469]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 3.5312, -0.7734, -1.4844, -0.6328],
        [-0.4883,  1.8594,  2.7500,  2.5938],
        [ 1.3828,  0.2871,  0.5156,  1.2578],
        ...,
        [-1.2891, -0.9297, -1.1172, -2.8125],
        [ 1.3594,  0.7070,  0.8594,  2.1719],
        [-1.4219,  1.0781,  1.8281,  2.0469]], requires_grad=True)
2024-10-08 15:06:05,191 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 3.4688, -0.7461, -1.4453, -0.6094],
        [-0.4688,  1.8125,  2.6562,  2.5312],
        [ 1.3281,  0.3125,  0.5078,  1.2344],
        ...,
        [-1.1875, -0.9883, -1.1250, -2.7500],
        [ 1.2891,  0.7344,  0.8633,  2.1250],
        [-1.4141,  1.0859,  1.8125,  2.0000]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 3.4688, -0.7461, -1.4453, -0.6094],
        [-0.4688,  1.8125,  2.6562,  2.5312],
        [ 1.3281,  0.3125,  0.5078,  1.2344],
        ...,
        [-1.1875, -0.9883, -1.1250, -2.7500],
        [ 1.2891,  0.7344,  0.8633,  2.1250],
        [-1.4141,  1.0859,  1.8125,  2.0000]], requires_grad=True)
2024-10-08 15:06:05,351 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 3.4062, -0.7070, -1.4062, -0.5820],
        [-0.4609,  1.7500,  2.5625,  2.4688],
        [ 1.2734,  0.3145,  0.5000,  1.2031],
        ...,
        [-1.0938, -1.0391, -1.1250, -2.6875],
        [ 1.2109,  0.7266,  0.8672,  2.0781],
        [-1.4062,  1.0625,  1.7891,  1.9453]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 3.4062, -0.7070, -1.4062, -0.5820],
        [-0.4609,  1.7500,  2.5625,  2.4688],
        [ 1.2734,  0.3145,  0.5000,  1.2031],
        ...,
        [-1.0938, -1.0391, -1.1250, -2.6875],
        [ 1.2109,  0.7266,  0.8672,  2.0781],
        [-1.4062,  1.0625,  1.7891,  1.9453]], requires_grad=True)
2024-10-08 15:06:05,506 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 3.3281, -0.6914, -1.3672, -0.5625],
        [-0.4590,  1.6953,  2.4688,  2.3906],
        [ 1.2188,  0.3145,  0.4922,  1.1719],
        ...,
        [-1.0078, -1.0781, -1.1250, -2.6250],
        [ 1.1328,  0.6992,  0.8672,  2.0156],
        [-1.3984,  1.0391,  1.7656,  1.8984]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 3.3281, -0.6914, -1.3672, -0.5625],
        [-0.4590,  1.6953,  2.4688,  2.3906],
        [ 1.2188,  0.3145,  0.4922,  1.1719],
        ...,
        [-1.0078, -1.0781, -1.1250, -2.6250],
        [ 1.1328,  0.6992,  0.8672,  2.0156],
        [-1.3984,  1.0391,  1.7656,  1.8984]], requires_grad=True)
2024-10-08 15:06:05,760 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 3.2812, -0.6641, -1.3281, -0.5391],
        [-0.4609,  1.6328,  2.3750,  2.3125],
        [ 1.1641,  0.3184,  0.4844,  1.1484],
        ...,
        [-0.9297, -1.1094, -1.1172, -2.5625],
        [ 1.0625,  0.6875,  0.8633,  1.9531],
        [-1.3750,  1.0469,  1.7344,  1.8516]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 3.2812, -0.6641, -1.3281, -0.5391],
        [-0.4609,  1.6328,  2.3750,  2.3125],
        [ 1.1641,  0.3184,  0.4844,  1.1484],
        ...,
        [-0.9297, -1.1094, -1.1172, -2.5625],
        [ 1.0625,  0.6875,  0.8633,  1.9531],
        [-1.3750,  1.0469,  1.7344,  1.8516]], requires_grad=True)
2024-10-08 15:06:06,026 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 3.2188, -0.7070, -1.2891, -0.5195],
        [-0.4648,  1.5234,  2.2812,  2.2500],
        [ 1.1250,  0.3320,  0.4766,  1.1250],
        ...,
        [-0.8711, -1.1641, -1.1094, -2.5000],
        [ 1.0078,  0.6914,  0.8594,  1.8984],
        [-1.3516,  1.0859,  1.7031,  1.8125]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 3.2188, -0.7070, -1.2891, -0.5195],
        [-0.4648,  1.5234,  2.2812,  2.2500],
        [ 1.1250,  0.3320,  0.4766,  1.1250],
        ...,
        [-0.8711, -1.1641, -1.1094, -2.5000],
        [ 1.0078,  0.6914,  0.8594,  1.8984],
        [-1.3516,  1.0859,  1.7031,  1.8125]], requires_grad=True)
2024-10-08 15:06:06,288 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 3.1719, -0.7461, -1.2500, -0.4961],
        [-0.4648,  1.2891,  2.2031,  2.1719],
        [ 1.0859,  0.3184,  0.4688,  1.0938],
        ...,
        [-0.8125, -1.1484, -1.1016, -2.4219],
        [ 0.9531,  0.6445,  0.8516,  1.8438],
        [-1.3203,  1.0625,  1.6719,  1.7656]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 3.1719, -0.7461, -1.2500, -0.4961],
        [-0.4648,  1.2891,  2.2031,  2.1719],
        [ 1.0859,  0.3184,  0.4688,  1.0938],
        ...,
        [-0.8125, -1.1484, -1.1016, -2.4219],
        [ 0.9531,  0.6445,  0.8516,  1.8438],
        [-1.3203,  1.0625,  1.6719,  1.7656]], requires_grad=True)
2024-10-08 15:06:06,544 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 3.1406, -0.8242, -1.2109, -0.4746],
        [-0.4766,  1.1641,  2.1406,  2.0938],
        [ 1.0391,  0.3262,  0.4609,  1.0703],
        ...,
        [-0.7422, -1.1953, -1.0938, -2.3594],
        [ 0.8945,  0.6250,  0.8438,  1.7891],
        [-1.3047,  1.1094,  1.6484,  1.7188]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 3.1406, -0.8242, -1.2109, -0.4746],
        [-0.4766,  1.1641,  2.1406,  2.0938],
        [ 1.0391,  0.3262,  0.4609,  1.0703],
        ...,
        [-0.7422, -1.1953, -1.0938, -2.3594],
        [ 0.8945,  0.6250,  0.8438,  1.7891],
        [-1.3047,  1.1094,  1.6484,  1.7188]], requires_grad=True)
2024-10-08 15:06:06,803 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 3.1094, -0.9023, -1.1719, -0.4570],
        [-0.4902,  1.0625,  2.0781,  2.0156],
        [ 0.9922,  0.3320,  0.4531,  1.0391],
        ...,
        [-0.6797, -1.2031, -1.0781, -2.2812],
        [ 0.8320,  0.6133,  0.8398,  1.7344],
        [-1.2891,  1.1562,  1.6172,  1.6719]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 3.1094, -0.9023, -1.1719, -0.4570],
        [-0.4902,  1.0625,  2.0781,  2.0156],
        [ 0.9922,  0.3320,  0.4531,  1.0391],
        ...,
        [-0.6797, -1.2031, -1.0781, -2.2812],
        [ 0.8320,  0.6133,  0.8398,  1.7344],
        [-1.2891,  1.1562,  1.6172,  1.6719]], requires_grad=True)
2024-10-08 15:06:07,068 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 3.0781, -0.9922, -1.1484, -0.4473],
        [-0.4434,  0.9180,  2.0156,  1.9609],
        [ 0.9727,  0.3320,  0.4453,  1.0156],
        ...,
        [-0.6758, -1.1328, -1.0547, -2.2188],
        [ 0.7891,  0.5859,  0.8320,  1.6797],
        [-1.2500,  1.1328,  1.5781,  1.6250]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 3.0781, -0.9922, -1.1484, -0.4473],
        [-0.4434,  0.9180,  2.0156,  1.9609],
        [ 0.9727,  0.3320,  0.4453,  1.0156],
        ...,
        [-0.6758, -1.1328, -1.0547, -2.2188],
        [ 0.7891,  0.5859,  0.8320,  1.6797],
        [-1.2500,  1.1328,  1.5781,  1.6250]], requires_grad=True)
2024-10-08 15:06:07,340 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 3.0469, -1.0938, -1.1250, -0.4414],
        [-0.4277,  0.8789,  1.9609,  1.9141],
        [ 0.9492,  0.3418,  0.4395,  0.9961],
        ...,
        [-0.6484, -1.1250, -1.0391, -2.1719],
        [ 0.7344,  0.5820,  0.8242,  1.6328],
        [-1.2188,  1.1328,  1.5469,  1.5781]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 3.0469, -1.0938, -1.1250, -0.4414],
        [-0.4277,  0.8789,  1.9609,  1.9141],
        [ 0.9492,  0.3418,  0.4395,  0.9961],
        ...,
        [-0.6484, -1.1250, -1.0391, -2.1719],
        [ 0.7344,  0.5820,  0.8242,  1.6328],
        [-1.2188,  1.1328,  1.5469,  1.5781]], requires_grad=True)
2024-10-08 15:06:07,496 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 3.0312, -1.2109, -1.1094, -0.4434],
        [-0.3594,  0.6172,  1.8594,  1.8281],
        [ 0.9453,  0.3027,  0.4238,  0.9609],
        ...,
        [-0.6797, -0.9102, -0.9805, -2.0625],
        [ 0.7266,  0.4277,  0.7891,  1.5469],
        [-1.1719,  1.0391,  1.5000,  1.5156]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 3.0312, -1.2109, -1.1094, -0.4434],
        [-0.3594,  0.6172,  1.8594,  1.8281],
        [ 0.9453,  0.3027,  0.4238,  0.9609],
        ...,
        [-0.6797, -0.9102, -0.9805, -2.0625],
        [ 0.7266,  0.4277,  0.7891,  1.5469],
        [-1.1719,  1.0391,  1.5000,  1.5156]], requires_grad=True)
2024-10-08 15:06:07,760 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 3.0469, -1.3750, -1.1016, -0.4570],
        [-0.3027,  0.6094,  1.8281,  1.8125],
        [ 0.9141,  0.3516,  0.4277,  0.9570],
        ...,
        [-0.6523, -0.8203, -0.9453, -1.9766],
        [ 0.6523,  0.4434,  0.7852,  1.4922],
        [-1.1953,  1.2422,  1.5078,  1.5156]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 3.0469, -1.3750, -1.1016, -0.4570],
        [-0.3027,  0.6094,  1.8281,  1.8125],
        [ 0.9141,  0.3516,  0.4277,  0.9570],
        ...,
        [-0.6523, -0.8203, -0.9453, -1.9766],
        [ 0.6523,  0.4434,  0.7852,  1.4922],
        [-1.1953,  1.2422,  1.5078,  1.5156]], requires_grad=True)
2024-10-08 15:06:08,023 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 3.0625, -1.5234, -1.0938, -0.4688],
        [-0.2812,  0.6406,  1.8047,  1.7969],
        [ 0.8555,  0.4160,  0.4355,  0.9531],
        ...,
        [-0.5508, -0.8203, -0.9375, -1.8906],
        [ 0.5312,  0.5156,  0.7930,  1.4375],
        [-1.2344,  1.4531,  1.5156,  1.5156]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 3.0625, -1.5234, -1.0938, -0.4688],
        [-0.2812,  0.6406,  1.8047,  1.7969],
        [ 0.8555,  0.4160,  0.4355,  0.9531],
        ...,
        [-0.5508, -0.8203, -0.9375, -1.8906],
        [ 0.5312,  0.5156,  0.7930,  1.4375],
        [-1.2344,  1.4531,  1.5156,  1.5156]], requires_grad=True)
2024-10-08 15:06:08,277 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 2.8906, -1.5000, -1.0312, -0.4473],
        [-0.0859,  0.4277,  1.6875,  1.7578],
        [ 0.9141,  0.3809,  0.4062,  0.9336],
        ...,
        [-0.6523, -0.6289, -0.8516, -1.8047],
        [ 0.5859,  0.4062,  0.7344,  1.3672],
        [-1.2734,  1.6484,  1.5234,  1.5078]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 2.8906, -1.5000, -1.0312, -0.4473],
        [-0.0859,  0.4277,  1.6875,  1.7578],
        [ 0.9141,  0.3809,  0.4062,  0.9336],
        ...,
        [-0.6523, -0.6289, -0.8516, -1.8047],
        [ 0.5859,  0.4062,  0.7344,  1.3672],
        [-1.2734,  1.6484,  1.5234,  1.5078]], requires_grad=True)
2024-10-08 15:06:08,531 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 2.6250, -1.3984, -0.9414, -0.4141],
        [ 0.1797,  0.1426,  1.5391,  1.7188],
        [ 1.0312,  0.3105,  0.3652,  0.9141],
        ...,
        [-0.8711, -0.3516, -0.7227, -1.7188],
        [ 0.7188,  0.2285,  0.6445,  1.2969],
        [-1.1250,  1.6406,  1.4609,  1.5078]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 2.6250, -1.3984, -0.9414, -0.4141],
        [ 0.1797,  0.1426,  1.5391,  1.7188],
        [ 1.0312,  0.3105,  0.3652,  0.9141],
        ...,
        [-0.8711, -0.3516, -0.7227, -1.7188],
        [ 0.7188,  0.2285,  0.6445,  1.2969],
        [-1.1250,  1.6406,  1.4609,  1.5078]], requires_grad=True)
2024-10-08 15:06:08,681 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 2.3438, -1.2188, -0.8164, -0.3418],
        [ 0.4512,  0.0220,  1.4766,  1.7578],
        [ 1.0938,  0.3086,  0.3574,  0.9141],
        ...,
        [-1.0078, -0.1807, -0.6445, -1.6484],
        [ 0.7852,  0.2754,  0.6719,  1.3203],
        [-1.0156,  1.6250,  1.3984,  1.4922]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 2.3438, -1.2188, -0.8164, -0.3418],
        [ 0.4512,  0.0220,  1.4766,  1.7578],
        [ 1.0938,  0.3086,  0.3574,  0.9141],
        ...,
        [-1.0078, -0.1807, -0.6445, -1.6484],
        [ 0.7852,  0.2754,  0.6719,  1.3203],
        [-1.0156,  1.6250,  1.3984,  1.4922]], requires_grad=True)
2024-10-08 15:06:08,936 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 2.0938, -1.0469, -0.6992, -0.2734],
        [ 0.5781,  0.0806,  1.5000,  1.8047],
        [ 1.1797,  0.2812,  0.3398,  0.9102],
        ...,
        [-1.0234, -0.1680, -0.6484, -1.6094],
        [ 0.7891,  0.3926,  0.7305,  1.3516],
        [-0.9375,  1.6328,  1.3516,  1.4688]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 2.0938, -1.0469, -0.6992, -0.2734],
        [ 0.5781,  0.0806,  1.5000,  1.8047],
        [ 1.1797,  0.2812,  0.3398,  0.9102],
        ...,
        [-1.0234, -0.1680, -0.6484, -1.6094],
        [ 0.7891,  0.3926,  0.7305,  1.3516],
        [-0.9375,  1.6328,  1.3516,  1.4688]], requires_grad=True)
2024-10-08 15:06:09,089 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 1.9219, -0.9375, -0.6172, -0.2148],
        [ 0.7344,  0.0894,  1.5000,  1.8438],
        [ 1.4453,  0.1484,  0.2754,  0.9219],
        ...,
        [-1.2344,  0.0088, -0.5703, -1.5859],
        [ 0.8906,  0.4004,  0.7344,  1.3750],
        [-0.7070,  1.4922,  1.2578,  1.4766]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 1.9219, -0.9375, -0.6172, -0.2148],
        [ 0.7344,  0.0894,  1.5000,  1.8438],
        [ 1.4453,  0.1484,  0.2754,  0.9219],
        ...,
        [-1.2344,  0.0088, -0.5703, -1.5859],
        [ 0.8906,  0.4004,  0.7344,  1.3750],
        [-0.7070,  1.4922,  1.2578,  1.4766]], requires_grad=True)
2024-10-08 15:06:09,347 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 1.7812, -0.8633, -0.5508, -0.1699],
        [ 0.9336, -0.0223,  1.4375,  1.8672],
        [ 1.7109, -0.0251,  0.1904,  0.9141],
        ...,
        [-1.5156,  0.2969, -0.4355, -1.5469],
        [ 1.0156,  0.3359,  0.7031,  1.3828],
        [-0.4102,  1.1797,  1.0938,  1.4531]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 1.7812, -0.8633, -0.5508, -0.1699],
        [ 0.9336, -0.0223,  1.4375,  1.8672],
        [ 1.7109, -0.0251,  0.1904,  0.9141],
        ...,
        [-1.5156,  0.2969, -0.4355, -1.5469],
        [ 1.0156,  0.3359,  0.7031,  1.3828],
        [-0.4102,  1.1797,  1.0938,  1.4531]], requires_grad=True)
2024-10-08 15:06:09,501 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 1.6641, -0.8047, -0.4980, -0.1328],
        [ 1.0547, -0.0383,  1.4141,  1.8828],
        [ 1.9062, -0.1553,  0.1230,  0.9023],
        ...,
        [-1.7109,  0.5039, -0.3320, -1.5000],
        [ 1.0625,  0.3340,  0.6992,  1.3828],
        [-0.1836,  0.9609,  0.9727,  1.4297]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 1.6641, -0.8047, -0.4980, -0.1328],
        [ 1.0547, -0.0383,  1.4141,  1.8828],
        [ 1.9062, -0.1553,  0.1230,  0.9023],
        ...,
        [-1.7109,  0.5039, -0.3320, -1.5000],
        [ 1.0625,  0.3340,  0.6992,  1.3828],
        [-0.1836,  0.9609,  0.9727,  1.4297]], requires_grad=True)
2024-10-08 15:06:09,752 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 1.5859, -0.7227, -0.4336, -0.0742],
        [ 1.1406,  0.0674,  1.4453,  1.9219],
        [ 2.0469, -0.2090,  0.0874,  0.8984],
        ...,
        [-1.8047,  0.5195, -0.3125, -1.4844],
        [ 1.0469,  0.4531,  0.7383,  1.3984],
        [-0.0236,  0.9258,  0.9180,  1.4453]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 1.5859, -0.7227, -0.4336, -0.0742],
        [ 1.1406,  0.0674,  1.4453,  1.9219],
        [ 2.0469, -0.2090,  0.0874,  0.8984],
        ...,
        [-1.8047,  0.5195, -0.3125, -1.4844],
        [ 1.0469,  0.4531,  0.7383,  1.3984],
        [-0.0236,  0.9258,  0.9180,  1.4453]], requires_grad=True)
2024-10-08 15:06:10,006 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 1.4922, -0.6172, -0.3691, -0.0303],
        [ 1.2188,  0.1494,  1.4609,  1.9531],
        [ 2.1719, -0.2832,  0.0483,  0.8984],
        ...,
        [-1.9219,  0.5742, -0.2812, -1.4766],
        [ 1.0469,  0.5078,  0.7539,  1.4141],
        [ 0.1406,  0.8281,  0.8516,  1.4609]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 1.4922, -0.6172, -0.3691, -0.0303],
        [ 1.2188,  0.1494,  1.4609,  1.9531],
        [ 2.1719, -0.2832,  0.0483,  0.8984],
        ...,
        [-1.9219,  0.5742, -0.2812, -1.4766],
        [ 1.0469,  0.5078,  0.7539,  1.4141],
        [ 0.1406,  0.8281,  0.8516,  1.4609]], requires_grad=True)
2024-10-08 15:06:10,276 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 1.4219, -0.5234, -0.3105,  0.0170],
        [ 1.2891,  0.1875,  1.4609,  1.9766],
        [ 2.2656, -0.3555,  0.0107,  0.8984],
        ...,
        [-2.0156,  0.6680, -0.2383, -1.4766],
        [ 1.0312,  0.5195,  0.7539,  1.4297],
        [ 0.2910,  0.7422,  0.7891,  1.4766]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 1.4219, -0.5234, -0.3105,  0.0170],
        [ 1.2891,  0.1875,  1.4609,  1.9766],
        [ 2.2656, -0.3555,  0.0107,  0.8984],
        ...,
        [-2.0156,  0.6680, -0.2383, -1.4766],
        [ 1.0312,  0.5195,  0.7539,  1.4297],
        [ 0.2910,  0.7422,  0.7891,  1.4766]], requires_grad=True)
2024-10-08 15:06:10,432 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 1.3906, -0.4082, -0.2480,  0.0549],
        [ 1.3438,  0.1973,  1.4453,  2.0000],
        [ 2.3438, -0.4453, -0.0310,  0.9023],
        ...,
        [-2.0781,  0.7656, -0.1934, -1.4766],
        [ 0.9961,  0.4980,  0.7422,  1.4453],
        [ 0.4238,  0.6680,  0.7344,  1.4844]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 1.3906, -0.4082, -0.2480,  0.0549],
        [ 1.3438,  0.1973,  1.4453,  2.0000],
        [ 2.3438, -0.4453, -0.0310,  0.9023],
        ...,
        [-2.0781,  0.7656, -0.1934, -1.4766],
        [ 0.9961,  0.4980,  0.7422,  1.4453],
        [ 0.4238,  0.6680,  0.7344,  1.4844]], requires_grad=True)
2024-10-08 15:06:10,590 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 1.3516, -0.3203, -0.1963,  0.0889],
        [ 1.3906,  0.1885,  1.4219,  2.0156],
        [ 2.4062, -0.5000, -0.0615,  0.8984],
        ...,
        [-2.1094,  0.7539, -0.1797, -1.4453],
        [ 0.9414,  0.5742,  0.7539,  1.4141],
        [ 0.5391,  0.7031,  0.7070,  1.4688]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 1.3516, -0.3203, -0.1963,  0.0889],
        [ 1.3906,  0.1885,  1.4219,  2.0156],
        [ 2.4062, -0.5000, -0.0615,  0.8984],
        ...,
        [-2.1094,  0.7539, -0.1797, -1.4453],
        [ 0.9414,  0.5742,  0.7539,  1.4141],
        [ 0.5391,  0.7031,  0.7070,  1.4688]], requires_grad=True)
2024-10-08 15:06:10,753 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 1.2969, -0.2695, -0.1621,  0.1270],
        [ 1.4688,  0.2305,  1.4375,  2.0312],
        [ 2.4531, -0.5273, -0.0786,  0.8867],
        ...,
        [-2.1406,  0.7188, -0.1777, -1.4062],
        [ 0.8984,  0.6406,  0.7617,  1.3906],
        [ 0.6523,  0.7695,  0.6992,  1.4453]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 1.2969, -0.2695, -0.1621,  0.1270],
        [ 1.4688,  0.2305,  1.4375,  2.0312],
        [ 2.4531, -0.5273, -0.0786,  0.8867],
        ...,
        [-2.1406,  0.7188, -0.1777, -1.4062],
        [ 0.8984,  0.6406,  0.7617,  1.3906],
        [ 0.6523,  0.7695,  0.6992,  1.4453]], requires_grad=True)
2024-10-08 15:06:11,011 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 1.3125, -0.1992, -0.1147,  0.1592],
        [ 1.4453,  0.2090,  1.3906,  2.0312],
        [ 2.4219, -0.5781, -0.1143,  0.8750],
        ...,
        [-2.1094,  0.7148, -0.1533, -1.3750],
        [ 0.8281,  0.6797,  0.7578,  1.3672],
        [ 0.7383,  0.8164,  0.6875,  1.4141]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 1.3125, -0.1992, -0.1147,  0.1592],
        [ 1.4453,  0.2090,  1.3906,  2.0312],
        [ 2.4219, -0.5781, -0.1143,  0.8750],
        ...,
        [-2.1094,  0.7148, -0.1533, -1.3750],
        [ 0.8281,  0.6797,  0.7578,  1.3672],
        [ 0.7383,  0.8164,  0.6875,  1.4141]], requires_grad=True)
2024-10-08 15:06:11,279 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 1.3281, -0.1387, -0.0732,  0.1914],
        [ 1.3828,  0.1445,  1.3125,  2.0312],
        [ 2.3281, -0.6680, -0.1777,  0.8750],
        ...,
        [-2.0781,  0.7148, -0.1289, -1.3438],
        [ 0.7461,  0.6953,  0.7383,  1.3438],
        [ 0.7617,  0.7812,  0.6250,  1.4141]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 1.3281, -0.1387, -0.0732,  0.1914],
        [ 1.3828,  0.1445,  1.3125,  2.0312],
        [ 2.3281, -0.6680, -0.1777,  0.8750],
        ...,
        [-2.0781,  0.7148, -0.1289, -1.3438],
        [ 0.7461,  0.6953,  0.7383,  1.3438],
        [ 0.7617,  0.7812,  0.6250,  1.4141]], requires_grad=True)
2024-10-08 15:06:11,692 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 1.3438, -0.0723, -0.0273,  0.2178],
        [ 1.3438,  0.1377,  1.2812,  2.0156],
        [ 2.2344, -0.7461, -0.2354,  0.8711],
        ...,
        [-2.0469,  0.7227, -0.0996, -1.3125],
        [ 0.6641,  0.6992,  0.7109,  1.3203],
        [ 0.7773,  0.7500,  0.5664,  1.4062]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 1.3438, -0.0723, -0.0273,  0.2178],
        [ 1.3438,  0.1377,  1.2812,  2.0156],
        [ 2.2344, -0.7461, -0.2354,  0.8711],
        ...,
        [-2.0469,  0.7227, -0.0996, -1.3125],
        [ 0.6641,  0.6992,  0.7109,  1.3203],
        [ 0.7773,  0.7500,  0.5664,  1.4062]], requires_grad=True)
2024-10-08 15:06:11,848 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 1.2188,  0.0293,  0.0391,  0.1787],
        [ 1.3516,  0.1768,  1.3047,  2.0312],
        [ 2.2031, -0.8008, -0.2715,  0.8828],
        ...,
        [-2.0469,  0.7031, -0.0991, -1.2969],
        [ 0.6133,  0.6992,  0.6875,  1.3047],
        [ 0.7852,  0.7461,  0.5391,  1.3906]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 1.2188,  0.0293,  0.0391,  0.1787],
        [ 1.3516,  0.1768,  1.3047,  2.0312],
        [ 2.2031, -0.8008, -0.2715,  0.8828],
        ...,
        [-2.0469,  0.7031, -0.0991, -1.2969],
        [ 0.6133,  0.6992,  0.6875,  1.3047],
        [ 0.7852,  0.7461,  0.5391,  1.3906]], requires_grad=True)
2024-10-08 15:06:12,100 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 1.0781,  0.1289,  0.1055,  0.1348],
        [ 1.3672,  0.3105,  1.4297,  2.0469],
        [ 2.1562, -0.8281, -0.2812,  0.8906],
        ...,
        [-2.0312,  0.6328, -0.1611, -1.2812],
        [ 0.5508,  0.7227,  0.6914,  1.2812],
        [ 0.7969,  0.8438,  0.6055,  1.3906]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 1.0781,  0.1289,  0.1055,  0.1348],
        [ 1.3672,  0.3105,  1.4297,  2.0469],
        [ 2.1562, -0.8281, -0.2812,  0.8906],
        ...,
        [-2.0312,  0.6328, -0.1611, -1.2812],
        [ 0.5508,  0.7227,  0.6914,  1.2812],
        [ 0.7969,  0.8438,  0.6055,  1.3906]], requires_grad=True)
2024-10-08 15:06:12,357 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 1.0000,  0.1748,  0.1230,  0.1113],
        [ 1.3906,  0.4121,  1.5234,  2.0625],
        [ 2.1250, -0.8594, -0.2988,  0.8984],
        ...,
        [-2.0000,  0.5430, -0.2402, -1.2578],
        [ 0.4824,  0.7617,  0.7188,  1.2500],
        [ 0.8164,  0.9219,  0.6523,  1.3906]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 1.0000,  0.1748,  0.1230,  0.1113],
        [ 1.3906,  0.4121,  1.5234,  2.0625],
        [ 2.1250, -0.8594, -0.2988,  0.8984],
        ...,
        [-2.0000,  0.5430, -0.2402, -1.2578],
        [ 0.4824,  0.7617,  0.7188,  1.2500],
        [ 0.8164,  0.9219,  0.6523,  1.3906]], requires_grad=True)
2024-10-08 15:06:12,617 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 0.8477,  0.1924,  0.1064,  0.0518],
        [ 1.4531,  0.4941,  1.5938,  2.0781],
        [ 2.1250, -0.8789, -0.3086,  0.9180],
        ...,
        [-1.9766,  0.5234, -0.2363, -1.2266],
        [ 0.4941,  0.7422,  0.6836,  1.2422],
        [ 0.8359,  0.9219,  0.6250,  1.3750]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 0.8477,  0.1924,  0.1064,  0.0518],
        [ 1.4531,  0.4941,  1.5938,  2.0781],
        [ 2.1250, -0.8789, -0.3086,  0.9180],
        ...,
        [-1.9766,  0.5234, -0.2363, -1.2266],
        [ 0.4941,  0.7422,  0.6836,  1.2422],
        [ 0.8359,  0.9219,  0.6250,  1.3750]], requires_grad=True)
2024-10-08 15:06:12,781 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 0.7266,  0.2051,  0.0908,  0.0060],
        [ 1.4922,  0.5625,  1.6562,  2.0938],
        [ 2.1094, -0.8789, -0.2949,  0.9258],
        ...,
        [-1.9219,  0.4609, -0.2969, -1.1875],
        [ 0.4824,  0.7422,  0.6836,  1.2266],
        [ 0.8398,  0.9453,  0.6328,  1.3516]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 0.7266,  0.2051,  0.0908,  0.0060],
        [ 1.4922,  0.5625,  1.6562,  2.0938],
        [ 2.1094, -0.8789, -0.2949,  0.9258],
        ...,
        [-1.9219,  0.4609, -0.2969, -1.1875],
        [ 0.4824,  0.7422,  0.6836,  1.2266],
        [ 0.8398,  0.9453,  0.6328,  1.3516]], requires_grad=True)
2024-10-08 15:06:13,057 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 0.6250,  0.2139,  0.0737, -0.0315],
        [ 1.5234,  0.6523,  1.7578,  2.0938],
        [ 2.0781, -0.8672, -0.2637,  0.9297],
        ...,
        [-1.8594,  0.3750, -0.4043, -1.1406],
        [ 0.4609,  0.7695,  0.7266,  1.2031],
        [ 0.8320,  0.9883,  0.6719,  1.3203]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 0.6250,  0.2139,  0.0737, -0.0315],
        [ 1.5234,  0.6523,  1.7578,  2.0938],
        [ 2.0781, -0.8672, -0.2637,  0.9297],
        ...,
        [-1.8594,  0.3750, -0.4043, -1.1406],
        [ 0.4609,  0.7695,  0.7266,  1.2031],
        [ 0.8320,  0.9883,  0.6719,  1.3203]], requires_grad=True)
2024-10-08 15:06:13,214 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 0.5625,  0.2949,  0.1973, -0.0596],
        [ 1.5234,  0.6836,  1.7422,  2.0781],
        [ 2.0312, -0.8750, -0.2812,  0.9297],
        ...,
        [-1.7812,  0.3145, -0.4609, -1.1016],
        [ 0.4258,  0.7148,  0.6133,  1.1719],
        [ 0.8359,  0.9805,  0.6367,  1.3047]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 0.5625,  0.2949,  0.1973, -0.0596],
        [ 1.5234,  0.6836,  1.7422,  2.0781],
        [ 2.0312, -0.8750, -0.2812,  0.9297],
        ...,
        [-1.7812,  0.3145, -0.4609, -1.1016],
        [ 0.4258,  0.7148,  0.6133,  1.1719],
        [ 0.8359,  0.9805,  0.6367,  1.3047]], requires_grad=True)
2024-10-08 15:06:13,634 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 0.4941,  0.3633,  0.2988, -0.0874],
        [ 1.5391,  0.6680,  1.6328,  2.0625],
        [ 1.9922, -0.8867, -0.3105,  0.9258],
        ...,
        [-1.6875,  0.2344, -0.5664, -1.0547],
        [ 0.3848,  0.6641,  0.5078,  1.1406],
        [ 0.8438,  0.9414,  0.5547,  1.2891]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 0.4941,  0.3633,  0.2988, -0.0874],
        [ 1.5391,  0.6680,  1.6328,  2.0625],
        [ 1.9922, -0.8867, -0.3105,  0.9258],
        ...,
        [-1.6875,  0.2344, -0.5664, -1.0547],
        [ 0.3848,  0.6641,  0.5078,  1.1406],
        [ 0.8438,  0.9414,  0.5547,  1.2891]], requires_grad=True)
2024-10-08 15:06:13,901 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 0.4277,  0.4219,  0.3867, -0.1128],
        [ 1.4844,  0.7891,  1.8359,  2.0312],
        [ 1.9297, -0.8711, -0.2871,  0.9141],
        ...,
        [-1.5781,  0.1211, -0.7500, -0.9961],
        [ 0.3164,  0.6523,  0.4844,  1.1016],
        [ 0.8320,  0.9727,  0.6016,  1.2656]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 0.4277,  0.4219,  0.3867, -0.1128],
        [ 1.4844,  0.7891,  1.8359,  2.0312],
        [ 1.9297, -0.8711, -0.2871,  0.9141],
        ...,
        [-1.5781,  0.1211, -0.7500, -0.9961],
        [ 0.3164,  0.6523,  0.4844,  1.1016],
        [ 0.8320,  0.9727,  0.6016,  1.2656]], requires_grad=True)
2024-10-08 15:06:14,060 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 0.3711,  0.4590,  0.4355, -0.1318],
        [ 1.4219,  0.8906,  2.0000,  1.9844],
        [ 1.8750, -0.8633, -0.2852,  0.8984],
        ...,
        [-1.4844,  0.0693, -0.8086, -0.9453],
        [ 0.2578,  0.6211,  0.4316,  1.0625],
        [ 0.8164,  0.9961,  0.6328,  1.2344]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 0.3711,  0.4590,  0.4355, -0.1318],
        [ 1.4219,  0.8906,  2.0000,  1.9844],
        [ 1.8750, -0.8633, -0.2852,  0.8984],
        ...,
        [-1.4844,  0.0693, -0.8086, -0.9453],
        [ 0.2578,  0.6211,  0.4316,  1.0625],
        [ 0.8164,  0.9961,  0.6328,  1.2344]], requires_grad=True)
2024-10-08 15:06:14,216 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 0.3203,  0.4863,  0.4727, -0.1465],
        [ 1.3672,  0.8320,  1.8594,  1.9531],
        [ 1.8125, -0.8945, -0.3535,  0.8867],
        ...,
        [-1.3984,  0.1025, -0.7070, -0.9141],
        [ 0.2041,  0.5508,  0.3105,  1.0312],
        [ 0.8008,  0.9219,  0.5195,  1.2188]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 0.3203,  0.4863,  0.4727, -0.1465],
        [ 1.3672,  0.8320,  1.8594,  1.9531],
        [ 1.8125, -0.8945, -0.3535,  0.8867],
        ...,
        [-1.3984,  0.1025, -0.7070, -0.9141],
        [ 0.2041,  0.5508,  0.3105,  1.0312],
        [ 0.8008,  0.9219,  0.5195,  1.2188]], requires_grad=True)
2024-10-08 15:06:14,482 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 0.2852,  0.5039,  0.4980, -0.1543],
        [ 1.2891,  0.8516,  1.8516,  1.8984],
        [ 1.7422, -0.9141, -0.4082,  0.8711],
        ...,
        [-1.2891,  0.0388, -0.7734, -0.8516],
        [ 0.1367,  0.5391,  0.2832,  0.9766],
        [ 0.7930,  0.8555,  0.4199,  1.2109]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 0.2852,  0.5039,  0.4980, -0.1543],
        [ 1.2891,  0.8516,  1.8516,  1.8984],
        [ 1.7422, -0.9141, -0.4082,  0.8711],
        ...,
        [-1.2891,  0.0388, -0.7734, -0.8516],
        [ 0.1367,  0.5391,  0.2832,  0.9766],
        [ 0.7930,  0.8555,  0.4199,  1.2109]], requires_grad=True)
2024-10-08 15:06:14,736 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 0.1328,  0.4648,  0.4434, -0.2070],
        [ 1.1172,  0.9023,  1.8906,  1.7969],
        [ 1.7266, -0.8594, -0.3535,  0.8711],
        ...,
        [-1.1484, -0.0996, -0.9492, -0.7695],
        [ 0.1138,  0.5977,  0.3555,  0.9375],
        [ 0.7461,  0.8242,  0.3691,  1.1719]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 0.1328,  0.4648,  0.4434, -0.2070],
        [ 1.1172,  0.9023,  1.8906,  1.7969],
        [ 1.7266, -0.8594, -0.3535,  0.8711],
        ...,
        [-1.1484, -0.0996, -0.9492, -0.7695],
        [ 0.1138,  0.5977,  0.3555,  0.9375],
        [ 0.7461,  0.8242,  0.3691,  1.1719]], requires_grad=True)
2024-10-08 15:06:14,989 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.0571,  0.4766,  0.4434, -0.2754],
        [ 1.0625,  0.8086,  1.7578,  1.7422],
        [ 1.7344, -0.8359, -0.3359,  0.8750],
        ...,
        [-1.0547, -0.1934, -1.0703, -0.7070],
        [ 0.1201,  0.6250,  0.3984,  0.9141],
        [ 0.7344,  0.7266,  0.2480,  1.1562]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.0571,  0.4766,  0.4434, -0.2754],
        [ 1.0625,  0.8086,  1.7578,  1.7422],
        [ 1.7344, -0.8359, -0.3359,  0.8750],
        ...,
        [-1.0547, -0.1934, -1.0703, -0.7070],
        [ 0.1201,  0.6250,  0.3984,  0.9141],
        [ 0.7344,  0.7266,  0.2480,  1.1562]], requires_grad=True)
2024-10-08 15:06:15,243 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.1699,  0.4453,  0.4082, -0.3223],
        [ 1.0234,  0.7266,  1.6328,  1.6953],
        [ 1.7578, -0.8359, -0.3379,  0.8789],
        ...,
        [-1.0078, -0.2432, -1.1406, -0.6641],
        [ 0.1553,  0.6250,  0.4141,  0.8984],
        [ 0.7148,  0.6211,  0.1299,  1.1328]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.1699,  0.4453,  0.4082, -0.3223],
        [ 1.0234,  0.7266,  1.6328,  1.6953],
        [ 1.7578, -0.8359, -0.3379,  0.8789],
        ...,
        [-1.0078, -0.2432, -1.1406, -0.6641],
        [ 0.1553,  0.6250,  0.4141,  0.8984],
        [ 0.7148,  0.6211,  0.1299,  1.1328]], requires_grad=True)
2024-10-08 15:06:15,393 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.2891,  0.4473,  0.3965, -0.3574],
        [ 1.0078,  0.5742,  1.4609,  1.6328],
        [ 1.7812, -0.8555, -0.3535,  0.8750],
        ...,
        [-0.9453, -0.2891, -1.2031, -0.6133],
        [ 0.1797,  0.6055,  0.4160,  0.8750],
        [ 0.6992,  0.5078,  0.0130,  1.1016]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.2891,  0.4473,  0.3965, -0.3574],
        [ 1.0078,  0.5742,  1.4609,  1.6328],
        [ 1.7812, -0.8555, -0.3535,  0.8750],
        ...,
        [-0.9453, -0.2891, -1.2031, -0.6133],
        [ 0.1797,  0.6055,  0.4160,  0.8750],
        [ 0.6992,  0.5078,  0.0130,  1.1016]], requires_grad=True)
2024-10-08 15:06:15,648 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.3770,  0.4414,  0.3828, -0.3848],
        [ 1.0000,  0.4238,  1.2969,  1.5781],
        [ 1.7500, -0.8398, -0.3555,  0.8672],
        ...,
        [-0.8242, -0.3789, -1.2734, -0.5586],
        [ 0.1436,  0.6289,  0.4336,  0.8438],
        [ 0.6523,  0.4414, -0.0757,  1.0703]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.3770,  0.4414,  0.3828, -0.3848],
        [ 1.0000,  0.4238,  1.2969,  1.5781],
        [ 1.7500, -0.8398, -0.3555,  0.8672],
        ...,
        [-0.8242, -0.3789, -1.2734, -0.5586],
        [ 0.1436,  0.6289,  0.4336,  0.8438],
        [ 0.6523,  0.4414, -0.0757,  1.0703]], requires_grad=True)
2024-10-08 15:06:15,902 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.3340,  0.3691,  0.3555, -0.4043],
        [ 0.7734,  0.5352,  1.2188,  1.5312],
        [ 1.6016, -0.7539, -0.3379,  0.8555],
        ...,
        [-0.5039, -0.6367, -1.3750, -0.5156],
        [ 0.0330,  0.7031,  0.4590,  0.8086],
        [ 0.5234,  0.4746, -0.1338,  1.0391]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.3340,  0.3691,  0.3555, -0.4043],
        [ 0.7734,  0.5352,  1.2188,  1.5312],
        [ 1.6016, -0.7539, -0.3379,  0.8555],
        ...,
        [-0.5039, -0.6367, -1.3750, -0.5156],
        [ 0.0330,  0.7031,  0.4590,  0.8086],
        [ 0.5234,  0.4746, -0.1338,  1.0391]], requires_grad=True)
2024-10-08 15:06:16,166 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.3828,  0.3711,  0.3535, -0.4102],
        [ 0.6836,  0.4473,  1.0703,  1.4531],
        [ 1.4844, -0.6875, -0.3262,  0.8398],
        ...,
        [-0.2930, -0.7656, -1.4297, -0.4590],
        [ 0.0270,  0.6562,  0.4375,  0.7539],
        [ 0.4121,  0.4961, -0.1865,  1.0000]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.3828,  0.3711,  0.3535, -0.4102],
        [ 0.6836,  0.4473,  1.0703,  1.4531],
        [ 1.4844, -0.6875, -0.3262,  0.8398],
        ...,
        [-0.2930, -0.7656, -1.4297, -0.4590],
        [ 0.0270,  0.6562,  0.4375,  0.7539],
        [ 0.4121,  0.4961, -0.1865,  1.0000]], requires_grad=True)
2024-10-08 15:06:16,425 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.4922,  0.4824,  0.4062, -0.3730],
        [ 0.6016,  0.3652,  0.9375,  1.3750],
        [ 1.4297, -0.7148, -0.3613,  0.8008],
        ...,
        [-0.1543, -0.7891, -1.4219, -0.3867],
        [ 0.0562,  0.5312,  0.3750,  0.6758],
        [ 0.3301,  0.4805, -0.2461,  0.9570]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.4922,  0.4824,  0.4062, -0.3730],
        [ 0.6016,  0.3652,  0.9375,  1.3750],
        [ 1.4297, -0.7148, -0.3613,  0.8008],
        ...,
        [-0.1543, -0.7891, -1.4219, -0.3867],
        [ 0.0562,  0.5312,  0.3750,  0.6758],
        [ 0.3301,  0.4805, -0.2461,  0.9570]], requires_grad=True)
2024-10-08 15:06:16,677 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.5234,  0.5312,  0.4219, -0.3359],
        [ 0.7227,  0.0150,  0.6211,  1.3203],
        [ 1.3750, -0.7344, -0.3867,  0.7617],
        ...,
        [-0.1113, -0.7656, -1.3828, -0.3418],
        [ 0.3418,  0.1992,  0.1816,  0.6484],
        [ 0.2676,  0.4062, -0.3359,  0.9023]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.5234,  0.5312,  0.4219, -0.3359],
        [ 0.7227,  0.0150,  0.6211,  1.3203],
        [ 1.3750, -0.7344, -0.3867,  0.7617],
        ...,
        [-0.1113, -0.7656, -1.3828, -0.3418],
        [ 0.3418,  0.1992,  0.1816,  0.6484],
        [ 0.2676,  0.4062, -0.3359,  0.9023]], requires_grad=True)
2024-10-08 15:06:16,942 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.4922,  0.5312,  0.4043, -0.2910],
        [ 0.7461, -0.1895,  0.4316,  1.2500],
        [ 1.3047, -0.7344, -0.3984,  0.7227],
        ...,
        [ 0.0737, -0.8789, -1.4688, -0.2715],
        [ 0.5352, -0.0371,  0.0554,  0.6094],
        [ 0.2100,  0.3496, -0.4062,  0.8555]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.4922,  0.5312,  0.4043, -0.2910],
        [ 0.7461, -0.1895,  0.4316,  1.2500],
        [ 1.3047, -0.7344, -0.3984,  0.7227],
        ...,
        [ 0.0737, -0.8789, -1.4688, -0.2715],
        [ 0.5352, -0.0371,  0.0554,  0.6094],
        [ 0.2100,  0.3496, -0.4062,  0.8555]], requires_grad=True)
2024-10-08 15:06:17,193 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.1138,  0.2451,  0.1348, -0.1748],
        [ 0.4004,  0.3613,  1.0156,  1.1094],
        [ 1.0703, -0.5898, -0.2734,  0.6484],
        ...,
        [ 0.4316, -1.4609, -2.0469, -0.2080],
        [ 0.8008,  0.1572,  0.3535,  0.6953],
        [ 0.0747,  0.8477,  0.0162,  0.8555]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.1138,  0.2451,  0.1348, -0.1748],
        [ 0.4004,  0.3613,  1.0156,  1.1094],
        [ 1.0703, -0.5898, -0.2734,  0.6484],
        ...,
        [ 0.4316, -1.4609, -2.0469, -0.2080],
        [ 0.8008,  0.1572,  0.3535,  0.6953],
        [ 0.0747,  0.8477,  0.0162,  0.8555]], requires_grad=True)
2024-10-08 15:06:17,459 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 0.1104, -0.1924, -0.2930, -0.1187],
        [ 0.2617,  0.7305,  1.4219,  1.0703],
        [ 0.9414, -0.4414, -0.1377,  0.6133],
        ...,
        [ 0.5352, -2.0625, -2.6719, -0.2432],
        [ 1.1250,  0.4805,  0.7812,  0.8164],
        [ 0.0242,  1.4141,  0.5234,  0.8906]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 0.1104, -0.1924, -0.2930, -0.1187],
        [ 0.2617,  0.7305,  1.4219,  1.0703],
        [ 0.9414, -0.4414, -0.1377,  0.6133],
        ...,
        [ 0.5352, -2.0625, -2.6719, -0.2432],
        [ 1.1250,  0.4805,  0.7812,  0.8164],
        [ 0.0242,  1.4141,  0.5234,  0.8906]], requires_grad=True)
2024-10-08 15:06:17,618 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.0133, -0.4121, -0.5156, -0.2617],
        [ 0.7031,  0.5391,  1.2266,  1.4375],
        [ 1.0703, -0.4590, -0.1699,  0.7188],
        ...,
        [ 0.3262, -2.3594, -2.9688, -0.4629],
        [ 1.6172,  0.5586,  0.9453,  1.0859],
        [ 0.2109,  1.6953,  0.7734,  1.1094]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.0133, -0.4121, -0.5156, -0.2617],
        [ 0.7031,  0.5391,  1.2266,  1.4375],
        [ 1.0703, -0.4590, -0.1699,  0.7188],
        ...,
        [ 0.3262, -2.3594, -2.9688, -0.4629],
        [ 1.6172,  0.5586,  0.9453,  1.0859],
        [ 0.2109,  1.6953,  0.7734,  1.1094]], requires_grad=True)
2024-10-08 15:06:17,886 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.1260, -0.6289, -0.7148, -0.3848],
        [ 1.0781,  0.3223,  1.0234,  1.7500],
        [ 1.1719, -0.4863, -0.2021,  0.8086],
        ...,
        [ 0.1436, -2.6094, -3.2188, -0.6523],
        [ 2.0469,  0.6055,  1.0781,  1.3125],
        [ 0.3750,  1.8906,  0.9766,  1.3047]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.1260, -0.6289, -0.7148, -0.3848],
        [ 1.0781,  0.3223,  1.0234,  1.7500],
        [ 1.1719, -0.4863, -0.2021,  0.8086],
        ...,
        [ 0.1436, -2.6094, -3.2188, -0.6523],
        [ 2.0469,  0.6055,  1.0781,  1.3125],
        [ 0.3750,  1.8906,  0.9766,  1.3047]], requires_grad=True)
2024-10-08 15:06:18,038 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.2256, -0.8125, -0.8906, -0.4902],
        [ 1.4062,  0.0928,  0.9141,  1.9922],
        [ 1.2656, -0.5234, -0.2080,  0.8750],
        ...,
        [-0.0070, -2.8281, -3.4219, -0.8125],
        [ 2.4062,  0.6445,  1.1875,  1.5078],
        [ 0.5195,  2.0625,  1.1562,  1.4688]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.2256, -0.8125, -0.8906, -0.4902],
        [ 1.4062,  0.0928,  0.9141,  1.9922],
        [ 1.2656, -0.5234, -0.2080,  0.8750],
        ...,
        [-0.0070, -2.8281, -3.4219, -0.8125],
        [ 2.4062,  0.6445,  1.1875,  1.5078],
        [ 0.5195,  2.0625,  1.1562,  1.4688]], requires_grad=True)
2024-10-08 15:06:18,288 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.3418, -0.9805, -0.9805, -0.6406],
        [ 1.7031, -0.0923,  0.6172,  2.2969],
        [ 1.3438, -0.5469, -0.2852,  0.9766],
        ...,
        [-0.1309, -3.0312, -3.4375, -1.0156],
        [ 2.7188,  0.6914,  1.1250,  1.7656],
        [ 0.6445,  2.2188,  1.1719,  1.6953]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.3418, -0.9805, -0.9805, -0.6406],
        [ 1.7031, -0.0923,  0.6172,  2.2969],
        [ 1.3438, -0.5469, -0.2852,  0.9766],
        ...,
        [-0.1309, -3.0312, -3.4375, -1.0156],
        [ 2.7188,  0.6914,  1.1250,  1.7656],
        [ 0.6445,  2.2188,  1.1719,  1.6953]], requires_grad=True)
2024-10-08 15:06:18,546 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.4453, -1.1172, -1.0703, -0.7617],
        [ 1.9375, -0.2305,  0.2471,  2.5938],
        [ 1.4062, -0.5625, -0.3730,  1.0703],
        ...,
        [-0.2354, -3.1875, -3.4219, -1.2031],
        [ 2.9844,  0.7344,  1.0547,  1.9922],
        [ 0.7500,  2.3594,  1.1641,  1.8906]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.4453, -1.1172, -1.0703, -0.7617],
        [ 1.9375, -0.2305,  0.2471,  2.5938],
        [ 1.4062, -0.5625, -0.3730,  1.0703],
        ...,
        [-0.2354, -3.1875, -3.4219, -1.2031],
        [ 2.9844,  0.7344,  1.0547,  1.9922],
        [ 0.7500,  2.3594,  1.1641,  1.8906]], requires_grad=True)
2024-10-08 15:06:18,686 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.5352, -1.2344, -1.1484, -0.8672],
        [ 2.1406, -0.3535, -0.0703,  2.8438],
        [ 1.4766, -0.5898, -0.4180,  1.1406],
        ...,
        [-0.3379, -3.3125, -3.4375, -1.3516],
        [ 3.2188,  0.7500,  1.0234,  2.1719],
        [ 0.8477,  2.4375,  1.1875,  2.0469]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.5352, -1.2344, -1.1484, -0.8672],
        [ 2.1406, -0.3535, -0.0703,  2.8438],
        [ 1.4766, -0.5898, -0.4180,  1.1406],
        ...,
        [-0.3379, -3.3125, -3.4375, -1.3516],
        [ 3.2188,  0.7500,  1.0234,  2.1719],
        [ 0.8477,  2.4375,  1.1875,  2.0469]], requires_grad=True)
2024-10-08 15:06:18,847 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.6016, -1.3359, -1.2031, -0.9609],
        [ 2.3125, -0.4609, -0.3516,  3.0625],
        [ 1.5547, -0.6250, -0.4316,  1.1953],
        ...,
        [-0.4688, -3.3750, -3.4844, -1.4609],
        [ 3.4531,  0.7305,  1.0391,  2.3125],
        [ 0.9336,  2.5000,  1.2109,  2.1719]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.6016, -1.3359, -1.2031, -0.9609],
        [ 2.3125, -0.4609, -0.3516,  3.0625],
        [ 1.5547, -0.6250, -0.4316,  1.1953],
        ...,
        [-0.4688, -3.3750, -3.4844, -1.4609],
        [ 3.4531,  0.7305,  1.0391,  2.3125],
        [ 0.9336,  2.5000,  1.2109,  2.1719]], requires_grad=True)
2024-10-08 15:06:18,997 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.6562, -1.4219, -1.2422, -1.0391],
        [ 2.4844, -0.5703, -0.5586,  3.2031],
        [ 1.6172, -0.6523, -0.4414,  1.2266],
        ...,
        [-0.5898, -3.4219, -3.5312, -1.5391],
        [ 3.6250,  0.7148,  1.0469,  2.4375],
        [ 1.0156,  2.5312,  1.2344,  2.2656]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.6562, -1.4219, -1.2422, -1.0391],
        [ 2.4844, -0.5703, -0.5586,  3.2031],
        [ 1.6172, -0.6523, -0.4414,  1.2266],
        ...,
        [-0.5898, -3.4219, -3.5312, -1.5391],
        [ 3.6250,  0.7148,  1.0469,  2.4375],
        [ 1.0156,  2.5312,  1.2344,  2.2656]], requires_grad=True)
2024-10-08 15:06:19,254 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.7969, -1.4766, -1.3047, -1.0625],
        [ 2.5781, -0.6484, -0.7695,  3.3438],
        [ 1.6250, -0.6641, -0.4668,  1.2734],
        ...,
        [-0.5703, -3.4688, -3.5000, -1.6562],
        [ 3.6562,  0.7383,  0.9922,  2.5781],
        [ 1.0703,  2.5625,  1.2422,  2.3594]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.7969, -1.4766, -1.3047, -1.0625],
        [ 2.5781, -0.6484, -0.7695,  3.3438],
        [ 1.6250, -0.6641, -0.4668,  1.2734],
        ...,
        [-0.5703, -3.4688, -3.5000, -1.6562],
        [ 3.6562,  0.7383,  0.9922,  2.5781],
        [ 1.0703,  2.5625,  1.2422,  2.3594]], requires_grad=True)
2024-10-08 15:06:19,410 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.8242, -1.5391, -1.3438, -1.1172],
        [ 2.5938, -0.6836, -0.9727,  3.4844],
        [ 1.5547, -0.6523, -0.5039,  1.3281],
        ...,
        [-0.4043, -3.5625, -3.4062, -1.8125],
        [ 3.5625,  0.8008,  0.9062,  2.7500],
        [ 1.0391,  2.6250,  1.2188,  2.4531]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.8242, -1.5391, -1.3438, -1.1172],
        [ 2.5938, -0.6836, -0.9727,  3.4844],
        [ 1.5547, -0.6523, -0.5039,  1.3281],
        ...,
        [-0.4043, -3.5625, -3.4062, -1.8125],
        [ 3.5625,  0.8008,  0.9062,  2.7500],
        [ 1.0391,  2.6250,  1.2188,  2.4531]], requires_grad=True)
2024-10-08 15:06:19,667 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.9648, -1.5469, -1.3750, -1.1484],
        [ 2.6406, -0.7383, -1.1484,  3.5781],
        [ 1.5391, -0.6641, -0.5312,  1.3672],
        ...,
        [-0.2715, -3.6250, -3.3125, -1.9375],
        [ 3.5000,  0.8281,  0.8320,  2.8750],
        [ 1.0312,  2.6562,  1.1953,  2.5312]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.9648, -1.5469, -1.3750, -1.1484],
        [ 2.6406, -0.7383, -1.1484,  3.5781],
        [ 1.5391, -0.6641, -0.5312,  1.3672],
        ...,
        [-0.2715, -3.6250, -3.3125, -1.9375],
        [ 3.5000,  0.8281,  0.8320,  2.8750],
        [ 1.0312,  2.6562,  1.1953,  2.5312]], requires_grad=True)
2024-10-08 15:06:19,921 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.0859, -1.5547, -1.3984, -1.1719],
        [ 2.7812, -0.8438, -1.3125,  3.6406],
        [ 1.5703, -0.6953, -0.5586,  1.3906],
        ...,
        [-0.2490, -3.6250, -3.2188, -2.0312],
        [ 3.4531,  0.8320,  0.7617,  2.9531],
        [ 1.0781,  2.6406,  1.1641,  2.5781]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.0859, -1.5547, -1.3984, -1.1719],
        [ 2.7812, -0.8438, -1.3125,  3.6406],
        [ 1.5703, -0.6953, -0.5586,  1.3906],
        ...,
        [-0.2490, -3.6250, -3.2188, -2.0312],
        [ 3.4531,  0.8320,  0.7617,  2.9531],
        [ 1.0781,  2.6406,  1.1641,  2.5781]], requires_grad=True)
2024-10-08 15:06:20,172 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.2109, -1.5469, -1.4141, -1.1797],
        [ 2.9375, -0.9688, -1.4609,  3.6875],
        [ 1.6562, -0.7500, -0.5938,  1.3906],
        ...,
        [-0.3223, -3.5469, -3.1094, -2.0781],
        [ 3.4531,  0.8008,  0.6836,  3.0000],
        [ 1.1328,  2.6094,  1.1406,  2.6094]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.2109, -1.5469, -1.4141, -1.1797],
        [ 2.9375, -0.9688, -1.4609,  3.6875],
        [ 1.6562, -0.7500, -0.5938,  1.3906],
        ...,
        [-0.3223, -3.5469, -3.1094, -2.0781],
        [ 3.4531,  0.8008,  0.6836,  3.0000],
        [ 1.1328,  2.6094,  1.1406,  2.6094]], requires_grad=True)
2024-10-08 15:06:20,422 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.1875, -1.5781, -1.4453, -1.1953],
        [ 2.9375, -1.0000, -1.5312,  3.7344],
        [ 1.6797, -0.7773, -0.6094,  1.3906],
        ...,
        [-0.2197, -3.5469, -3.0469, -2.1406],
        [ 3.2969,  0.8398,  0.6523,  3.0625],
        [ 1.0938,  2.6250,  1.1406,  2.6406]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.1875, -1.5781, -1.4453, -1.1953],
        [ 2.9375, -1.0000, -1.5312,  3.7344],
        [ 1.6797, -0.7773, -0.6094,  1.3906],
        ...,
        [-0.2197, -3.5469, -3.0469, -2.1406],
        [ 3.2969,  0.8398,  0.6523,  3.0625],
        [ 1.0938,  2.6250,  1.1406,  2.6406]], requires_grad=True)
2024-10-08 15:06:20,691 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.2188, -1.5781, -1.4531, -1.1953],
        [ 2.8906, -0.9961, -1.5781,  3.7812],
        [ 1.6875, -0.7969, -0.6211,  1.3906],
        ...,
        [-0.1133, -3.5469, -2.9844, -2.1875],
        [ 3.1719,  0.8711,  0.6211,  3.1094],
        [ 1.0312,  2.6406,  1.1406,  2.6719]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.2188, -1.5781, -1.4531, -1.1953],
        [ 2.8906, -0.9961, -1.5781,  3.7812],
        [ 1.6875, -0.7969, -0.6211,  1.3906],
        ...,
        [-0.1133, -3.5469, -2.9844, -2.1875],
        [ 3.1719,  0.8711,  0.6211,  3.1094],
        [ 1.0312,  2.6406,  1.1406,  2.6719]], requires_grad=True)
2024-10-08 15:06:20,954 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.1562, -1.6094, -1.4766, -1.2031],
        [ 2.6719, -0.8750, -1.5156,  3.8281],
        [ 1.6562, -0.7969, -0.6172,  1.3906],
        ...,
        [ 0.0593, -3.5781, -2.9531, -2.2344],
        [ 3.0000,  0.9180,  0.6133,  3.1406],
        [ 0.8828,  2.7188,  1.1875,  2.7031]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.1562, -1.6094, -1.4766, -1.2031],
        [ 2.6719, -0.8750, -1.5156,  3.8281],
        [ 1.6562, -0.7969, -0.6172,  1.3906],
        ...,
        [ 0.0593, -3.5781, -2.9531, -2.2344],
        [ 3.0000,  0.9180,  0.6133,  3.1406],
        [ 0.8828,  2.7188,  1.1875,  2.7031]], requires_grad=True)
2024-10-08 15:06:21,225 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.1016, -1.6328, -1.4922, -1.2109],
        [ 2.4844, -0.7656, -1.4609,  3.8594],
        [ 1.6016, -0.7734, -0.5938,  1.3984],
        ...,
        [ 0.1992, -3.5938, -2.9219, -2.2656],
        [ 2.8594,  0.9492,  0.5938,  3.1719],
        [ 0.7461,  2.7812,  1.2266,  2.7188]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.1016, -1.6328, -1.4922, -1.2109],
        [ 2.4844, -0.7656, -1.4609,  3.8594],
        [ 1.6016, -0.7734, -0.5938,  1.3984],
        ...,
        [ 0.1992, -3.5938, -2.9219, -2.2656],
        [ 2.8594,  0.9492,  0.5938,  3.1719],
        [ 0.7461,  2.7812,  1.2266,  2.7188]], requires_grad=True)
2024-10-08 15:06:21,377 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.0156, -1.6562, -1.5078, -1.2031],
        [ 2.4375, -0.7734, -1.5000,  3.8594],
        [ 1.6484, -0.7930, -0.6094,  1.3984],
        ...,
        [ 0.1084, -3.4688, -2.7656, -2.2812],
        [ 2.8438,  0.9062,  0.5195,  3.1719],
        [ 0.7227,  2.7500,  1.1953,  2.7188]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.0156, -1.6562, -1.5078, -1.2031],
        [ 2.4375, -0.7734, -1.5000,  3.8594],
        [ 1.6484, -0.7930, -0.6094,  1.3984],
        ...,
        [ 0.1084, -3.4688, -2.7656, -2.2812],
        [ 2.8438,  0.9062,  0.5195,  3.1719],
        [ 0.7227,  2.7500,  1.1953,  2.7188]], requires_grad=True)
2024-10-08 15:06:21,634 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.9727, -1.6719, -1.5156, -1.2031],
        [ 2.3281, -0.7109, -1.4609,  3.8750],
        [ 1.7109, -0.8125, -0.6211,  1.3984],
        ...,
        [-0.0295, -3.3125, -2.6094, -2.2812],
        [ 2.8438,  0.8594,  0.4492,  3.1719],
        [ 0.6875,  2.7344,  1.1719,  2.7188]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.9727, -1.6719, -1.5156, -1.2031],
        [ 2.3281, -0.7109, -1.4609,  3.8750],
        [ 1.7109, -0.8125, -0.6211,  1.3984],
        ...,
        [-0.0295, -3.3125, -2.6094, -2.2812],
        [ 2.8438,  0.8594,  0.4492,  3.1719],
        [ 0.6875,  2.7344,  1.1719,  2.7188]], requires_grad=True)
2024-10-08 15:06:21,892 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.9062, -1.6875, -1.5312, -1.2031],
        [ 2.1875, -0.6211, -1.3906,  3.8750],
        [ 1.7109, -0.8125, -0.6172,  1.3984],
        ...,
        [-0.0430, -3.2344, -2.5156, -2.2969],
        [ 2.7656,  0.8555,  0.4238,  3.1562],
        [ 0.6211,  2.7344,  1.1797,  2.7188]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.9062, -1.6875, -1.5312, -1.2031],
        [ 2.1875, -0.6211, -1.3906,  3.8750],
        [ 1.7109, -0.8125, -0.6172,  1.3984],
        ...,
        [-0.0430, -3.2344, -2.5156, -2.2969],
        [ 2.7656,  0.8555,  0.4238,  3.1562],
        [ 0.6211,  2.7344,  1.1797,  2.7188]], requires_grad=True)
2024-10-08 15:06:22,145 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.8242, -1.7188, -1.5625, -1.2109],
        [ 2.0156, -0.4824, -1.2656,  3.8906],
        [ 1.6406, -0.7734, -0.5703,  1.4062],
        ...,
        [ 0.0159, -3.2031, -2.4688, -2.3125],
        [ 2.6250,  0.8984,  0.4473,  3.1562],
        [ 0.4902,  2.7969,  1.2344,  2.7188]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.8242, -1.7188, -1.5625, -1.2109],
        [ 2.0156, -0.4824, -1.2656,  3.8906],
        [ 1.6406, -0.7734, -0.5703,  1.4062],
        ...,
        [ 0.0159, -3.2031, -2.4688, -2.3125],
        [ 2.6250,  0.8984,  0.4473,  3.1562],
        [ 0.4902,  2.7969,  1.2344,  2.7188]], requires_grad=True)
2024-10-08 15:06:22,417 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.8320, -1.6875, -1.5391, -1.1953],
        [ 1.8828, -0.3574, -1.1406,  3.9062],
        [ 1.5781, -0.7266, -0.5234,  1.4141],
        ...,
        [ 0.0408, -3.1562, -2.4062, -2.3125],
        [ 2.5000,  0.9336,  0.4688,  3.1562],
        [ 0.3926,  2.8281,  1.2656,  2.7188]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.8320, -1.6875, -1.5391, -1.1953],
        [ 1.8828, -0.3574, -1.1406,  3.9062],
        [ 1.5781, -0.7266, -0.5234,  1.4141],
        ...,
        [ 0.0408, -3.1562, -2.4062, -2.3125],
        [ 2.5000,  0.9336,  0.4688,  3.1562],
        [ 0.3926,  2.8281,  1.2656,  2.7188]], requires_grad=True)
2024-10-08 15:06:22,673 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.9102, -1.6250, -1.4844, -1.1719],
        [ 1.8438, -0.3184, -1.0938,  3.8906],
        [ 1.5859, -0.7148, -0.5078,  1.4141],
        ...,
        [-0.0383, -3.0469, -2.2812, -2.2969],
        [ 2.4688,  0.9102,  0.4434,  3.1406],
        [ 0.3594,  2.7969,  1.2500,  2.6875]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.9102, -1.6250, -1.4844, -1.1719],
        [ 1.8438, -0.3184, -1.0938,  3.8906],
        [ 1.5859, -0.7148, -0.5078,  1.4141],
        ...,
        [-0.0383, -3.0469, -2.2812, -2.2969],
        [ 2.4688,  0.9102,  0.4434,  3.1406],
        [ 0.3594,  2.7969,  1.2500,  2.6875]], requires_grad=True)
2024-10-08 15:06:22,826 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.0391, -1.5391, -1.4062, -1.1484],
        [ 1.8203, -0.3125, -1.0781,  3.8594],
        [ 1.6172, -0.7188, -0.5039,  1.4062],
        ...,
        [-0.1465, -2.9062, -2.1562, -2.2812],
        [ 2.4844,  0.8398,  0.3809,  3.1094],
        [ 0.3457,  2.7344,  1.2109,  2.6406]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.0391, -1.5391, -1.4062, -1.1484],
        [ 1.8203, -0.3125, -1.0781,  3.8594],
        [ 1.6172, -0.7188, -0.5039,  1.4062],
        ...,
        [-0.1465, -2.9062, -2.1562, -2.2812],
        [ 2.4844,  0.8398,  0.3809,  3.1094],
        [ 0.3457,  2.7344,  1.2109,  2.6406]], requires_grad=True)
2024-10-08 15:06:23,089 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.1797, -1.4375, -1.3281, -1.1172],
        [ 1.9766, -0.4434, -1.1797,  3.8125],
        [ 1.6875, -0.7344, -0.5117,  1.3984],
        ...,
        [-0.2578, -2.7812, -2.0469, -2.2656],
        [ 2.5156,  0.7617,  0.3125,  3.0625],
        [ 0.3613,  2.6562,  1.1562,  2.5938]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.1797, -1.4375, -1.3281, -1.1172],
        [ 1.9766, -0.4434, -1.1797,  3.8125],
        [ 1.6875, -0.7344, -0.5117,  1.3984],
        ...,
        [-0.2578, -2.7812, -2.0469, -2.2656],
        [ 2.5156,  0.7617,  0.3125,  3.0625],
        [ 0.3613,  2.6562,  1.1562,  2.5938]], requires_grad=True)
2024-10-08 15:06:23,355 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.2969, -1.3438, -1.2578, -1.0859],
        [ 2.0625, -0.5078, -1.2109,  3.7812],
        [ 1.7109, -0.7227, -0.4961,  1.3984],
        ...,
        [-0.3555, -2.6875, -1.9531, -2.2500],
        [ 2.5469,  0.6914,  0.2559,  3.0156],
        [ 0.3359,  2.6094,  1.1406,  2.5625]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.2969, -1.3438, -1.2578, -1.0859],
        [ 2.0625, -0.5078, -1.2109,  3.7812],
        [ 1.7109, -0.7227, -0.4961,  1.3984],
        ...,
        [-0.3555, -2.6875, -1.9531, -2.2500],
        [ 2.5469,  0.6914,  0.2559,  3.0156],
        [ 0.3359,  2.6094,  1.1406,  2.5625]], requires_grad=True)
2024-10-08 15:06:23,621 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.2266, -1.3359, -1.2422, -1.0781],
        [ 2.1719, -0.5938, -1.2656,  3.7344],
        [ 1.6641, -0.6836, -0.4590,  1.3984],
        ...,
        [-0.3789, -2.6250, -1.8984, -2.2500],
        [ 2.4844,  0.6797,  0.2422,  2.9688],
        [ 0.2715,  2.5938,  1.1484,  2.5312]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.2266, -1.3359, -1.2422, -1.0781],
        [ 2.1719, -0.5938, -1.2656,  3.7344],
        [ 1.6641, -0.6836, -0.4590,  1.3984],
        ...,
        [-0.3789, -2.6250, -1.8984, -2.2500],
        [ 2.4844,  0.6797,  0.2422,  2.9688],
        [ 0.2715,  2.5938,  1.1484,  2.5312]], requires_grad=True)
2024-10-08 15:06:23,909 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.1484, -1.3359, -1.2344, -1.0703],
        [ 2.3125, -0.7109, -1.3438,  3.6719],
        [ 1.6328, -0.6562, -0.4316,  1.3906],
        ...,
        [-0.4199, -2.5469, -1.8359, -2.2344],
        [ 2.4375,  0.6523,  0.2197,  2.9219],
        [ 0.1943,  2.5938,  1.1719,  2.5000]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.1484, -1.3359, -1.2344, -1.0703],
        [ 2.3125, -0.7109, -1.3438,  3.6719],
        [ 1.6328, -0.6562, -0.4316,  1.3906],
        ...,
        [-0.4199, -2.5469, -1.8359, -2.2344],
        [ 2.4375,  0.6523,  0.2197,  2.9219],
        [ 0.1943,  2.5938,  1.1719,  2.5000]], requires_grad=True)
2024-10-08 15:06:24,173 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.0469, -1.3516, -1.2344, -1.0703],
        [ 2.4375, -0.8203, -1.4141,  3.5938],
        [ 1.6016, -0.6289, -0.4043,  1.3828],
        ...,
        [-0.4453, -2.4844, -1.7812, -2.2188],
        [ 2.3750,  0.6328,  0.2041,  2.8750],
        [ 0.0898,  2.6250,  1.2109,  2.4688]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.0469, -1.3516, -1.2344, -1.0703],
        [ 2.4375, -0.8203, -1.4141,  3.5938],
        [ 1.6016, -0.6289, -0.4043,  1.3828],
        ...,
        [-0.4453, -2.4844, -1.7812, -2.2188],
        [ 2.3750,  0.6328,  0.2041,  2.8750],
        [ 0.0898,  2.6250,  1.2109,  2.4688]], requires_grad=True)
2024-10-08 15:06:24,426 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.9766, -1.3438, -1.2188, -1.0547],
        [ 2.5312, -0.9258, -1.4844,  3.5156],
        [ 1.5625, -0.6016, -0.3809,  1.3750],
        ...,
        [-0.4512, -2.4219, -1.7344, -2.2031],
        [ 2.3281,  0.5977,  0.1787,  2.8125],
        [-0.0271,  2.6719,  1.2578,  2.4531]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.9766, -1.3438, -1.2188, -1.0547],
        [ 2.5312, -0.9258, -1.4844,  3.5156],
        [ 1.5625, -0.6016, -0.3809,  1.3750],
        ...,
        [-0.4512, -2.4219, -1.7344, -2.2031],
        [ 2.3281,  0.5977,  0.1787,  2.8125],
        [-0.0271,  2.6719,  1.2578,  2.4531]], requires_grad=True)
2024-10-08 15:06:24,684 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.9180, -1.3281, -1.1953, -1.0391],
        [ 2.6250, -1.0391, -1.5547,  3.4219],
        [ 1.5312, -0.5820, -0.3633,  1.3594],
        ...,
        [-0.4531, -2.3594, -1.6875, -2.1719],
        [ 2.3125,  0.5547,  0.1494,  2.7656],
        [-0.1299,  2.7188,  1.2969,  2.4219]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.9180, -1.3281, -1.1953, -1.0391],
        [ 2.6250, -1.0391, -1.5547,  3.4219],
        [ 1.5312, -0.5820, -0.3633,  1.3594],
        ...,
        [-0.4531, -2.3594, -1.6875, -2.1719],
        [ 2.3125,  0.5547,  0.1494,  2.7656],
        [-0.1299,  2.7188,  1.2969,  2.4219]], requires_grad=True)
2024-10-08 15:06:24,841 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.8867, -1.2891, -1.1562, -1.0078],
        [ 2.7031, -1.1328, -1.6094,  3.3281],
        [ 1.5156, -0.5781, -0.3555,  1.3359],
        ...,
        [-0.4785, -2.2656, -1.6250, -2.1250],
        [ 2.3125,  0.4844,  0.0991,  2.7031],
        [-0.2207,  2.7500,  1.3281,  2.3906]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.8867, -1.2891, -1.1562, -1.0078],
        [ 2.7031, -1.1328, -1.6094,  3.3281],
        [ 1.5156, -0.5781, -0.3555,  1.3359],
        ...,
        [-0.4785, -2.2656, -1.6250, -2.1250],
        [ 2.3125,  0.4844,  0.0991,  2.7031],
        [-0.2207,  2.7500,  1.3281,  2.3906]], requires_grad=True)
2024-10-08 15:06:25,108 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.8594, -1.2422, -1.1172, -0.9727],
        [ 2.7344, -1.1875, -1.6328,  3.2500],
        [ 1.4766, -0.5664, -0.3438,  1.3125],
        ...,
        [-0.5078, -2.1719, -1.5547, -2.0781],
        [ 2.3281,  0.3965,  0.0361,  2.6250],
        [-0.3203,  2.7969,  1.3672,  2.3594]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.8594, -1.2422, -1.1172, -0.9727],
        [ 2.7344, -1.1875, -1.6328,  3.2500],
        [ 1.4766, -0.5664, -0.3438,  1.3125],
        ...,
        [-0.5078, -2.1719, -1.5547, -2.0781],
        [ 2.3281,  0.3965,  0.0361,  2.6250],
        [-0.3203,  2.7969,  1.3672,  2.3594]], requires_grad=True)
2024-10-08 15:06:25,268 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.8320, -1.1953, -1.0781, -0.9414],
        [ 2.7500, -1.2188, -1.6484,  3.1719],
        [ 1.4531, -0.5625, -0.3379,  1.2812],
        ...,
        [-0.5234, -2.0938, -1.4844, -2.0312],
        [ 2.3281,  0.3223, -0.0166,  2.5469],
        [-0.4023,  2.8281,  1.3906,  2.3281]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.8320, -1.1953, -1.0781, -0.9414],
        [ 2.7500, -1.2188, -1.6484,  3.1719],
        [ 1.4531, -0.5625, -0.3379,  1.2812],
        ...,
        [-0.5234, -2.0938, -1.4844, -2.0312],
        [ 2.3281,  0.3223, -0.0166,  2.5469],
        [-0.4023,  2.8281,  1.3906,  2.3281]], requires_grad=True)
2024-10-08 15:06:25,528 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.7539, -1.1797, -1.0625, -0.9141],
        [ 2.6406, -1.1328, -1.5781,  3.1094],
        [ 1.3438, -0.5117, -0.3008,  1.2578],
        ...,
        [-0.3770, -2.1250, -1.5078, -2.0000],
        [ 2.2188,  0.3359, -0.0060,  2.4844],
        [-0.6211,  2.9844,  1.5000,  2.3281]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.7539, -1.1797, -1.0625, -0.9141],
        [ 2.6406, -1.1328, -1.5781,  3.1094],
        [ 1.3438, -0.5117, -0.3008,  1.2578],
        ...,
        [-0.3770, -2.1250, -1.5078, -2.0000],
        [ 2.2188,  0.3359, -0.0060,  2.4844],
        [-0.6211,  2.9844,  1.5000,  2.3281]], requires_grad=True)
2024-10-08 15:06:25,689 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.6602, -1.1719, -1.0547, -0.8945],
        [ 2.5000, -1.0312, -1.4844,  3.0469],
        [ 1.2344, -0.4590, -0.2617,  1.2344],
        ...,
        [-0.2480, -2.1406, -1.5156, -1.9609],
        [ 2.0938,  0.3652,  0.0168,  2.4219],
        [-0.8281,  3.1250,  1.6016,  2.3125]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.6602, -1.1719, -1.0547, -0.8945],
        [ 2.5000, -1.0312, -1.4844,  3.0469],
        [ 1.2344, -0.4590, -0.2617,  1.2344],
        ...,
        [-0.2480, -2.1406, -1.5156, -1.9609],
        [ 2.0938,  0.3652,  0.0168,  2.4219],
        [-0.8281,  3.1250,  1.6016,  2.3125]], requires_grad=True)
2024-10-08 15:06:25,935 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.6211, -1.1250, -1.0078, -0.8555],
        [ 2.4531, -1.0625, -1.4922,  2.9531],
        [ 1.2031, -0.4707, -0.2695,  1.1953],
        ...,
        [-0.2178, -2.0469, -1.4453, -1.8984],
        [ 2.0625,  0.2676, -0.0500,  2.3281],
        [-0.9453,  3.1406,  1.6250,  2.2812]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.6211, -1.1250, -1.0078, -0.8555],
        [ 2.4531, -1.0625, -1.4922,  2.9531],
        [ 1.2031, -0.4707, -0.2695,  1.1953],
        ...,
        [-0.2178, -2.0469, -1.4453, -1.8984],
        [ 2.0625,  0.2676, -0.0500,  2.3281],
        [-0.9453,  3.1406,  1.6250,  2.2812]], requires_grad=True)
2024-10-08 15:06:26,102 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.5430, -1.1016, -0.9805, -0.8164],
        [ 2.3750, -1.0469, -1.4609,  2.8594],
        [ 1.1641, -0.4766, -0.2734,  1.1562],
        ...,
        [-0.1631, -1.9766, -1.3906, -1.8438],
        [ 1.9844,  0.2178, -0.0835,  2.2500],
        [-1.0703,  3.1719,  1.6484,  2.2500]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.5430, -1.1016, -0.9805, -0.8164],
        [ 2.3750, -1.0469, -1.4609,  2.8594],
        [ 1.1641, -0.4766, -0.2734,  1.1562],
        ...,
        [-0.1631, -1.9766, -1.3906, -1.8438],
        [ 1.9844,  0.2178, -0.0835,  2.2500],
        [-1.0703,  3.1719,  1.6484,  2.2500]], requires_grad=True)
2024-10-08 15:06:26,261 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.4824, -1.0703, -0.9492, -0.7812],
        [ 2.2188, -0.9570, -1.3750,  2.7969],
        [ 1.1016, -0.4648, -0.2656,  1.1250],
        ...,
        [ 0.0126, -2.0156, -1.4297, -1.8125],
        [ 1.7969,  0.2676, -0.0420,  2.2031],
        [-1.3047,  3.3438,  1.7656,  2.2344]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.4824, -1.0703, -0.9492, -0.7812],
        [ 2.2188, -0.9570, -1.3750,  2.7969],
        [ 1.1016, -0.4648, -0.2656,  1.1250],
        ...,
        [ 0.0126, -2.0156, -1.4297, -1.8125],
        [ 1.7969,  0.2676, -0.0420,  2.2031],
        [-1.3047,  3.3438,  1.7656,  2.2344]], requires_grad=True)
2024-10-08 15:06:26,513 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.4258, -1.0391, -0.9219, -0.7500],
        [ 1.9844, -0.7617, -1.2031,  2.7500],
        [ 0.9922, -0.4219, -0.2334,  1.1016],
        ...,
        [ 0.2988, -2.1406, -1.5312, -1.7891],
        [ 1.5000,  0.4121,  0.0654,  2.1719],
        [-1.5938,  3.5938,  1.9297,  2.2500]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.4258, -1.0391, -0.9219, -0.7500],
        [ 1.9844, -0.7617, -1.2031,  2.7500],
        [ 0.9922, -0.4219, -0.2334,  1.1016],
        ...,
        [ 0.2988, -2.1406, -1.5312, -1.7891],
        [ 1.5000,  0.4121,  0.0654,  2.1719],
        [-1.5938,  3.5938,  1.9297,  2.2500]], requires_grad=True)
2024-10-08 15:06:26,774 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.4785, -0.9414, -0.8516, -0.7070],
        [ 1.9062, -0.7305, -1.1484,  2.7031],
        [ 1.0000, -0.4414, -0.2422,  1.0781],
        ...,
        [ 0.4023, -2.1406, -1.5391, -1.7578],
        [ 1.3906,  0.4316,  0.0908,  2.1250],
        [-1.7500,  3.7188,  2.0312,  2.2500]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.4785, -0.9414, -0.8516, -0.7070],
        [ 1.9062, -0.7305, -1.1484,  2.7031],
        [ 1.0000, -0.4414, -0.2422,  1.0781],
        ...,
        [ 0.4023, -2.1406, -1.5391, -1.7578],
        [ 1.3906,  0.4316,  0.0908,  2.1250],
        [-1.7500,  3.7188,  2.0312,  2.2500]], requires_grad=True)
2024-10-08 15:06:27,045 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.5703, -0.8164, -0.7578, -0.6602],
        [ 1.8750, -0.7500, -1.1484,  2.6406],
        [ 1.0078, -0.4570, -0.2520,  1.0547],
        ...,
        [ 0.4551, -2.0781, -1.5000, -1.7109],
        [ 1.2969,  0.4355,  0.1060,  2.0781],
        [-1.8359,  3.7500,  2.0625,  2.2344]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.5703, -0.8164, -0.7578, -0.6602],
        [ 1.8750, -0.7500, -1.1484,  2.6406],
        [ 1.0078, -0.4570, -0.2520,  1.0547],
        ...,
        [ 0.4551, -2.0781, -1.5000, -1.7109],
        [ 1.2969,  0.4355,  0.1060,  2.0781],
        [-1.8359,  3.7500,  2.0625,  2.2344]], requires_grad=True)
2024-10-08 15:06:27,199 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.6680, -0.6797, -0.6562, -0.6094],
        [ 1.8516, -0.7930, -1.1641,  2.5781],
        [ 0.9961, -0.4609, -0.2520,  1.0312],
        ...,
        [ 0.5586, -2.1094, -1.5391, -1.6875],
        [ 1.1875,  0.4746,  0.1455,  2.0312],
        [-1.9453,  3.8438,  2.1406,  2.2344]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.6680, -0.6797, -0.6562, -0.6094],
        [ 1.8516, -0.7930, -1.1641,  2.5781],
        [ 0.9961, -0.4609, -0.2520,  1.0312],
        ...,
        [ 0.5586, -2.1094, -1.5391, -1.6875],
        [ 1.1875,  0.4746,  0.1455,  2.0312],
        [-1.9453,  3.8438,  2.1406,  2.2344]], requires_grad=True)
2024-10-08 15:06:27,467 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.7266, -0.5195, -0.5352, -0.5312],
        [ 1.7422, -0.6797, -1.0547,  2.5312],
        [ 0.9453, -0.4121, -0.2129,  1.0156],
        ...,
        [ 0.6953, -2.2031, -1.6250, -1.6797],
        [ 1.0547,  0.5859,  0.2373,  2.0156],
        [-2.0469,  3.9531,  2.2188,  2.2344]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.7266, -0.5195, -0.5352, -0.5312],
        [ 1.7422, -0.6797, -1.0547,  2.5312],
        [ 0.9453, -0.4121, -0.2129,  1.0156],
        ...,
        [ 0.6953, -2.2031, -1.6250, -1.6797],
        [ 1.0547,  0.5859,  0.2373,  2.0156],
        [-2.0469,  3.9531,  2.2188,  2.2344]], requires_grad=True)
2024-10-08 15:06:27,626 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.6914, -0.4395, -0.4668, -0.4648],
        [ 1.7812, -0.7148, -1.0391,  2.5000],
        [ 1.0547, -0.4941, -0.2617,  0.9922],
        ...,
        [ 0.4844, -1.9609, -1.4766, -1.6641],
        [ 1.0625,  0.5312,  0.2178,  1.9844],
        [-1.7969,  3.6719,  2.0938,  2.2500]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.6914, -0.4395, -0.4668, -0.4648],
        [ 1.7812, -0.7148, -1.0391,  2.5000],
        [ 1.0547, -0.4941, -0.2617,  0.9922],
        ...,
        [ 0.4844, -1.9609, -1.4766, -1.6641],
        [ 1.0625,  0.5312,  0.2178,  1.9844],
        [-1.7969,  3.6719,  2.0938,  2.2500]], requires_grad=True)
2024-10-08 15:06:27,895 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.7227, -0.3184, -0.3770, -0.4004],
        [ 1.8984, -0.8828, -1.1172,  2.4375],
        [ 1.1484, -0.5586, -0.3008,  0.9727],
        ...,
        [ 0.2793, -1.7500, -1.3516, -1.6562],
        [ 1.0859,  0.4785,  0.2002,  1.9531],
        [-1.5547,  3.4375,  1.9922,  2.2812]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.7227, -0.3184, -0.3770, -0.4004],
        [ 1.8984, -0.8828, -1.1172,  2.4375],
        [ 1.1484, -0.5586, -0.3008,  0.9727],
        ...,
        [ 0.2793, -1.7500, -1.3516, -1.6562],
        [ 1.0859,  0.4785,  0.2002,  1.9531],
        [-1.5547,  3.4375,  1.9922,  2.2812]], requires_grad=True)
2024-10-08 15:06:28,159 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.7266, -0.2090, -0.2930, -0.3359],
        [ 2.0312, -1.0391, -1.1953,  2.3906],
        [ 1.2031, -0.5977, -0.3223,  0.9570],
        ...,
        [ 0.2158, -1.7422, -1.3516, -1.6641],
        [ 1.0078,  0.5742,  0.2656,  1.9375],
        [-1.3594,  3.2500,  1.9141,  2.3125]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.7266, -0.2090, -0.2930, -0.3359],
        [ 2.0312, -1.0391, -1.1953,  2.3906],
        [ 1.2031, -0.5977, -0.3223,  0.9570],
        ...,
        [ 0.2158, -1.7422, -1.3516, -1.6641],
        [ 1.0078,  0.5742,  0.2656,  1.9375],
        [-1.3594,  3.2500,  1.9141,  2.3125]], requires_grad=True)
2024-10-08 15:06:28,420 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.7109, -0.1338, -0.2305, -0.2773],
        [ 2.1250, -1.1562, -1.2422,  2.3438],
        [ 1.2266, -0.6094, -0.3320,  0.9375],
        ...,
        [ 0.1875, -1.7656, -1.3672, -1.6641],
        [ 0.9180,  0.6836,  0.3359,  1.9219],
        [-1.2031,  3.1250,  1.8594,  2.3281]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.7109, -0.1338, -0.2305, -0.2773],
        [ 2.1250, -1.1562, -1.2422,  2.3438],
        [ 1.2266, -0.6094, -0.3320,  0.9375],
        ...,
        [ 0.1875, -1.7656, -1.3672, -1.6641],
        [ 0.9180,  0.6836,  0.3359,  1.9219],
        [-1.2031,  3.1250,  1.8594,  2.3281]], requires_grad=True)
2024-10-08 15:06:28,691 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.7188, -0.0322, -0.1602, -0.2246],
        [ 2.1875, -1.2578, -1.2812,  2.2969],
        [ 1.2734, -0.6602, -0.3574,  0.9180],
        ...,
        [ 0.1084, -1.7109, -1.3438, -1.6641],
        [ 0.8633,  0.7344,  0.3789,  1.9062],
        [-1.0234,  2.9375,  1.7812,  2.3438]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.7188, -0.0322, -0.1602, -0.2246],
        [ 2.1875, -1.2578, -1.2812,  2.2969],
        [ 1.2734, -0.6602, -0.3574,  0.9180],
        ...,
        [ 0.1084, -1.7109, -1.3438, -1.6641],
        [ 0.8633,  0.7344,  0.3789,  1.9062],
        [-1.0234,  2.9375,  1.7812,  2.3438]], requires_grad=True)
2024-10-08 15:06:28,941 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.7500,  0.1553, -0.0620, -0.1689],
        [ 2.2656, -1.5078, -1.3828,  2.2344],
        [ 1.3281, -0.7617, -0.4023,  0.8906],
        ...,
        [ 0.0039, -1.5312, -1.2734, -1.6562],
        [ 0.8359,  0.6445,  0.3652,  1.8750],
        [-0.8398,  2.6562,  1.6719,  2.3438]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.7500,  0.1553, -0.0620, -0.1689],
        [ 2.2656, -1.5078, -1.3828,  2.2344],
        [ 1.3281, -0.7617, -0.4023,  0.8906],
        ...,
        [ 0.0039, -1.5312, -1.2734, -1.6562],
        [ 0.8359,  0.6445,  0.3652,  1.8750],
        [-0.8398,  2.6562,  1.6719,  2.3438]], requires_grad=True)
2024-10-08 15:06:29,198 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.7852,  0.3164,  0.0228, -0.1230],
        [ 2.2969, -1.6562, -1.4453,  2.1719],
        [ 1.3203, -0.7852, -0.4180,  0.8594],
        ...,
        [-0.0500, -1.4375, -1.2266, -1.6406],
        [ 0.7539,  0.6719,  0.3887,  1.8359],
        [-0.7227,  2.5000,  1.6016,  2.3281]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.7852,  0.3164,  0.0228, -0.1230],
        [ 2.2969, -1.6562, -1.4453,  2.1719],
        [ 1.3203, -0.7852, -0.4180,  0.8594],
        ...,
        [-0.0500, -1.4375, -1.2266, -1.6406],
        [ 0.7539,  0.6719,  0.3887,  1.8359],
        [-0.7227,  2.5000,  1.6016,  2.3281]], requires_grad=True)
2024-10-08 15:06:29,456 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.7227,  0.4043,  0.0825, -0.0613],
        [ 2.2188, -1.5156, -1.3828,  2.1094],
        [ 1.2656, -0.7109, -0.3945,  0.8359],
        ...,
        [ 0.0143, -1.5391, -1.2578, -1.6172],
        [ 0.5781,  0.8711,  0.4727,  1.7891],
        [-0.6914,  2.5312,  1.5938,  2.3125]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.7227,  0.4043,  0.0825, -0.0613],
        [ 2.2188, -1.5156, -1.3828,  2.1094],
        [ 1.2656, -0.7109, -0.3945,  0.8359],
        ...,
        [ 0.0143, -1.5391, -1.2578, -1.6172],
        [ 0.5781,  0.8711,  0.4727,  1.7891],
        [-0.6914,  2.5312,  1.5938,  2.3125]], requires_grad=True)
2024-10-08 15:06:29,720 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.6562,  0.4707,  0.1299, -0.0056],
        [ 2.1406, -1.3516, -1.3125,  2.0469],
        [ 1.2109, -0.6562, -0.3770,  0.8125],
        ...,
        [ 0.0723, -1.5938, -1.2656, -1.5859],
        [ 0.4219,  1.0156,  0.5352,  1.7344],
        [-0.6680,  2.5156,  1.5703,  2.2812]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.6562,  0.4707,  0.1299, -0.0056],
        [ 2.1406, -1.3516, -1.3125,  2.0469],
        [ 1.2109, -0.6562, -0.3770,  0.8125],
        ...,
        [ 0.0723, -1.5938, -1.2656, -1.5859],
        [ 0.4219,  1.0156,  0.5352,  1.7344],
        [-0.6680,  2.5156,  1.5703,  2.2812]], requires_grad=True)
2024-10-08 15:06:29,978 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.6211,  0.6250,  0.2168,  0.0601],
        [ 2.0781, -1.3750, -1.3359,  1.9688],
        [ 1.1641, -0.7148, -0.4160,  0.7656],
        ...,
        [ 0.0408, -1.3750, -1.1328, -1.5312],
        [ 0.3457,  1.0078,  0.5273,  1.6797],
        [-0.6172,  2.2344,  1.4219,  2.2031]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.6211,  0.6250,  0.2168,  0.0601],
        [ 2.0781, -1.3750, -1.3359,  1.9688],
        [ 1.1641, -0.7148, -0.4160,  0.7656],
        ...,
        [ 0.0408, -1.3750, -1.1328, -1.5312],
        [ 0.3457,  1.0078,  0.5273,  1.6797],
        [-0.6172,  2.2344,  1.4219,  2.2031]], requires_grad=True)
2024-10-08 15:06:30,238 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.6836,  0.8555,  0.3398,  0.1270],
        [ 2.0312, -1.3828, -1.3516,  1.8906],
        [ 1.1953, -0.8359, -0.4883,  0.7148],
        ...,
        [-0.1504, -0.9727, -0.9023, -1.4688],
        [ 0.4043,  0.8359,  0.4355,  1.6250],
        [-0.5547,  1.9297,  1.2656,  2.1250]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.6836,  0.8555,  0.3398,  0.1270],
        [ 2.0312, -1.3828, -1.3516,  1.8906],
        [ 1.1953, -0.8359, -0.4883,  0.7148],
        ...,
        [-0.1504, -0.9727, -0.9023, -1.4688],
        [ 0.4043,  0.8359,  0.4355,  1.6250],
        [-0.5547,  1.9297,  1.2656,  2.1250]], requires_grad=True)
2024-10-08 15:06:30,494 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.7422,  0.9648,  0.3926,  0.1260],
        [ 1.9141, -1.2344, -1.2578,  1.8594],
        [ 1.1328, -0.8438, -0.4902,  0.6992],
        ...,
        [-0.0771, -0.8867, -0.8594, -1.4453],
        [ 0.2295,  0.9102,  0.4785,  1.5938],
        [-0.6367,  1.8906,  1.2422,  2.0781]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.7422,  0.9648,  0.3926,  0.1260],
        [ 1.9141, -1.2344, -1.2578,  1.8594],
        [ 1.1328, -0.8438, -0.4902,  0.6992],
        ...,
        [-0.0771, -0.8867, -0.8594, -1.4453],
        [ 0.2295,  0.9102,  0.4785,  1.5938],
        [-0.6367,  1.8906,  1.2422,  2.0781]], requires_grad=True)
2024-10-08 15:06:30,760 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.7500,  0.9492,  0.3652,  0.0469],
        [ 1.8281, -1.0703, -1.1484,  1.8516],
        [ 1.0703, -0.8320, -0.4785,  0.6914],
        ...,
        [ 0.0583, -0.8789, -0.8633, -1.4297],
        [-0.0092,  1.1094,  0.6016,  1.6172],
        [-0.7188,  2.0156,  1.3125,  2.1250]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.7500,  0.9492,  0.3652,  0.0469],
        [ 1.8281, -1.0703, -1.1484,  1.8516],
        [ 1.0703, -0.8320, -0.4785,  0.6914],
        ...,
        [ 0.0583, -0.8789, -0.8633, -1.4297],
        [-0.0092,  1.1094,  0.6016,  1.6172],
        [-0.7188,  2.0156,  1.3125,  2.1250]], requires_grad=True)
2024-10-08 15:06:31,013 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.7148,  0.9102,  0.3262, -0.0264],
        [ 1.6484, -0.8438, -0.9961,  1.8438],
        [ 0.9844, -0.8008, -0.4551,  0.6875],
        ...,
        [ 0.2559, -0.9375, -0.9141, -1.4375],
        [-0.2217,  1.2969,  0.7148,  1.6328],
        [-0.7852,  2.0938,  1.3672,  2.1562]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.7148,  0.9102,  0.3262, -0.0264],
        [ 1.6484, -0.8438, -0.9961,  1.8438],
        [ 0.9844, -0.8008, -0.4551,  0.6875],
        ...,
        [ 0.2559, -0.9375, -0.9141, -1.4375],
        [-0.2217,  1.2969,  0.7148,  1.6328],
        [-0.7852,  2.0938,  1.3672,  2.1562]], requires_grad=True)
2024-10-08 15:06:31,271 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.6836,  0.8828,  0.3008, -0.0742],
        [ 1.9375, -0.9531, -1.0781,  1.7891],
        [ 1.0312, -0.8320, -0.4766,  0.6602],
        ...,
        [ 0.2773, -0.8945, -0.8906, -1.4062],
        [-0.2266,  1.3438,  0.7422,  1.6172],
        [-0.6328,  1.9844,  1.3047,  2.1250]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.6836,  0.8828,  0.3008, -0.0742],
        [ 1.9375, -0.9531, -1.0781,  1.7891],
        [ 1.0312, -0.8320, -0.4766,  0.6602],
        ...,
        [ 0.2773, -0.8945, -0.8906, -1.4062],
        [-0.2266,  1.3438,  0.7422,  1.6172],
        [-0.6328,  1.9844,  1.3047,  2.1250]], requires_grad=True)
2024-10-08 15:06:31,522 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.7188,  0.8789,  0.2910, -0.1118],
        [ 2.3438, -1.1406, -1.2109,  1.7188],
        [ 1.2891, -0.9375, -0.5469,  0.6172],
        ...,
        [ 0.0510, -0.7305, -0.7773, -1.3672],
        [ 0.0442,  1.2500,  0.6797,  1.5859],
        [-0.4160,  1.8281,  1.2109,  2.0781]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.7188,  0.8789,  0.2910, -0.1118],
        [ 2.3438, -1.1406, -1.2109,  1.7188],
        [ 1.2891, -0.9375, -0.5469,  0.6172],
        ...,
        [ 0.0510, -0.7305, -0.7773, -1.3672],
        [ 0.0442,  1.2500,  0.6797,  1.5859],
        [-0.4160,  1.8281,  1.2109,  2.0781]], requires_grad=True)
2024-10-08 15:06:31,768 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.5156,  0.8008,  0.2441, -0.1494],
        [ 2.3906, -1.1172, -1.2109,  1.6719],
        [ 1.3594, -0.9688, -0.5703,  0.5898],
        ...,
        [ 0.2109, -0.7734, -0.7969, -1.3594],
        [ 0.0630,  1.2734,  0.6914,  1.5703],
        [-0.5586,  1.9062,  1.2344,  2.0625]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.5156,  0.8008,  0.2441, -0.1494],
        [ 2.3906, -1.1172, -1.2109,  1.6719],
        [ 1.3594, -0.9688, -0.5703,  0.5898],
        ...,
        [ 0.2109, -0.7734, -0.7969, -1.3594],
        [ 0.0630,  1.2734,  0.6914,  1.5703],
        [-0.5586,  1.9062,  1.2344,  2.0625]], requires_grad=True)
2024-10-08 15:06:32,028 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.1396,  0.6289,  0.1436, -0.2188],
        [ 2.1562, -0.8477, -1.0469,  1.6953],
        [ 1.2734, -0.9219, -0.5469,  0.5820],
        ...,
        [ 0.5352, -0.9297, -0.8906, -1.3750],
        [-0.1416,  1.4219,  0.7734,  1.5703],
        [-0.7500,  2.0781,  1.3125,  2.0938]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.1396,  0.6289,  0.1436, -0.2188],
        [ 2.1562, -0.8477, -1.0469,  1.6953],
        [ 1.2734, -0.9219, -0.5469,  0.5820],
        ...,
        [ 0.5352, -0.9297, -0.8906, -1.3750],
        [-0.1416,  1.4219,  0.7734,  1.5703],
        [-0.7500,  2.0781,  1.3125,  2.0938]], requires_grad=True)
2024-10-08 15:06:32,300 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 0.1201,  0.5234,  0.0835, -0.2676],
        [ 1.9844, -0.6562, -0.9375,  1.6875],
        [ 1.2422, -0.9062, -0.5430,  0.5742],
        ...,
        [ 0.6758, -0.9688, -0.9062, -1.3828],
        [-0.2363,  1.4922,  0.8047,  1.5625],
        [-0.8242,  2.1250,  1.3359,  2.1094]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 0.1201,  0.5234,  0.0835, -0.2676],
        [ 1.9844, -0.6562, -0.9375,  1.6875],
        [ 1.2422, -0.9062, -0.5430,  0.5742],
        ...,
        [ 0.6758, -0.9688, -0.9062, -1.3828],
        [-0.2363,  1.4922,  0.8047,  1.5625],
        [-0.8242,  2.1250,  1.3359,  2.1094]], requires_grad=True)
2024-10-08 15:06:32,568 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 0.2305,  0.4863,  0.0630, -0.3262],
        [ 2.0312, -0.7148, -0.9922,  1.6953],
        [ 1.3203, -0.9531, -0.5820,  0.5742],
        ...,
        [ 0.6523, -0.8789, -0.8320, -1.3984],
        [-0.2178,  1.4688,  0.7852,  1.5625],
        [-0.8438,  2.1094,  1.3203,  2.1250]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 0.2305,  0.4863,  0.0630, -0.3262],
        [ 2.0312, -0.7148, -0.9922,  1.6953],
        [ 1.3203, -0.9531, -0.5820,  0.5742],
        ...,
        [ 0.6523, -0.8789, -0.8320, -1.3984],
        [-0.2178,  1.4688,  0.7852,  1.5625],
        [-0.8438,  2.1094,  1.3203,  2.1250]], requires_grad=True)
2024-10-08 15:06:32,825 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 0.3340,  0.4883,  0.0688, -0.3555],
        [ 2.0625, -0.7344, -1.0078,  1.6875],
        [ 1.3750, -0.9766, -0.6016,  0.5703],
        ...,
        [ 0.6445, -0.8086, -0.7773, -1.3984],
        [-0.2168,  1.4766,  0.7852,  1.5625],
        [-0.8633,  2.0625,  1.2891,  2.1094]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 0.3340,  0.4883,  0.0688, -0.3555],
        [ 2.0625, -0.7344, -1.0078,  1.6875],
        [ 1.3750, -0.9766, -0.6016,  0.5703],
        ...,
        [ 0.6445, -0.8086, -0.7773, -1.3984],
        [-0.2168,  1.4766,  0.7852,  1.5625],
        [-0.8633,  2.0625,  1.2891,  2.1094]], requires_grad=True)
2024-10-08 15:06:32,964 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 0.4316,  0.4727,  0.0654, -0.3809],
        [ 2.0625, -0.6992, -0.9922,  1.6797],
        [ 1.4062, -0.9805, -0.6055,  0.5625],
        ...,
        [ 0.6641, -0.7656, -0.7383, -1.3828],
        [-0.2422,  1.5156,  0.8047,  1.5469],
        [-0.9023,  2.0625,  1.2734,  2.0781]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 0.4316,  0.4727,  0.0654, -0.3809],
        [ 2.0625, -0.6992, -0.9922,  1.6797],
        [ 1.4062, -0.9805, -0.6055,  0.5625],
        ...,
        [ 0.6641, -0.7656, -0.7383, -1.3828],
        [-0.2422,  1.5156,  0.8047,  1.5469],
        [-0.9023,  2.0625,  1.2734,  2.0781]], requires_grad=True)
2024-10-08 15:06:33,228 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 0.5117,  0.4473,  0.0559, -0.4023],
        [ 2.0469, -0.7070, -0.9961,  1.6641],
        [ 1.4297, -1.0000, -0.6211,  0.5547],
        ...,
        [ 0.6875, -0.6914, -0.6797, -1.3594],
        [-0.2715,  1.5312,  0.8125,  1.5234],
        [-0.9375,  2.0312,  1.2500,  2.0469]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 0.5117,  0.4473,  0.0559, -0.4023],
        [ 2.0469, -0.7070, -0.9961,  1.6641],
        [ 1.4297, -1.0000, -0.6211,  0.5547],
        ...,
        [ 0.6875, -0.6914, -0.6797, -1.3594],
        [-0.2715,  1.5312,  0.8125,  1.5234],
        [-0.9375,  2.0312,  1.2500,  2.0469]], requires_grad=True)
2024-10-08 15:06:33,495 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 0.5820,  0.4258,  0.0496, -0.4180],
        [ 2.0156, -0.7070, -0.9961,  1.6406],
        [ 1.4453, -1.0078, -0.6289,  0.5430],
        ...,
        [ 0.7109, -0.6445, -0.6406, -1.3281],
        [-0.3027,  1.5547,  0.8242,  1.4844],
        [-0.9688,  2.0000,  1.2266,  2.0156]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 0.5820,  0.4258,  0.0496, -0.4180],
        [ 2.0156, -0.7070, -0.9961,  1.6406],
        [ 1.4453, -1.0078, -0.6289,  0.5430],
        ...,
        [ 0.7109, -0.6445, -0.6406, -1.3281],
        [-0.3027,  1.5547,  0.8242,  1.4844],
        [-0.9688,  2.0000,  1.2266,  2.0156]], requires_grad=True)
2024-10-08 15:06:33,659 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 0.6562,  0.4453,  0.0654, -0.4492],
        [ 1.9688, -0.7578, -1.0312,  1.6250],
        [ 1.4375, -1.0469, -0.6523,  0.5430],
        ...,
        [ 0.7422, -0.5664, -0.5781, -1.3047],
        [-0.3340,  1.5391,  0.8125,  1.4609],
        [-1.0078,  1.8828,  1.1562,  2.0000]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 0.6562,  0.4453,  0.0654, -0.4492],
        [ 1.9688, -0.7578, -1.0312,  1.6250],
        [ 1.4375, -1.0469, -0.6523,  0.5430],
        ...,
        [ 0.7422, -0.5664, -0.5781, -1.3047],
        [-0.3340,  1.5391,  0.8125,  1.4609],
        [-1.0078,  1.8828,  1.1562,  2.0000]], requires_grad=True)
2024-10-08 15:06:33,919 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 0.7148,  0.4258,  0.0579, -0.4570],
        [ 1.9844, -0.6719, -0.9688,  1.5938],
        [ 1.4453, -1.0547, -0.6562,  0.5352],
        ...,
        [ 0.7617, -0.5664, -0.5703, -1.2500],
        [-0.3359,  1.6016,  0.8516,  1.4141],
        [-1.0234,  1.8203,  1.1172,  1.9766]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 0.7148,  0.4258,  0.0579, -0.4570],
        [ 1.9844, -0.6719, -0.9688,  1.5938],
        [ 1.4453, -1.0547, -0.6562,  0.5352],
        ...,
        [ 0.7617, -0.5664, -0.5703, -1.2500],
        [-0.3359,  1.6016,  0.8516,  1.4141],
        [-1.0234,  1.8203,  1.1172,  1.9766]], requires_grad=True)
2024-10-08 15:06:34,173 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 0.7969,  0.4336,  0.0654, -0.4727],
        [ 1.9688, -0.6406, -0.9375,  1.5781],
        [ 1.4219, -1.0703, -0.6680,  0.5312],
        ...,
        [ 0.8047, -0.5234, -0.5391, -1.2109],
        [-0.3633,  1.6172,  0.8633,  1.3828],
        [-1.0625,  1.6953,  1.0547,  1.9688]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 0.7969,  0.4336,  0.0654, -0.4727],
        [ 1.9688, -0.6406, -0.9375,  1.5781],
        [ 1.4219, -1.0703, -0.6680,  0.5312],
        ...,
        [ 0.8047, -0.5234, -0.5391, -1.2109],
        [-0.3633,  1.6172,  0.8633,  1.3828],
        [-1.0625,  1.6953,  1.0547,  1.9688]], requires_grad=True)
2024-10-08 15:06:34,430 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 0.8281,  0.4141,  0.0588, -0.4785],
        [ 1.9609, -0.5820, -0.8906,  1.5469],
        [ 1.4141, -1.0703, -0.6680,  0.5195],
        ...,
        [ 0.8281, -0.5078, -0.5234, -1.1641],
        [-0.3887,  1.6328,  0.8711,  1.3438],
        [-1.0859,  1.6094,  1.0078,  1.9453]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 0.8281,  0.4141,  0.0588, -0.4785],
        [ 1.9609, -0.5820, -0.8906,  1.5469],
        [ 1.4141, -1.0703, -0.6680,  0.5195],
        ...,
        [ 0.8281, -0.5078, -0.5234, -1.1641],
        [-0.3887,  1.6328,  0.8711,  1.3438],
        [-1.0859,  1.6094,  1.0078,  1.9453]], requires_grad=True)
2024-10-08 15:06:34,696 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 0.8398,  0.3809,  0.0449, -0.4766],
        [ 1.9375, -0.5391, -0.8516,  1.5078],
        [ 1.3984, -1.0625, -0.6641,  0.5039],
        ...,
        [ 0.8594, -0.4922, -0.5039, -1.1172],
        [-0.4219,  1.6406,  0.8750,  1.3047],
        [-1.1094,  1.5391,  0.9609,  1.9219]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 0.8398,  0.3809,  0.0449, -0.4766],
        [ 1.9375, -0.5391, -0.8516,  1.5078],
        [ 1.3984, -1.0625, -0.6641,  0.5039],
        ...,
        [ 0.8594, -0.4922, -0.5039, -1.1172],
        [-0.4219,  1.6406,  0.8750,  1.3047],
        [-1.1094,  1.5391,  0.9609,  1.9219]], requires_grad=True)
2024-10-08 15:06:34,961 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 0.9219,  0.4434,  0.0796, -0.4941],
        [ 1.8672, -0.5820, -0.8594,  1.4766],
        [ 1.3594, -1.0781, -0.6758,  0.4922],
        ...,
        [ 0.9141, -0.4199, -0.4531, -1.0781],
        [-0.4844,  1.5781,  0.8438,  1.2734],
        [-1.1562,  1.3750,  0.8789,  1.9141]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 0.9219,  0.4434,  0.0796, -0.4941],
        [ 1.8672, -0.5820, -0.8594,  1.4766],
        [ 1.3594, -1.0781, -0.6758,  0.4922],
        ...,
        [ 0.9141, -0.4199, -0.4531, -1.0781],
        [-0.4844,  1.5781,  0.8438,  1.2734],
        [-1.1562,  1.3750,  0.8789,  1.9141]], requires_grad=True)
2024-10-08 15:06:35,218 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 1.0078,  0.5117,  0.1196, -0.5000],
        [ 1.7969, -0.5742, -0.8398,  1.4375],
        [ 1.3203, -1.0781, -0.6758,  0.4785],
        ...,
        [ 0.9766, -0.4219, -0.4473, -1.0312],
        [-0.5625,  1.5938,  0.8594,  1.2266],
        [-1.2188,  1.3047,  0.8398,  1.8828]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 1.0078,  0.5117,  0.1196, -0.5000],
        [ 1.7969, -0.5742, -0.8398,  1.4375],
        [ 1.3203, -1.0781, -0.6758,  0.4785],
        ...,
        [ 0.9766, -0.4219, -0.4473, -1.0312],
        [-0.5625,  1.5938,  0.8594,  1.2266],
        [-1.2188,  1.3047,  0.8398,  1.8828]], requires_grad=True)
2024-10-08 15:06:35,368 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 1.0781,  0.5742,  0.1562, -0.5039],
        [ 1.7188, -0.5586, -0.8164,  1.3906],
        [ 1.2812, -1.0781, -0.6758,  0.4648],
        ...,
        [ 1.0469, -0.4707, -0.4707, -0.9727],
        [-0.6445,  1.6406,  0.8828,  1.1641],
        [-1.2734,  1.2578,  0.8125,  1.8438]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 1.0781,  0.5742,  0.1562, -0.5039],
        [ 1.7188, -0.5586, -0.8164,  1.3906],
        [ 1.2812, -1.0781, -0.6758,  0.4648],
        ...,
        [ 1.0469, -0.4707, -0.4707, -0.9727],
        [-0.6445,  1.6406,  0.8828,  1.1641],
        [-1.2734,  1.2578,  0.8125,  1.8438]], requires_grad=True)
2024-10-08 15:06:35,624 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 1.1406,  0.6211,  0.1826, -0.5039],
        [ 1.6406, -0.5547, -0.8047,  1.3516],
        [ 1.2422, -1.0703, -0.6680,  0.4492],
        ...,
        [ 1.1094, -0.5508, -0.5117, -0.9141],
        [-0.7148,  1.7031,  0.9180,  1.1094],
        [-1.3203,  1.2031,  0.7812,  1.7969]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 1.1406,  0.6211,  0.1826, -0.5039],
        [ 1.6406, -0.5547, -0.8047,  1.3516],
        [ 1.2422, -1.0703, -0.6680,  0.4492],
        ...,
        [ 1.1094, -0.5508, -0.5117, -0.9141],
        [-0.7148,  1.7031,  0.9180,  1.1094],
        [-1.3203,  1.2031,  0.7812,  1.7969]], requires_grad=True)
2024-10-08 15:06:35,881 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 1.1953,  0.6719,  0.2139, -0.5078],
        [ 1.5312, -0.6328, -0.8477,  1.3125],
        [ 1.1641, -1.1094, -0.6953,  0.4395],
        ...,
        [ 1.2266, -0.4570, -0.4414, -0.8711],
        [-0.8164,  1.6328,  0.8789,  1.0703],
        [-1.3906,  0.9609,  0.6523,  1.7891]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 1.1953,  0.6719,  0.2139, -0.5078],
        [ 1.5312, -0.6328, -0.8477,  1.3125],
        [ 1.1641, -1.1094, -0.6953,  0.4395],
        ...,
        [ 1.2266, -0.4570, -0.4414, -0.8711],
        [-0.8164,  1.6328,  0.8789,  1.0703],
        [-1.3906,  0.9609,  0.6523,  1.7891]], requires_grad=True)
2024-10-08 15:06:36,140 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 1.2031,  0.7852,  0.2793, -0.5273],
        [ 1.4375, -0.6758, -0.8594,  1.2734],
        [ 1.0938, -1.1328, -0.7109,  0.4258],
        ...,
        [ 1.3047, -0.3848, -0.3867, -0.8359],
        [-0.8945,  1.5859,  0.8516,  1.0312],
        [-1.4375,  0.7578,  0.5430,  1.7734]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 1.2031,  0.7852,  0.2793, -0.5273],
        [ 1.4375, -0.6758, -0.8594,  1.2734],
        [ 1.0938, -1.1328, -0.7109,  0.4258],
        ...,
        [ 1.3047, -0.3848, -0.3867, -0.8359],
        [-0.8945,  1.5859,  0.8516,  1.0312],
        [-1.4375,  0.7578,  0.5430,  1.7734]], requires_grad=True)
2024-10-08 15:06:36,300 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 1.0859,  0.7617,  0.2559, -0.5938],
        [ 1.5234, -0.3203, -0.5664,  1.3203],
        [ 1.1016, -1.1250, -0.6992,  0.4395],
        ...,
        [ 1.2344, -0.3887, -0.4004, -0.8633],
        [-0.8984,  1.5078,  0.8086,  1.0234],
        [-1.4062,  0.6250,  0.4824,  1.7891]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 1.0859,  0.7617,  0.2559, -0.5938],
        [ 1.5234, -0.3203, -0.5664,  1.3203],
        [ 1.1016, -1.1250, -0.6992,  0.4395],
        ...,
        [ 1.2344, -0.3887, -0.4004, -0.8633],
        [-0.8984,  1.5078,  0.8086,  1.0234],
        [-1.4062,  0.6250,  0.4824,  1.7891]], requires_grad=True)
2024-10-08 15:06:36,445 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 1.0703,  0.6680,  0.1895, -0.6250],
        [ 1.3828,  0.4668,  0.0249,  1.3203],
        [ 0.9766, -0.9453, -0.5742,  0.4316],
        ...,
        [ 1.2422, -0.5742, -0.5430, -0.8750],
        [-1.0312,  1.6797,  0.9297,  0.9922],
        [-1.4453,  0.6250,  0.4941,  1.7812]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 1.0703,  0.6680,  0.1895, -0.6250],
        [ 1.3828,  0.4668,  0.0249,  1.3203],
        [ 0.9766, -0.9453, -0.5742,  0.4316],
        ...,
        [ 1.2422, -0.5742, -0.5430, -0.8750],
        [-1.0312,  1.6797,  0.9297,  0.9922],
        [-1.4453,  0.6250,  0.4941,  1.7812]], requires_grad=True)
2024-10-08 15:06:36,697 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 1.0859,  0.5664,  0.1226, -0.6445],
        [ 1.3047,  1.1016,  0.5078,  1.3203],
        [ 0.8633, -0.7852, -0.4648,  0.4219],
        ...,
        [ 1.1562, -0.6289, -0.5938, -0.8945],
        [-1.0938,  1.7656,  0.9961,  0.9688],
        [-1.4375,  0.5586,  0.4629,  1.7656]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 1.0859,  0.5664,  0.1226, -0.6445],
        [ 1.3047,  1.1016,  0.5078,  1.3203],
        [ 0.8633, -0.7852, -0.4648,  0.4219],
        ...,
        [ 1.1562, -0.6289, -0.5938, -0.8945],
        [-1.0938,  1.7656,  0.9961,  0.9688],
        [-1.4375,  0.5586,  0.4629,  1.7656]], requires_grad=True)
2024-10-08 15:06:36,951 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 0.7617,  0.7109,  0.1973, -0.6953],
        [ 1.5391,  1.2812,  0.6836,  1.3594],
        [ 0.8438, -0.7227, -0.4180,  0.4141],
        ...,
        [ 0.9297, -0.4883, -0.5117, -0.9180],
        [-1.0234,  1.6953,  0.9648,  0.9531],
        [-1.4141,  0.4414,  0.4023,  1.7422]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 0.7617,  0.7109,  0.1973, -0.6953],
        [ 1.5391,  1.2812,  0.6836,  1.3594],
        [ 0.8438, -0.7227, -0.4180,  0.4141],
        ...,
        [ 0.9297, -0.4883, -0.5117, -0.9180],
        [-1.0234,  1.6953,  0.9648,  0.9531],
        [-1.4141,  0.4414,  0.4023,  1.7422]], requires_grad=True)
2024-10-08 15:06:37,215 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 0.4746,  0.8242,  0.2539, -0.7461],
        [ 1.8047,  1.3516,  0.7891,  1.3906],
        [ 0.8398, -0.6719, -0.3828,  0.4043],
        ...,
        [ 0.7305, -0.3594, -0.4355, -0.9336],
        [-0.9141,  1.5703,  0.9062,  0.9336],
        [-1.3750,  0.3301,  0.3457,  1.7188]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 0.4746,  0.8242,  0.2539, -0.7461],
        [ 1.8047,  1.3516,  0.7891,  1.3906],
        [ 0.8398, -0.6719, -0.3828,  0.4043],
        ...,
        [ 0.7305, -0.3594, -0.4355, -0.9336],
        [-0.9141,  1.5703,  0.9062,  0.9336],
        [-1.3750,  0.3301,  0.3457,  1.7188]], requires_grad=True)
2024-10-08 15:06:37,367 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 0.1289,  0.7617,  0.2070, -0.9180],
        [ 2.0469,  1.4609,  0.9141,  1.4453],
        [ 0.8398, -0.6016, -0.3320,  0.4102],
        ...,
        [ 0.5195, -0.3887, -0.4648, -1.0234],
        [-0.7812,  1.5391,  0.9023,  0.9688],
        [-1.3203,  0.3555,  0.3652,  1.7578]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 0.1289,  0.7617,  0.2070, -0.9180],
        [ 2.0469,  1.4609,  0.9141,  1.4453],
        [ 0.8398, -0.6016, -0.3320,  0.4102],
        ...,
        [ 0.5195, -0.3887, -0.4648, -1.0234],
        [-0.7812,  1.5391,  0.9023,  0.9688],
        [-1.3203,  0.3555,  0.3652,  1.7578]], requires_grad=True)
2024-10-08 15:06:37,522 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.1328,  0.6562,  0.1416, -1.0781],
        [ 2.2656,  1.5234,  1.0000,  1.4766],
        [ 0.8750, -0.5625, -0.2988,  0.4160],
        ...,
        [ 0.2305, -0.3145, -0.4375, -1.0938],
        [-0.5977,  1.4453,  0.8633,  0.9961],
        [-1.2109,  0.3066,  0.3496,  1.7891]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.1328,  0.6562,  0.1416, -1.0781],
        [ 2.2656,  1.5234,  1.0000,  1.4766],
        [ 0.8750, -0.5625, -0.2988,  0.4160],
        ...,
        [ 0.2305, -0.3145, -0.4375, -1.0938],
        [-0.5977,  1.4453,  0.8633,  0.9961],
        [-1.2109,  0.3066,  0.3496,  1.7891]], requires_grad=True)
2024-10-08 15:06:37,679 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.3203,  0.5312,  0.0728, -1.2266],
        [ 2.5156,  1.5078,  1.0469,  1.4922],
        [ 0.9531, -0.5664, -0.2832,  0.4141],
        ...,
        [-0.1230, -0.1592, -0.3750, -1.1406],
        [-0.3457,  1.2734,  0.7969,  1.0000],
        [-1.0547,  0.2080,  0.3184,  1.8125]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.3203,  0.5312,  0.0728, -1.2266],
        [ 2.5156,  1.5078,  1.0469,  1.4922],
        [ 0.9531, -0.5664, -0.2832,  0.4141],
        ...,
        [-0.1230, -0.1592, -0.3750, -1.1406],
        [-0.3457,  1.2734,  0.7969,  1.0000],
        [-1.0547,  0.2080,  0.3184,  1.8125]], requires_grad=True)
2024-10-08 15:06:37,940 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-3.4961e-01,  3.7305e-01,  1.7700e-03, -1.3438e+00],
        [ 2.6406e+00,  1.5391e+00,  1.0938e+00,  1.5000e+00],
        [ 1.0000e+00, -5.6250e-01, -2.6953e-01,  4.1016e-01],
        ...,
        [-1.5332e-01, -1.6309e-01, -3.5352e-01, -1.1797e+00],
        [-3.4180e-01,  1.2344e+00,  7.6172e-01,  1.0000e+00],
        [-1.0000e+00,  1.7676e-01,  3.0078e-01,  1.8203e+00]],
       requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-3.4961e-01,  3.7305e-01,  1.7700e-03, -1.3438e+00],
        [ 2.6406e+00,  1.5391e+00,  1.0938e+00,  1.5000e+00],
        [ 1.0000e+00, -5.6250e-01, -2.6953e-01,  4.1016e-01],
        ...,
        [-1.5332e-01, -1.6309e-01, -3.5352e-01, -1.1797e+00],
        [-3.4180e-01,  1.2344e+00,  7.6172e-01,  1.0000e+00],
        [-1.0000e+00,  1.7676e-01,  3.0078e-01,  1.8203e+00]],
       requires_grad=True)
2024-10-08 15:06:38,207 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.2559,  0.1279, -0.0908, -1.5078],
        [ 2.5625,  1.7969,  1.2109,  1.5781],
        [ 0.9883, -0.5195, -0.2451,  0.4199],
        ...,
        [-0.0154, -0.3262, -0.3848, -1.2578],
        [-0.4512,  1.2969,  0.7578,  1.0312],
        [-0.9883,  0.2227,  0.3027,  1.8516]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.2559,  0.1279, -0.0908, -1.5078],
        [ 2.5625,  1.7969,  1.2109,  1.5781],
        [ 0.9883, -0.5195, -0.2451,  0.4199],
        ...,
        [-0.0154, -0.3262, -0.3848, -1.2578],
        [-0.4512,  1.2969,  0.7578,  1.0312],
        [-0.9883,  0.2227,  0.3027,  1.8516]], requires_grad=True)
2024-10-08 15:06:38,460 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.2080, -0.0491, -0.1602, -1.6328],
        [ 2.6406,  1.9062,  1.2812,  1.6719],
        [ 1.0469, -0.5000, -0.2266,  0.4395],
        ...,
        [-0.2520, -0.2148, -0.3379, -1.3516],
        [-0.4199,  1.2578,  0.7266,  1.0703],
        [-0.9766,  0.2490,  0.3008,  1.8672]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.2080, -0.0491, -0.1602, -1.6328],
        [ 2.6406,  1.9062,  1.2812,  1.6719],
        [ 1.0469, -0.5000, -0.2266,  0.4395],
        ...,
        [-0.2520, -0.2148, -0.3379, -1.3516],
        [-0.4199,  1.2578,  0.7266,  1.0703],
        [-0.9766,  0.2490,  0.3008,  1.8672]], requires_grad=True)
2024-10-08 15:06:38,713 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.2031, -0.1826, -0.2178, -1.7422],
        [ 2.8906,  1.7656,  1.2812,  1.7422],
        [ 1.1641, -0.5312, -0.2207,  0.4512],
        ...,
        [-0.4492, -0.1309, -0.3008, -1.4375],
        [-0.3320,  1.1562,  0.6836,  1.0938],
        [-0.9375,  0.2412,  0.2930,  1.8750]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.2031, -0.1826, -0.2178, -1.7422],
        [ 2.8906,  1.7656,  1.2812,  1.7422],
        [ 1.1641, -0.5312, -0.2207,  0.4512],
        ...,
        [-0.4492, -0.1309, -0.3008, -1.4375],
        [-0.3320,  1.1562,  0.6836,  1.0938],
        [-0.9375,  0.2412,  0.2930,  1.8750]], requires_grad=True)
2024-10-08 15:06:38,874 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.1748, -0.3047, -0.2656, -1.8281],
        [ 3.1250,  1.5859,  1.2812,  1.7734],
        [ 1.2578, -0.5664, -0.2158,  0.4531],
        ...,
        [-0.6406, -0.0374, -0.2637, -1.5000],
        [-0.2314,  1.0469,  0.6445,  1.1016],
        [-0.9102,  0.2305,  0.2852,  1.8750]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.1748, -0.3047, -0.2656, -1.8281],
        [ 3.1250,  1.5859,  1.2812,  1.7734],
        [ 1.2578, -0.5664, -0.2158,  0.4531],
        ...,
        [-0.6406, -0.0374, -0.2637, -1.5000],
        [-0.2314,  1.0469,  0.6445,  1.1016],
        [-0.9102,  0.2305,  0.2852,  1.8750]], requires_grad=True)
2024-10-08 15:06:39,129 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.1357, -0.4180, -0.3066, -1.8984],
        [ 3.2969,  1.5000,  1.2656,  1.8672],
        [ 1.2656, -0.5508, -0.2188,  0.4922],
        ...,
        [-0.7617,  0.0137, -0.2256, -1.5625],
        [-0.2100,  0.9961,  0.5977,  1.1406],
        [-0.9180,  0.2461,  0.2715,  1.8750]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.1357, -0.4180, -0.3066, -1.8984],
        [ 3.2969,  1.5000,  1.2656,  1.8672],
        [ 1.2656, -0.5508, -0.2188,  0.4922],
        ...,
        [-0.7617,  0.0137, -0.2256, -1.5625],
        [-0.2100,  0.9961,  0.5977,  1.1406],
        [-0.9180,  0.2461,  0.2715,  1.8750]], requires_grad=True)
2024-10-08 15:06:39,379 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.1050, -0.5195, -0.3398, -1.9688],
        [ 3.4219,  1.4609,  1.2266,  1.9688],
        [ 1.2422, -0.5117, -0.2314,  0.5469],
        ...,
        [-0.8203,  0.0442, -0.1816, -1.6172],
        [-0.2393,  0.9766,  0.5391,  1.1875],
        [-0.9570,  0.2910,  0.2441,  1.8906]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.1050, -0.5195, -0.3398, -1.9688],
        [ 3.4219,  1.4609,  1.2266,  1.9688],
        [ 1.2422, -0.5117, -0.2314,  0.5469],
        ...,
        [-0.8203,  0.0442, -0.1816, -1.6172],
        [-0.2393,  0.9766,  0.5391,  1.1875],
        [-0.9570,  0.2910,  0.2441,  1.8906]], requires_grad=True)
2024-10-08 15:06:39,543 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.0688, -0.6172, -0.3594, -2.0156],
        [ 3.5156,  1.4219,  1.1875,  2.0469],
        [ 1.2188, -0.4805, -0.2373,  0.5859],
        ...,
        [-0.8711,  0.0801, -0.1514, -1.6484],
        [-0.2656,  0.9648,  0.4805,  1.2266],
        [-0.9766,  0.3242,  0.2246,  1.8984]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.0688, -0.6172, -0.3594, -2.0156],
        [ 3.5156,  1.4219,  1.1875,  2.0469],
        [ 1.2188, -0.4805, -0.2373,  0.5859],
        ...,
        [-0.8711,  0.0801, -0.1514, -1.6484],
        [-0.2656,  0.9648,  0.4805,  1.2266],
        [-0.9766,  0.3242,  0.2246,  1.8984]], requires_grad=True)
2024-10-08 15:06:39,796 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.0238, -0.6797, -0.4023, -2.0469],
        [ 3.5781,  1.3594,  1.1953,  2.1094],
        [ 1.1875, -0.4590, -0.2305,  0.6211],
        ...,
        [-0.9102,  0.1289, -0.1562, -1.6719],
        [-0.2910,  0.9375,  0.4590,  1.2578],
        [-0.9883,  0.3496,  0.2119,  1.8984]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.0238, -0.6797, -0.4023, -2.0469],
        [ 3.5781,  1.3594,  1.1953,  2.1094],
        [ 1.1875, -0.4590, -0.2305,  0.6211],
        ...,
        [-0.9102,  0.1289, -0.1562, -1.6719],
        [-0.2910,  0.9375,  0.4590,  1.2578],
        [-0.9883,  0.3496,  0.2119,  1.8984]], requires_grad=True)
2024-10-08 15:06:40,049 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 0.0244, -0.7227, -0.4609, -2.0625],
        [ 3.6250,  1.2734,  1.2500,  2.1562],
        [ 1.1484, -0.4473, -0.2070,  0.6406],
        ...,
        [-0.9375,  0.1875, -0.1885, -1.6875],
        [-0.3164,  0.9023,  0.4512,  1.2734],
        [-0.9961,  0.3594,  0.2168,  1.8906]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 0.0244, -0.7227, -0.4609, -2.0625],
        [ 3.6250,  1.2734,  1.2500,  2.1562],
        [ 1.1484, -0.4473, -0.2070,  0.6406],
        ...,
        [-0.9375,  0.1875, -0.1885, -1.6875],
        [-0.3164,  0.9023,  0.4512,  1.2734],
        [-0.9961,  0.3594,  0.2168,  1.8906]], requires_grad=True)
2024-10-08 15:06:40,313 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 0.0806, -0.7656, -0.5078, -2.0781],
        [ 3.6094,  1.2500,  1.2656,  2.2031],
        [ 1.0781, -0.4141, -0.1973,  0.6562],
        ...,
        [-0.9102,  0.2002, -0.1943, -1.6953],
        [-0.3789,  0.8984,  0.4258,  1.2891],
        [-1.0391,  0.4062,  0.2012,  1.8828]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 0.0806, -0.7656, -0.5078, -2.0781],
        [ 3.6094,  1.2500,  1.2656,  2.2031],
        [ 1.0781, -0.4141, -0.1973,  0.6562],
        ...,
        [-0.9102,  0.2002, -0.1943, -1.6953],
        [-0.3789,  0.8984,  0.4258,  1.2891],
        [-1.0391,  0.4062,  0.2012,  1.8828]], requires_grad=True)
2024-10-08 15:06:40,470 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 0.1328, -0.8086, -0.5469, -2.0938],
        [ 3.6094,  1.2031,  1.2656,  2.2344],
        [ 1.0312, -0.3926, -0.1895,  0.6680],
        ...,
        [-0.9141,  0.2256, -0.1963, -1.6875],
        [-0.4141,  0.8867,  0.4004,  1.2969],
        [-1.0625,  0.4395,  0.1865,  1.8672]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 0.1328, -0.8086, -0.5469, -2.0938],
        [ 3.6094,  1.2031,  1.2656,  2.2344],
        [ 1.0312, -0.3926, -0.1895,  0.6680],
        ...,
        [-0.9141,  0.2256, -0.1963, -1.6875],
        [-0.4141,  0.8867,  0.4004,  1.2969],
        [-1.0625,  0.4395,  0.1865,  1.8672]], requires_grad=True)
2024-10-08 15:06:40,727 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 0.2051, -0.8633, -0.5898, -2.1094],
        [ 3.5938,  1.1875,  1.2891,  2.2812],
        [ 0.9961, -0.3691, -0.1797,  0.6797],
        ...,
        [-0.9648,  0.2598, -0.1924, -1.6875],
        [-0.4219,  0.8750,  0.3809,  1.3047],
        [-1.0938,  0.4922,  0.1846,  1.8594]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 0.2051, -0.8633, -0.5898, -2.1094],
        [ 3.5938,  1.1875,  1.2891,  2.2812],
        [ 0.9961, -0.3691, -0.1797,  0.6797],
        ...,
        [-0.9648,  0.2598, -0.1924, -1.6875],
        [-0.4219,  0.8750,  0.3809,  1.3047],
        [-1.0938,  0.4922,  0.1846,  1.8594]], requires_grad=True)
2024-10-08 15:06:40,885 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 0.2500, -0.8906, -0.6133, -2.1094],
        [ 3.6406,  1.1406,  1.2891,  2.3125],
        [ 0.9844, -0.3594, -0.1777,  0.6836],
        ...,
        [-1.1406,  0.3828, -0.1260, -1.6484],
        [-0.3242,  0.7891,  0.3184,  1.2812],
        [-1.0703,  0.4844,  0.1543,  1.8281]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 0.2500, -0.8906, -0.6133, -2.1094],
        [ 3.6406,  1.1406,  1.2891,  2.3125],
        [ 0.9844, -0.3594, -0.1777,  0.6836],
        ...,
        [-1.1406,  0.3828, -0.1260, -1.6484],
        [-0.3242,  0.7891,  0.3184,  1.2812],
        [-1.0703,  0.4844,  0.1543,  1.8281]], requires_grad=True)
2024-10-08 15:06:41,043 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 0.2441, -0.8906, -0.6211, -2.0781],
        [ 3.6406,  1.1250,  1.2969,  2.3438],
        [ 0.9453, -0.3359, -0.1670,  0.6914],
        ...,
        [-1.1875,  0.4297, -0.1094, -1.6328],
        [-0.3145,  0.7617,  0.2949,  1.2812],
        [-1.0703,  0.4922,  0.1348,  1.7891]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 0.2441, -0.8906, -0.6211, -2.0781],
        [ 3.6406,  1.1250,  1.2969,  2.3438],
        [ 0.9453, -0.3359, -0.1670,  0.6914],
        ...,
        [-1.1875,  0.4297, -0.1094, -1.6328],
        [-0.3145,  0.7617,  0.2949,  1.2812],
        [-1.0703,  0.4922,  0.1348,  1.7891]], requires_grad=True)
2024-10-08 15:06:41,294 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 0.2969, -0.9219, -0.6484, -2.0781],
        [ 3.4844,  1.2109,  1.3828,  2.4062],
        [ 0.8516, -0.2871, -0.1348,  0.7148],
        ...,
        [-1.1641,  0.4395, -0.1177, -1.6250],
        [-0.3535,  0.7656,  0.2949,  1.2891],
        [-1.1250,  0.5430,  0.1494,  1.7734]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 0.2969, -0.9219, -0.6484, -2.0781],
        [ 3.4844,  1.2109,  1.3828,  2.4062],
        [ 0.8516, -0.2871, -0.1348,  0.7148],
        ...,
        [-1.1641,  0.4395, -0.1177, -1.6250],
        [-0.3535,  0.7656,  0.2949,  1.2891],
        [-1.1250,  0.5430,  0.1494,  1.7734]], requires_grad=True)
2024-10-08 15:06:41,450 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 0.1787, -0.8945, -0.6328, -2.0469],
        [ 3.5469,  1.1562,  1.3516,  2.4062],
        [ 0.8477, -0.2754, -0.1318,  0.7148],
        ...,
        [-1.3438,  0.5430, -0.0408, -1.5625],
        [-0.2598,  0.7031,  0.2441,  1.2656],
        [-1.0312,  0.4922,  0.0933,  1.7109]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 0.1787, -0.8945, -0.6328, -2.0469],
        [ 3.5469,  1.1562,  1.3516,  2.4062],
        [ 0.8477, -0.2754, -0.1318,  0.7148],
        ...,
        [-1.3438,  0.5430, -0.0408, -1.5625],
        [-0.2598,  0.7031,  0.2441,  1.2656],
        [-1.0312,  0.4922,  0.0933,  1.7109]], requires_grad=True)
2024-10-08 15:06:41,612 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 0.1045, -0.8750, -0.6211, -2.0156],
        [ 3.5781,  1.1172,  1.3281,  2.3906],
        [ 0.8672, -0.2754, -0.1377,  0.7031],
        ...,
        [-1.3906,  0.5938, -0.0053, -1.5234],
        [-0.2754,  0.6875,  0.2275,  1.2578],
        [-0.9570,  0.4551,  0.0505,  1.6484]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 0.1045, -0.8750, -0.6211, -2.0156],
        [ 3.5781,  1.1172,  1.3281,  2.3906],
        [ 0.8672, -0.2754, -0.1377,  0.7031],
        ...,
        [-1.3906,  0.5938, -0.0053, -1.5234],
        [-0.2754,  0.6875,  0.2275,  1.2578],
        [-0.9570,  0.4551,  0.0505,  1.6484]], requires_grad=True)
2024-10-08 15:06:41,869 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 0.0280, -0.8555, -0.6094, -1.9844],
        [ 3.3750,  1.2266,  1.4141,  2.4688],
        [ 0.8086, -0.2471, -0.1235,  0.7148],
        ...,
        [-1.2422,  0.5508, -0.0342, -1.5312],
        [-0.5078,  0.7773,  0.2832,  1.3203],
        [-1.1328,  0.5742,  0.1064,  1.6953]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[ 0.0280, -0.8555, -0.6094, -1.9844],
        [ 3.3750,  1.2266,  1.4141,  2.4688],
        [ 0.8086, -0.2471, -0.1235,  0.7148],
        ...,
        [-1.2422,  0.5508, -0.0342, -1.5312],
        [-0.5078,  0.7773,  0.2832,  1.3203],
        [-1.1328,  0.5742,  0.1064,  1.6953]], requires_grad=True)
2024-10-08 15:06:42,126 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.0305, -0.8438, -0.6016, -1.9609],
        [ 3.2344,  1.2812,  1.4688,  2.5000],
        [ 0.7773, -0.2324, -0.1172,  0.7109],
        ...,
        [-1.2031,  0.5664, -0.0277, -1.4922],
        [-0.6211,  0.8008,  0.3047,  1.3281],
        [-1.2812,  0.6680,  0.1504,  1.7109]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.0305, -0.8438, -0.6016, -1.9609],
        [ 3.2344,  1.2812,  1.4688,  2.5000],
        [ 0.7773, -0.2324, -0.1172,  0.7109],
        ...,
        [-1.2031,  0.5664, -0.0277, -1.4922],
        [-0.6211,  0.8008,  0.3047,  1.3281],
        [-1.2812,  0.6680,  0.1504,  1.7109]], requires_grad=True)
2024-10-08 15:06:42,390 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.3008, -0.7500, -0.5508, -1.8594],
        [ 3.2812,  1.2266,  1.4531,  2.4688],
        [ 0.8594, -0.2559, -0.1299,  0.6797],
        ...,
        [-1.2891,  0.6406,  0.0115, -1.4141],
        [-0.6094,  0.7617,  0.2891,  1.2891],
        [-1.2734,  0.6836,  0.1621,  1.7031]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.3008, -0.7500, -0.5508, -1.8594],
        [ 3.2812,  1.2266,  1.4531,  2.4688],
        [ 0.8594, -0.2559, -0.1299,  0.6797],
        ...,
        [-1.2891,  0.6406,  0.0115, -1.4141],
        [-0.6094,  0.7617,  0.2891,  1.2891],
        [-1.2734,  0.6836,  0.1621,  1.7031]], requires_grad=True)
2024-10-08 15:06:42,657 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.5938, -0.6406, -0.4980, -1.7500],
        [ 3.3594,  1.1406,  1.4141,  2.4062],
        [ 0.9570, -0.2891, -0.1484,  0.6367],
        ...,
        [-1.3750,  0.7148,  0.0527, -1.3359],
        [-0.5703,  0.7070,  0.2695,  1.2344],
        [-1.2734,  0.6992,  0.1729,  1.6875]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.5938, -0.6406, -0.4980, -1.7500],
        [ 3.3594,  1.1406,  1.4141,  2.4062],
        [ 0.9570, -0.2891, -0.1484,  0.6367],
        ...,
        [-1.3750,  0.7148,  0.0527, -1.3359],
        [-0.5703,  0.7070,  0.2695,  1.2344],
        [-1.2734,  0.6992,  0.1729,  1.6875]], requires_grad=True)
2024-10-08 15:06:42,812 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.7422, -0.5742, -0.4629, -1.6562],
        [ 3.2656,  1.1641,  1.4297,  2.3906],
        [ 1.0156, -0.3086, -0.1592,  0.6016],
        ...,
        [-1.2656,  0.6875,  0.0430, -1.3047],
        [-0.6523,  0.7188,  0.2773,  1.2109],
        [-1.4297,  0.8203,  0.2266,  1.7266]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.7422, -0.5742, -0.4629, -1.6562],
        [ 3.2656,  1.1641,  1.4297,  2.3906],
        [ 1.0156, -0.3086, -0.1592,  0.6016],
        ...,
        [-1.2656,  0.6875,  0.0430, -1.3047],
        [-0.6523,  0.7188,  0.2773,  1.2109],
        [-1.4297,  0.8203,  0.2266,  1.7266]], requires_grad=True)
2024-10-08 15:06:43,065 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.6445, -0.5820, -0.4570, -1.5938],
        [ 3.1875,  1.1641,  1.4297,  2.3594],
        [ 0.9883, -0.3027, -0.1582,  0.5742],
        ...,
        [-1.0156,  0.6016,  0.0072, -1.2812],
        [-0.9570,  0.8203,  0.3242,  1.2109],
        [-1.7422,  1.0156,  0.3086,  1.7656]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.6445, -0.5820, -0.4570, -1.5938],
        [ 3.1875,  1.1641,  1.4297,  2.3594],
        [ 0.9883, -0.3027, -0.1582,  0.5742],
        ...,
        [-1.0156,  0.6016,  0.0072, -1.2812],
        [-0.9570,  0.8203,  0.3242,  1.2109],
        [-1.7422,  1.0156,  0.3086,  1.7656]], requires_grad=True)
2024-10-08 15:06:43,322 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.5664, -0.5781, -0.4473, -1.5234],
        [ 3.1094,  1.1562,  1.4219,  2.3281],
        [ 0.9922, -0.3086, -0.1631,  0.5469],
        ...,
        [-0.7070,  0.4707, -0.0522, -1.2812],
        [-1.2266,  0.9102,  0.3652,  1.2031],
        [-2.0938,  1.2422,  0.4043,  1.8125]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.5664, -0.5781, -0.4473, -1.5234],
        [ 3.1094,  1.1562,  1.4219,  2.3281],
        [ 0.9922, -0.3086, -0.1631,  0.5469],
        ...,
        [-0.7070,  0.4707, -0.0522, -1.2812],
        [-1.2266,  0.9102,  0.3652,  1.2031],
        [-2.0938,  1.2422,  0.4043,  1.8125]], requires_grad=True)
2024-10-08 15:06:43,577 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.7070, -0.4980, -0.4004, -1.4297],
        [ 3.4219,  0.8828,  1.2656,  2.2344],
        [ 1.1719, -0.3828, -0.2051,  0.4961],
        ...,
        [-0.6250,  0.4512, -0.0515, -1.2500],
        [-1.2188,  0.8516,  0.3301,  1.1484],
        [-2.1562,  1.2891,  0.4219,  1.8281]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.7070, -0.4980, -0.4004, -1.4297],
        [ 3.4219,  0.8828,  1.2656,  2.2344],
        [ 1.1719, -0.3828, -0.2051,  0.4961],
        ...,
        [-0.6250,  0.4512, -0.0515, -1.2500],
        [-1.2188,  0.8516,  0.3301,  1.1484],
        [-2.1562,  1.2891,  0.4219,  1.8281]], requires_grad=True)
2024-10-08 15:06:43,829 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.9570, -0.3613, -0.3262, -1.3203],
        [ 3.9844,  0.3770,  0.9648,  2.0625],
        [ 1.3672, -0.4648, -0.2520,  0.4395],
        ...,
        [-0.5508,  0.4297, -0.0522, -1.2188],
        [-1.1328,  0.7422,  0.2715,  1.0781],
        [-2.0312,  1.1719,  0.3613,  1.7891]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.9570, -0.3613, -0.3262, -1.3203],
        [ 3.9844,  0.3770,  0.9648,  2.0625],
        [ 1.3672, -0.4648, -0.2520,  0.4395],
        ...,
        [-0.5508,  0.4297, -0.0522, -1.2188],
        [-1.1328,  0.7422,  0.2715,  1.0781],
        [-2.0312,  1.1719,  0.3613,  1.7891]], requires_grad=True)
2024-10-08 15:06:44,092 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.1641, -0.3203, -0.3086, -1.2969],
        [ 4.4062, -0.0299,  0.7227,  1.8984],
        [ 1.5078, -0.5117, -0.2754,  0.4023],
        ...,
        [-0.3965,  0.3008, -0.1226, -1.2344],
        [-1.0703,  0.6992,  0.2520,  1.0469],
        [-1.9375,  1.0312,  0.2812,  1.6953]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.1641, -0.3203, -0.3086, -1.2969],
        [ 4.4062, -0.0299,  0.7227,  1.8984],
        [ 1.5078, -0.5117, -0.2754,  0.4023],
        ...,
        [-0.3965,  0.3008, -0.1226, -1.2344],
        [-1.0703,  0.6992,  0.2520,  1.0469],
        [-1.9375,  1.0312,  0.2812,  1.6953]], requires_grad=True)
2024-10-08 15:06:44,351 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.3828, -0.3438, -0.3281, -1.3359],
        [ 5.0312, -0.6133,  0.3691,  1.7344],
        [ 1.6797, -0.5625, -0.3027,  0.3789],
        ...,
        [-0.1592, -0.0977, -0.3691, -1.3828],
        [-1.0234,  0.7344,  0.2773,  1.0625],
        [-1.8281,  1.0312,  0.2871,  1.7031]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.3828, -0.3438, -0.3281, -1.3359],
        [ 5.0312, -0.6133,  0.3691,  1.7344],
        [ 1.6797, -0.5625, -0.3027,  0.3789],
        ...,
        [-0.1592, -0.0977, -0.3691, -1.3828],
        [-1.0234,  0.7344,  0.2773,  1.0625],
        [-1.8281,  1.0312,  0.2871,  1.7031]], requires_grad=True)
2024-10-08 15:06:44,511 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.5156, -0.4199, -0.3750, -1.3828],
        [ 5.5000, -1.0625,  0.0962,  1.5859],
        [ 1.7812, -0.5547, -0.2949,  0.3672],
        ...,
        [ 0.0659, -0.4492, -0.5820, -1.5078],
        [-1.0000,  0.7812,  0.3125,  1.0781],
        [-1.7109,  0.9961,  0.2734,  1.6953]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.5156, -0.4199, -0.3750, -1.3828],
        [ 5.5000, -1.0625,  0.0962,  1.5859],
        [ 1.7812, -0.5547, -0.2949,  0.3672],
        ...,
        [ 0.0659, -0.4492, -0.5820, -1.5078],
        [-1.0000,  0.7812,  0.3125,  1.0781],
        [-1.7109,  0.9961,  0.2734,  1.6953]], requires_grad=True)
2024-10-08 15:06:44,772 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.8438, -0.3418, -0.3438, -1.4297],
        [ 5.9062, -1.5391, -0.1953,  1.4219],
        [ 1.9297, -0.6055, -0.3184,  0.3516],
        ...,
        [-0.0325, -0.5273, -0.6484, -1.6328],
        [-0.7969,  0.6367,  0.2461,  1.0781],
        [-1.4844,  0.8203,  0.1973,  1.6875]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.8438, -0.3418, -0.3438, -1.4297],
        [ 5.9062, -1.5391, -0.1953,  1.4219],
        [ 1.9297, -0.6055, -0.3184,  0.3516],
        ...,
        [-0.0325, -0.5273, -0.6484, -1.6328],
        [-0.7969,  0.6367,  0.2461,  1.0781],
        [-1.4844,  0.8203,  0.1973,  1.6875]], requires_grad=True)
2024-10-08 15:06:45,037 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-2.1250, -0.1797, -0.2695, -1.4297],
        [ 6.1875, -1.9453, -0.4473,  1.2656],
        [ 2.1094, -0.6992, -0.3652,  0.3379],
        ...,
        [-0.1006, -0.5586, -0.6797, -1.7266],
        [-0.2871,  0.2969,  0.0913,  1.1641],
        [-1.4688,  0.7539,  0.1602,  1.6094]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-2.1250, -0.1797, -0.2695, -1.4297],
        [ 6.1875, -1.9453, -0.4473,  1.2656],
        [ 2.1094, -0.6992, -0.3652,  0.3379],
        ...,
        [-0.1006, -0.5586, -0.6797, -1.7266],
        [-0.2871,  0.2969,  0.0913,  1.1641],
        [-1.4688,  0.7539,  0.1602,  1.6094]], requires_grad=True)
2024-10-08 15:06:45,305 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-2.2812, -0.1680, -0.2637, -1.4141],
        [ 6.3125, -2.1094, -0.5703,  1.0938],
        [ 2.2188, -0.7383, -0.3828,  0.3223],
        ...,
        [-0.0674, -0.6797, -0.7539, -1.7734],
        [ 0.0620,  0.1328,  0.0182,  1.2109],
        [-1.4609,  0.7148,  0.1367,  1.5312]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-2.2812, -0.1680, -0.2637, -1.4141],
        [ 6.3125, -2.1094, -0.5703,  1.0938],
        [ 2.2188, -0.7383, -0.3828,  0.3223],
        ...,
        [-0.0674, -0.6797, -0.7539, -1.7734],
        [ 0.0620,  0.1328,  0.0182,  1.2109],
        [-1.4609,  0.7148,  0.1367,  1.5312]], requires_grad=True)
2024-10-08 15:06:45,554 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-2.4219e+00, -2.4023e-01, -2.9492e-01, -1.4141e+00],
        [ 6.4062e+00, -2.0469e+00, -5.7422e-01,  9.4922e-01],
        [ 2.3125e+00, -7.1094e-01, -3.7109e-01,  3.0859e-01],
        ...,
        [-5.4688e-02, -9.4141e-01, -8.9844e-01, -1.8281e+00],
        [ 3.5938e-01,  9.1797e-02,  2.6093e-03,  1.2500e+00],
        [-1.4141e+00,  8.1641e-01,  1.7676e-01,  1.4844e+00]],
       requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-2.4219e+00, -2.4023e-01, -2.9492e-01, -1.4141e+00],
        [ 6.4062e+00, -2.0469e+00, -5.7422e-01,  9.4922e-01],
        [ 2.3125e+00, -7.1094e-01, -3.7109e-01,  3.0859e-01],
        ...,
        [-5.4688e-02, -9.4141e-01, -8.9844e-01, -1.8281e+00],
        [ 3.5938e-01,  9.1797e-02,  2.6093e-03,  1.2500e+00],
        [-1.4141e+00,  8.1641e-01,  1.7676e-01,  1.4844e+00]],
       requires_grad=True)
2024-10-08 15:06:45,815 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-2.5312, -0.3242, -0.3281, -1.4062],
        [ 6.4688, -1.9688, -0.5703,  0.8125],
        [ 2.3750, -0.6836, -0.3574,  0.2930],
        ...,
        [-0.0398, -1.1875, -1.0312, -1.8672],
        [ 0.6094,  0.0635, -0.0085,  1.2734],
        [-1.3906,  0.9062,  0.2119,  1.4297]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-2.5312, -0.3242, -0.3281, -1.4062],
        [ 6.4688, -1.9688, -0.5703,  0.8125],
        [ 2.3750, -0.6836, -0.3574,  0.2930],
        ...,
        [-0.0398, -1.1875, -1.0312, -1.8672],
        [ 0.6094,  0.0635, -0.0085,  1.2734],
        [-1.3906,  0.9062,  0.2119,  1.4297]], requires_grad=True)
2024-10-08 15:06:45,975 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-2.5469, -0.3301, -0.3340, -1.3828],
        [ 6.4062, -2.0156, -0.6172,  0.6680],
        [ 2.3438, -0.7188, -0.3711,  0.2656],
        ...,
        [ 0.0586, -1.2656, -1.0859, -1.8828],
        [ 0.7539, -0.0776, -0.0615,  1.2812],
        [-1.4062,  0.9258,  0.2217,  1.3672]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-2.5469, -0.3301, -0.3340, -1.3828],
        [ 6.4062, -2.0156, -0.6172,  0.6680],
        [ 2.3438, -0.7188, -0.3711,  0.2656],
        ...,
        [ 0.0586, -1.2656, -1.0859, -1.8828],
        [ 0.7539, -0.0776, -0.0615,  1.2812],
        [-1.4062,  0.9258,  0.2217,  1.3672]], requires_grad=True)
2024-10-08 15:06:46,129 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-2.5312, -0.3633, -0.3496, -1.3516],
        [ 6.3750, -2.0625, -0.6562,  0.5508],
        [ 2.3438, -0.6953, -0.3594,  0.2500],
        ...,
        [ 0.1177, -1.3828, -1.1562, -1.9062],
        [ 0.9062, -0.1953, -0.1035,  1.2969],
        [-1.4062,  0.9141,  0.2217,  1.3125]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-2.5312, -0.3633, -0.3496, -1.3516],
        [ 6.3750, -2.0625, -0.6562,  0.5508],
        [ 2.3438, -0.6953, -0.3594,  0.2500],
        ...,
        [ 0.1177, -1.3828, -1.1562, -1.9062],
        [ 0.9062, -0.1953, -0.1035,  1.2969],
        [-1.4062,  0.9141,  0.2217,  1.3125]], requires_grad=True)
2024-10-08 15:06:46,396 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-2.5469, -0.3711, -0.3555, -1.3281],
        [ 6.5312, -2.2344, -0.7383,  0.5078],
        [ 2.3750, -0.6719, -0.3438,  0.2490],
        ...,
        [ 0.1582, -1.5156, -1.2344, -1.9375],
        [ 1.1250, -0.2969, -0.1348,  1.3438],
        [-1.4453,  0.9023,  0.2188,  1.2344]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-2.5469, -0.3711, -0.3555, -1.3281],
        [ 6.5312, -2.2344, -0.7383,  0.5078],
        [ 2.3750, -0.6719, -0.3438,  0.2490],
        ...,
        [ 0.1582, -1.5156, -1.2344, -1.9375],
        [ 1.1250, -0.2969, -0.1348,  1.3438],
        [-1.4453,  0.9023,  0.2188,  1.2344]], requires_grad=True)
2024-10-08 15:06:46,645 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-2.5469, -0.3887, -0.3633, -1.3047],
        [ 6.6562, -2.3438, -0.7891,  0.4824],
        [ 2.3750, -0.6250, -0.3223,  0.2559],
        ...,
        [ 0.2080, -1.7188, -1.3359, -1.9844],
        [ 1.3125, -0.3223, -0.1387,  1.4062],
        [-1.4531,  0.9258,  0.2275,  1.1797]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-2.5469, -0.3887, -0.3633, -1.3047],
        [ 6.6562, -2.3438, -0.7891,  0.4824],
        [ 2.3750, -0.6250, -0.3223,  0.2559],
        ...,
        [ 0.2080, -1.7188, -1.3359, -1.9844],
        [ 1.3125, -0.3223, -0.1387,  1.4062],
        [-1.4531,  0.9258,  0.2275,  1.1797]], requires_grad=True)
2024-10-08 15:06:46,906 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-2.5469, -0.4336, -0.3750, -1.2969],
        [ 6.7500, -2.4219, -0.8281,  0.4590],
        [ 2.3594, -0.5859, -0.3027,  0.2578],
        ...,
        [ 0.2715, -1.8672, -1.4141, -2.0000],
        [ 1.4531, -0.3555, -0.1455,  1.4453],
        [-1.4609,  0.9531,  0.2363,  1.1250]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-2.5469, -0.4336, -0.3750, -1.2969],
        [ 6.7500, -2.4219, -0.8281,  0.4590],
        [ 2.3594, -0.5859, -0.3027,  0.2578],
        ...,
        [ 0.2715, -1.8672, -1.4141, -2.0000],
        [ 1.4531, -0.3555, -0.1455,  1.4453],
        [-1.4609,  0.9531,  0.2363,  1.1250]], requires_grad=True)
2024-10-08 15:06:47,068 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-2.5469, -0.4609, -0.3828, -1.2812],
        [ 6.8438, -2.5000, -0.8633,  0.4434],
        [ 2.3438, -0.5664, -0.2852,  0.2559],
        ...,
        [ 0.3242, -1.9688, -1.4766, -2.0000],
        [ 1.5781, -0.4121, -0.1553,  1.4609],
        [-1.4609,  0.9648,  0.2432,  1.0781]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-2.5469, -0.4609, -0.3828, -1.2812],
        [ 6.8438, -2.5000, -0.8633,  0.4434],
        [ 2.3438, -0.5664, -0.2852,  0.2559],
        ...,
        [ 0.3242, -1.9688, -1.4766, -2.0000],
        [ 1.5781, -0.4121, -0.1553,  1.4609],
        [-1.4609,  0.9648,  0.2432,  1.0781]], requires_grad=True)
2024-10-08 15:06:47,218 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-2.5156, -0.4980, -0.3887, -1.2656],
        [ 6.8750, -2.5781, -0.8906,  0.4180],
        [ 2.3281, -0.5430, -0.2715,  0.2520],
        ...,
        [ 0.3906, -2.0781, -1.5234, -2.0000],
        [ 1.6641, -0.4355, -0.1611,  1.4766],
        [-1.4609,  0.9805,  0.2480,  1.0312]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-2.5156, -0.4980, -0.3887, -1.2656],
        [ 6.8750, -2.5781, -0.8906,  0.4180],
        [ 2.3281, -0.5430, -0.2715,  0.2520],
        ...,
        [ 0.3906, -2.0781, -1.5234, -2.0000],
        [ 1.6641, -0.4355, -0.1611,  1.4766],
        [-1.4609,  0.9805,  0.2480,  1.0312]], requires_grad=True)
2024-10-08 15:06:47,486 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-2.4844, -0.5195, -0.3945, -1.2422],
        [ 6.9375, -2.6719, -0.9102,  0.3945],
        [ 2.3281, -0.5273, -0.2578,  0.2441],
        ...,
        [ 0.4277, -2.1250, -1.5625, -1.9688],
        [ 1.7422, -0.4746, -0.1650,  1.4688],
        [-1.4609,  0.9961,  0.2500,  0.9922]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-2.4844, -0.5195, -0.3945, -1.2422],
        [ 6.9375, -2.6719, -0.9102,  0.3945],
        [ 2.3281, -0.5273, -0.2578,  0.2441],
        ...,
        [ 0.4277, -2.1250, -1.5625, -1.9688],
        [ 1.7422, -0.4746, -0.1650,  1.4688],
        [-1.4609,  0.9961,  0.2500,  0.9922]], requires_grad=True)
2024-10-08 15:06:47,749 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-2.4375, -0.5742, -0.3945, -1.2656],
        [ 6.9688, -2.7656, -0.9219,  0.3652],
        [ 2.3281, -0.5078, -0.2441,  0.2441],
        ...,
        [ 0.5078, -2.2344, -1.5781, -2.0000],
        [ 1.7578, -0.4453, -0.1777,  1.5156],
        [-1.4844,  1.0703,  0.2441,  1.0000]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-2.4375, -0.5742, -0.3945, -1.2656],
        [ 6.9688, -2.7656, -0.9219,  0.3652],
        [ 2.3281, -0.5078, -0.2441,  0.2441],
        ...,
        [ 0.5078, -2.2344, -1.5781, -2.0000],
        [ 1.7578, -0.4453, -0.1777,  1.5156],
        [-1.4844,  1.0703,  0.2441,  1.0000]], requires_grad=True)
2024-10-08 15:06:48,014 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-2.4062, -0.6016, -0.4082, -1.2578],
        [ 7.0000, -2.9062, -0.8633,  0.2754],
        [ 2.3281, -0.5000, -0.2207,  0.2305],
        ...,
        [ 0.5586, -2.2969, -1.6172, -1.9844],
        [ 1.7656, -0.4277, -0.1816,  1.5391],
        [-1.4844,  1.0938,  0.2676,  0.9648]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-2.4062, -0.6016, -0.4082, -1.2578],
        [ 7.0000, -2.9062, -0.8633,  0.2754],
        [ 2.3281, -0.5000, -0.2207,  0.2305],
        ...,
        [ 0.5586, -2.2969, -1.6172, -1.9844],
        [ 1.7656, -0.4277, -0.1816,  1.5391],
        [-1.4844,  1.0938,  0.2676,  0.9648]], requires_grad=True)
2024-10-08 15:06:48,271 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-2.3750, -0.6172, -0.4238, -1.2344],
        [ 7.0625, -3.0469, -0.7695,  0.1621],
        [ 2.3281, -0.4922, -0.1992,  0.2168],
        ...,
        [ 0.5859, -2.3281, -1.6719, -1.9453],
        [ 1.7734, -0.4199, -0.1709,  1.5391],
        [-1.4766,  1.1016,  0.2969,  0.9219]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-2.3750, -0.6172, -0.4238, -1.2344],
        [ 7.0625, -3.0469, -0.7695,  0.1621],
        [ 2.3281, -0.4922, -0.1992,  0.2168],
        ...,
        [ 0.5859, -2.3281, -1.6719, -1.9453],
        [ 1.7734, -0.4199, -0.1709,  1.5391],
        [-1.4766,  1.1016,  0.2969,  0.9219]], requires_grad=True)
2024-10-08 15:06:48,523 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-2.1406, -0.6719, -0.3691, -1.3281],
        [ 6.9688, -3.1094, -0.7773,  0.1553],
        [ 2.2344, -0.4668, -0.2100,  0.2451],
        ...,
        [ 0.9297, -2.4531, -1.5547, -2.0938],
        [ 1.5859, -0.3594, -0.2471,  1.6484],
        [-1.6016,  1.1641,  0.2441,  0.9844]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-2.1406, -0.6719, -0.3691, -1.3281],
        [ 6.9688, -3.1094, -0.7773,  0.1553],
        [ 2.2344, -0.4668, -0.2100,  0.2451],
        ...,
        [ 0.9297, -2.4531, -1.5547, -2.0938],
        [ 1.5859, -0.3594, -0.2471,  1.6484],
        [-1.6016,  1.1641,  0.2441,  0.9844]], requires_grad=True)
2024-10-08 15:06:48,685 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.9688, -0.7109, -0.3320, -1.3906],
        [ 7.0312, -3.2188, -0.6914,  0.0933],
        [ 2.2188, -0.4629, -0.1924,  0.2412],
        ...,
        [ 1.0234, -2.4688, -1.5391, -2.1250],
        [ 1.4453, -0.3145, -0.2988,  1.7188],
        [-1.5703,  1.1562,  0.2676,  0.9648]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.9688, -0.7109, -0.3320, -1.3906],
        [ 7.0312, -3.2188, -0.6914,  0.0933],
        [ 2.2188, -0.4629, -0.1924,  0.2412],
        ...,
        [ 1.0234, -2.4688, -1.5391, -2.1250],
        [ 1.4453, -0.3145, -0.2988,  1.7188],
        [-1.5703,  1.1562,  0.2676,  0.9648]], requires_grad=True)
2024-10-08 15:06:48,834 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.8438, -0.7383, -0.3145, -1.4297],
        [ 7.0000, -3.2969, -0.6484,  0.0593],
        [ 2.2188, -0.4609, -0.1670,  0.2285],
        ...,
        [ 1.0547, -2.4688, -1.5547, -2.1250],
        [ 1.3438, -0.2832, -0.3203,  1.7500],
        [-1.5078,  1.1328,  0.3223,  0.9102]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.8438, -0.7383, -0.3145, -1.4297],
        [ 7.0000, -3.2969, -0.6484,  0.0593],
        [ 2.2188, -0.4609, -0.1670,  0.2285],
        ...,
        [ 1.0547, -2.4688, -1.5547, -2.1250],
        [ 1.3438, -0.2832, -0.3203,  1.7500],
        [-1.5078,  1.1328,  0.3223,  0.9102]], requires_grad=True)
2024-10-08 15:06:49,091 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.5938, -0.7734, -0.2393, -1.5312],
        [ 6.8750, -3.3281, -0.7070,  0.1011],
        [ 2.1406, -0.4512, -0.1748,  0.2412],
        ...,
        [ 1.2891, -2.4844, -1.4297, -2.2031],
        [ 1.1562, -0.2422, -0.3965,  1.8203],
        [-1.5156,  1.1172,  0.3203,  0.8984]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.5938, -0.7734, -0.2393, -1.5312],
        [ 6.8750, -3.3281, -0.7070,  0.1011],
        [ 2.1406, -0.4512, -0.1748,  0.2412],
        ...,
        [ 1.2891, -2.4844, -1.4297, -2.2031],
        [ 1.1562, -0.2422, -0.3965,  1.8203],
        [-1.5156,  1.1172,  0.3203,  0.8984]], requires_grad=True)
2024-10-08 15:06:49,249 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.2344, -0.8125, -0.1055, -1.6875],
        [ 6.6250, -3.3281, -0.8867,  0.2393],
        [ 2.0156, -0.4355, -0.2119,  0.2812],
        ...,
        [ 1.5234, -2.5000, -1.2969, -2.2969],
        [ 0.9648, -0.2031, -0.4766,  1.8828],
        [-1.6484,  1.1250,  0.2090,  0.9883]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.2344, -0.8125, -0.1055, -1.6875],
        [ 6.6250, -3.3281, -0.8867,  0.2393],
        [ 2.0156, -0.4355, -0.2119,  0.2812],
        ...,
        [ 1.5234, -2.5000, -1.2969, -2.2969],
        [ 0.9648, -0.2031, -0.4766,  1.8828],
        [-1.6484,  1.1250,  0.2090,  0.9883]], requires_grad=True)
2024-10-08 15:06:49,513 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-8.9844e-01, -8.4375e-01, -4.2915e-04, -1.7891e+00],
        [ 6.5625e+00, -3.3438e+00, -8.5547e-01,  2.4609e-01],
        [ 1.9297e+00, -4.2383e-01, -2.1484e-01,  2.8711e-01],
        ...,
        [ 1.7031e+00, -2.5000e+00, -1.1797e+00, -2.3750e+00],
        [ 8.5156e-01, -1.7383e-01, -5.0391e-01,  1.8984e+00],
        [-1.7109e+00,  1.1250e+00,  1.5527e-01,  1.0234e+00]],
       requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-8.9844e-01, -8.4375e-01, -4.2915e-04, -1.7891e+00],
        [ 6.5625e+00, -3.3438e+00, -8.5547e-01,  2.4609e-01],
        [ 1.9297e+00, -4.2383e-01, -2.1484e-01,  2.8711e-01],
        ...,
        [ 1.7031e+00, -2.5000e+00, -1.1797e+00, -2.3750e+00],
        [ 8.5156e-01, -1.7383e-01, -5.0391e-01,  1.8984e+00],
        [-1.7109e+00,  1.1250e+00,  1.5527e-01,  1.0234e+00]],
       requires_grad=True)
2024-10-08 15:06:49,764 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.8984, -0.8516, -0.0270, -1.7656],
        [ 6.4688, -3.3438, -0.8438,  0.2559],
        [ 2.0625, -0.4258, -0.1250,  0.2236],
        ...,
        [ 1.5547, -2.4688, -1.2656, -2.3125],
        [ 0.9531, -0.1670, -0.4043,  1.8281],
        [-1.5625,  1.0938,  0.2480,  0.9531]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.8984, -0.8516, -0.0270, -1.7656],
        [ 6.4688, -3.3438, -0.8438,  0.2559],
        [ 2.0625, -0.4258, -0.1250,  0.2236],
        ...,
        [ 1.5547, -2.4688, -1.2656, -2.3125],
        [ 0.9531, -0.1670, -0.4043,  1.8281],
        [-1.5625,  1.0938,  0.2480,  0.9531]], requires_grad=True)
2024-10-08 15:06:50,017 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.9648, -0.8398, -0.0747, -1.6875],
        [ 6.1562, -3.2812, -0.9688,  0.4043],
        [ 1.9844, -0.4043, -0.1021,  0.2422],
        ...,
        [ 1.9922, -2.5156, -1.1094, -2.4844],
        [ 0.8008, -0.1270, -0.4023,  1.8516],
        [-1.7500,  1.1250,  0.1855,  1.0547]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.9648, -0.8398, -0.0747, -1.6875],
        [ 6.1562, -3.2812, -0.9688,  0.4043],
        [ 1.9844, -0.4043, -0.1021,  0.2422],
        ...,
        [ 1.9922, -2.5156, -1.1094, -2.4844],
        [ 0.8008, -0.1270, -0.4023,  1.8516],
        [-1.7500,  1.1250,  0.1855,  1.0547]], requires_grad=True)
2024-10-08 15:06:50,264 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.8711, -0.8594, -0.0981, -1.6875],
        [ 5.9062, -3.2031, -1.0781,  0.6094],
        [ 1.7656, -0.3574, -0.1016,  0.3203],
        ...,
        [ 2.7031, -2.6406, -0.9062, -2.7656],
        [ 0.4766, -0.0427, -0.4336,  1.9609],
        [-2.0625,  1.2188,  0.0923,  1.2734]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-0.8711, -0.8594, -0.0981, -1.6875],
        [ 5.9062, -3.2031, -1.0781,  0.6094],
        [ 1.7656, -0.3574, -0.1016,  0.3203],
        ...,
        [ 2.7031, -2.6406, -0.9062, -2.7656],
        [ 0.4766, -0.0427, -0.4336,  1.9609],
        [-2.0625,  1.2188,  0.0923,  1.2734]], requires_grad=True)
2024-10-08 15:06:50,425 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.3281, -0.7656, -0.1138, -1.5703],
        [ 5.8125, -3.1875, -1.1719,  0.7266],
        [ 1.7812, -0.3633, -0.1045,  0.3340],
        ...,
        [ 3.0781, -2.6719, -0.7188, -2.9688],
        [ 0.6367, -0.0854, -0.4629,  1.9531],
        [-2.1719,  1.2266,  0.0072,  1.3984]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.3281, -0.7656, -0.1138, -1.5703],
        [ 5.8125, -3.1875, -1.1719,  0.7266],
        [ 1.7812, -0.3633, -0.1045,  0.3340],
        ...,
        [ 3.0781, -2.6719, -0.7188, -2.9688],
        [ 0.6367, -0.0854, -0.4629,  1.9531],
        [-2.1719,  1.2266,  0.0072,  1.3984]], requires_grad=True)
2024-10-08 15:06:50,560 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.6953, -0.7031, -0.1377, -1.4922],
        [ 5.6875, -3.1406, -1.2344,  0.8594],
        [ 1.7578, -0.3457, -0.0947,  0.3730],
        ...,
        [ 3.4375, -2.7188, -0.5703, -3.1562],
        [ 0.7578, -0.1128, -0.4824,  1.9453],
        [-2.2969,  1.2734, -0.0435,  1.5625]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.6953, -0.7031, -0.1377, -1.4922],
        [ 5.6875, -3.1406, -1.2344,  0.8594],
        [ 1.7578, -0.3457, -0.0947,  0.3730],
        ...,
        [ 3.4375, -2.7188, -0.5703, -3.1562],
        [ 0.7578, -0.1128, -0.4824,  1.9453],
        [-2.2969,  1.2734, -0.0435,  1.5625]], requires_grad=True)
2024-10-08 15:06:50,815 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-2.0000, -0.6445, -0.1572, -1.4141],
        [ 5.5625, -3.0469, -1.2422,  0.9961],
        [ 1.7266, -0.3281, -0.0840,  0.4062],
        ...,
        [ 3.7969, -2.8125, -0.4902, -3.3438],
        [ 0.8359, -0.1104, -0.4766,  1.9609],
        [-2.4531,  1.3516, -0.0554,  1.7266]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-2.0000, -0.6445, -0.1572, -1.4141],
        [ 5.5625, -3.0469, -1.2422,  0.9961],
        [ 1.7266, -0.3281, -0.0840,  0.4062],
        ...,
        [ 3.7969, -2.8125, -0.4902, -3.3438],
        [ 0.8359, -0.1104, -0.4766,  1.9609],
        [-2.4531,  1.3516, -0.0554,  1.7266]], requires_grad=True)
2024-10-08 15:06:50,967 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-2.2656, -0.6016, -0.1836, -1.3516],
        [ 5.5312, -3.0312, -1.3438,  1.0938],
        [ 1.7188, -0.3203, -0.0835,  0.4355],
        ...,
        [ 4.0938, -2.8750, -0.4004, -3.5156],
        [ 0.9062, -0.1094, -0.4707,  1.9609],
        [-2.5625,  1.4297, -0.0601,  1.8672]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-2.2656, -0.6016, -0.1836, -1.3516],
        [ 5.5312, -3.0312, -1.3438,  1.0938],
        [ 1.7188, -0.3203, -0.0835,  0.4355],
        ...,
        [ 4.0938, -2.8750, -0.4004, -3.5156],
        [ 0.9062, -0.1094, -0.4707,  1.9609],
        [-2.5625,  1.4297, -0.0601,  1.8672]], requires_grad=True)
2024-10-08 15:06:51,126 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-2.5000, -0.5625, -0.2070, -1.2969],
        [ 5.4062, -2.8906, -1.2656,  1.2109],
        [ 1.6953, -0.2988, -0.0684,  0.4609],
        ...,
        [ 4.3750, -2.9531, -0.3867, -3.6562],
        [ 0.9297, -0.0654, -0.4160,  1.9688],
        [-2.7031,  1.5781,  0.0232,  2.0156]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-2.5000, -0.5625, -0.2070, -1.2969],
        [ 5.4062, -2.8906, -1.2656,  1.2109],
        [ 1.6953, -0.2988, -0.0684,  0.4609],
        ...,
        [ 4.3750, -2.9531, -0.3867, -3.6562],
        [ 0.9297, -0.0654, -0.4160,  1.9688],
        [-2.7031,  1.5781,  0.0232,  2.0156]], requires_grad=True)
2024-10-08 15:06:51,391 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-2.7188, -0.4453, -0.1357, -1.2109],
        [ 5.3125, -2.8125, -1.2812,  1.2891],
        [ 1.6641, -0.3008, -0.0806,  0.4727],
        ...,
        [ 4.5938, -3.0000, -0.3398, -3.7500],
        [ 0.9609, -0.0708, -0.4180,  1.9531],
        [-2.8125,  1.6641,  0.0593,  2.1250]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-2.7188, -0.4453, -0.1357, -1.2109],
        [ 5.3125, -2.8125, -1.2812,  1.2891],
        [ 1.6641, -0.3008, -0.0806,  0.4727],
        ...,
        [ 4.5938, -3.0000, -0.3398, -3.7500],
        [ 0.9609, -0.0708, -0.4180,  1.9531],
        [-2.8125,  1.6641,  0.0593,  2.1250]], requires_grad=True)
2024-10-08 15:06:51,646 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-2.9062, -0.3398, -0.0723, -1.1250],
        [ 5.2188, -2.7656, -1.3047,  1.3594],
        [ 1.6562, -0.3105, -0.1001,  0.4805],
        ...,
        [ 4.8125, -3.0781, -0.3672, -3.8438],
        [ 0.9844, -0.0540, -0.3945,  1.9375],
        [-2.9219,  1.7578,  0.1128,  2.2188]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-2.9062, -0.3398, -0.0723, -1.1250],
        [ 5.2188, -2.7656, -1.3047,  1.3594],
        [ 1.6562, -0.3105, -0.1001,  0.4805],
        ...,
        [ 4.8125, -3.0781, -0.3672, -3.8438],
        [ 0.9844, -0.0540, -0.3945,  1.9375],
        [-2.9219,  1.7578,  0.1128,  2.2188]], requires_grad=True)
2024-10-08 15:06:51,905 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-3.0781, -0.1982,  0.0282, -1.0312],
        [ 5.1562, -2.7344, -1.3516,  1.4219],
        [ 1.6406, -0.2949, -0.0952,  0.4961],
        ...,
        [ 4.9688, -3.1875, -0.4336, -3.9219],
        [ 1.0000, -0.0114, -0.3457,  1.9453],
        [-3.0000,  1.8359,  0.1641,  2.2969]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-3.0781, -0.1982,  0.0282, -1.0312],
        [ 5.1562, -2.7344, -1.3516,  1.4219],
        [ 1.6406, -0.2949, -0.0952,  0.4961],
        ...,
        [ 4.9688, -3.1875, -0.4336, -3.9219],
        [ 1.0000, -0.0114, -0.3457,  1.9453],
        [-3.0000,  1.8359,  0.1641,  2.2969]], requires_grad=True)
2024-10-08 15:06:52,164 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-3.2031, -0.0889,  0.1040, -0.9453],
        [ 5.0938, -2.7031, -1.3828,  1.4766],
        [ 1.6172, -0.2910, -0.1001,  0.5039],
        ...,
        [ 5.0938, -3.2500, -0.4707, -3.9688],
        [ 1.0078,  0.0150, -0.3125,  1.9375],
        [-3.0625,  1.8516,  0.1650,  2.3438]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-3.2031, -0.0889,  0.1040, -0.9453],
        [ 5.0938, -2.7031, -1.3828,  1.4766],
        [ 1.6172, -0.2910, -0.1001,  0.5039],
        ...,
        [ 5.0938, -3.2500, -0.4707, -3.9688],
        [ 1.0078,  0.0150, -0.3125,  1.9375],
        [-3.0625,  1.8516,  0.1650,  2.3438]], requires_grad=True)
2024-10-08 15:06:52,422 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-3.3438,  0.1328,  0.2637, -0.8516],
        [ 5.0312, -2.8281, -1.5469,  1.4922],
        [ 1.5938, -0.3555, -0.1582,  0.4980],
        ...,
        [ 5.2188, -3.3125, -0.5039, -4.0000],
        [ 1.0234, -0.0244, -0.3301,  1.9219],
        [-3.1094,  1.7734,  0.1069,  2.3750]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-3.3438,  0.1328,  0.2637, -0.8516],
        [ 5.0312, -2.8281, -1.5469,  1.4922],
        [ 1.5938, -0.3555, -0.1582,  0.4980],
        ...,
        [ 5.2188, -3.3125, -0.5039, -4.0000],
        [ 1.0234, -0.0244, -0.3301,  1.9219],
        [-3.1094,  1.7734,  0.1069,  2.3750]], requires_grad=True)
2024-10-08 15:06:52,690 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-3.4531,  0.3398,  0.4121, -0.7578],
        [ 4.9062, -2.7031, -1.5156,  1.5234],
        [ 1.5469, -0.3691, -0.1787,  0.4961],
        ...,
        [ 5.3750, -3.4844, -0.6250, -4.0312],
        [ 1.0000,  0.0464, -0.2734,  1.9141],
        [-3.1562,  1.8125,  0.1235,  2.3906]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-3.4531,  0.3398,  0.4121, -0.7578],
        [ 4.9062, -2.7031, -1.5156,  1.5234],
        [ 1.5469, -0.3691, -0.1787,  0.4961],
        ...,
        [ 5.3750, -3.4844, -0.6250, -4.0312],
        [ 1.0000,  0.0464, -0.2734,  1.9141],
        [-3.1562,  1.8125,  0.1235,  2.3906]], requires_grad=True)
2024-10-08 15:06:52,959 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-3.5156,  0.4902,  0.5273, -0.6797],
        [ 4.7500, -2.3594, -1.3438,  1.5938],
        [ 1.5000, -0.3066, -0.1562,  0.5156],
        ...,
        [ 5.5000, -3.7812, -0.8281, -4.0625],
        [ 0.9727,  0.2109, -0.1660,  1.9297],
        [-3.1719,  2.0312,  0.2373,  2.4531]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-3.5156,  0.4902,  0.5273, -0.6797],
        [ 4.7500, -2.3594, -1.3438,  1.5938],
        [ 1.5000, -0.3066, -0.1562,  0.5156],
        ...,
        [ 5.5000, -3.7812, -0.8281, -4.0625],
        [ 0.9727,  0.2109, -0.1660,  1.9297],
        [-3.1719,  2.0312,  0.2373,  2.4531]], requires_grad=True)
2024-10-08 15:06:53,227 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-3.5312,  0.6250,  0.6289, -0.6016],
        [ 4.5625, -2.0781, -1.2031,  1.6406],
        [ 1.4297, -0.2988, -0.1572,  0.5156],
        ...,
        [ 5.6250, -3.9688, -0.9766, -4.0625],
        [ 0.9375,  0.3301, -0.0825,  1.9297],
        [-3.1875,  2.1406,  0.3066,  2.4844]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-3.5312,  0.6250,  0.6289, -0.6016],
        [ 4.5625, -2.0781, -1.2031,  1.6406],
        [ 1.4297, -0.2988, -0.1572,  0.5156],
        ...,
        [ 5.6250, -3.9688, -0.9766, -4.0625],
        [ 0.9375,  0.3301, -0.0825,  1.9297],
        [-3.1875,  2.1406,  0.3066,  2.4844]], requires_grad=True)
2024-10-08 15:06:53,479 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-3.5625,  0.7852,  0.7266, -0.5273],
        [ 4.4688, -1.8906, -1.0938,  1.6797],
        [ 1.3906, -0.3301, -0.1680,  0.5117],
        ...,
        [ 5.6875, -4.0312, -1.0703, -4.0625],
        [ 0.9141,  0.3789, -0.0234,  1.9141],
        [-3.1250,  2.0312,  0.3164,  2.5000]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-3.5625,  0.7852,  0.7266, -0.5273],
        [ 4.4688, -1.8906, -1.0938,  1.6797],
        [ 1.3906, -0.3301, -0.1680,  0.5117],
        ...,
        [ 5.6875, -4.0312, -1.0703, -4.0625],
        [ 0.9141,  0.3789, -0.0234,  1.9141],
        [-3.1250,  2.0312,  0.3164,  2.5000]], requires_grad=True)
2024-10-08 15:06:53,731 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-3.5938,  1.0000,  0.8125, -0.4199],
        [ 4.4375, -2.0156, -1.0234,  1.6250],
        [ 1.3672, -0.4102, -0.1816,  0.4805],
        ...,
        [ 5.6875, -3.9531, -1.1484, -4.0000],
        [ 0.9492,  0.3145,  0.0212,  1.8672],
        [-3.0469,  1.7578,  0.3125,  2.4375]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-3.5938,  1.0000,  0.8125, -0.4199],
        [ 4.4375, -2.0156, -1.0234,  1.6250],
        [ 1.3672, -0.4102, -0.1816,  0.4805],
        ...,
        [ 5.6875, -3.9531, -1.1484, -4.0000],
        [ 0.9492,  0.3145,  0.0212,  1.8672],
        [-3.0469,  1.7578,  0.3125,  2.4375]], requires_grad=True)
2024-10-08 15:06:53,893 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-3.4688,  1.1094,  0.8984, -0.3613],
        [ 4.0938, -1.7344, -1.0156,  1.7656],
        [ 1.1172, -0.3320, -0.2148,  0.5430],
        ...,
        [ 6.0625, -4.2188, -1.1562, -4.1250],
        [ 0.7695,  0.4531,  0.0312,  1.9375],
        [-3.1875,  1.7500,  0.2754,  2.4844]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-3.4688,  1.1094,  0.8984, -0.3613],
        [ 4.0938, -1.7344, -1.0156,  1.7656],
        [ 1.1172, -0.3320, -0.2148,  0.5430],
        ...,
        [ 6.0625, -4.2188, -1.1562, -4.1250],
        [ 0.7695,  0.4531,  0.0312,  1.9375],
        [-3.1875,  1.7500,  0.2754,  2.4844]], requires_grad=True)
2024-10-08 15:06:54,161 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-3.3125,  1.2031,  0.9727, -0.2969],
        [ 3.6562, -1.3438, -1.0391,  1.9766],
        [ 0.8477, -0.2383, -0.2490,  0.6133],
        ...,
        [ 6.4688, -4.5625, -1.1406, -4.2812],
        [ 0.5938,  0.5820,  0.0378,  1.9922],
        [-3.4062,  1.8359,  0.2246,  2.5781]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-3.3125,  1.2031,  0.9727, -0.2969],
        [ 3.6562, -1.3438, -1.0391,  1.9766],
        [ 0.8477, -0.2383, -0.2490,  0.6133],
        ...,
        [ 6.4688, -4.5625, -1.1406, -4.2812],
        [ 0.5938,  0.5820,  0.0378,  1.9922],
        [-3.4062,  1.8359,  0.2246,  2.5781]], requires_grad=True)
2024-10-08 15:06:54,420 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-3.1562,  1.2734,  1.0391, -0.2402],
        [ 3.1875, -0.9570, -1.0703,  2.1875],
        [ 0.6133, -0.1611, -0.2773,  0.6641],
        ...,
        [ 6.8438, -4.8750, -1.1094, -4.4375],
        [ 0.4238,  0.6953,  0.0422,  2.0312],
        [-3.5781,  1.9062,  0.1797,  2.6562]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-3.1562,  1.2734,  1.0391, -0.2402],
        [ 3.1875, -0.9570, -1.0703,  2.1875],
        [ 0.6133, -0.1611, -0.2773,  0.6641],
        ...,
        [ 6.8438, -4.8750, -1.1094, -4.4375],
        [ 0.4238,  0.6953,  0.0422,  2.0312],
        [-3.5781,  1.9062,  0.1797,  2.6562]], requires_grad=True)
2024-10-08 15:06:54,690 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-3.0469,  1.3594,  1.0859, -0.1621],
        [ 3.2344, -0.8867, -1.0078,  2.1875],
        [ 0.7461, -0.2080, -0.2656,  0.6133],
        ...,
        [ 6.6250, -4.9062, -1.1562, -4.4062],
        [ 0.5898,  0.6445,  0.0923,  1.9375],
        [-3.3281,  1.7188,  0.2070,  2.5469]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-3.0469,  1.3594,  1.0859, -0.1621],
        [ 3.2344, -0.8867, -1.0078,  2.1875],
        [ 0.7461, -0.2080, -0.2656,  0.6133],
        ...,
        [ 6.6250, -4.9062, -1.1562, -4.4062],
        [ 0.5898,  0.6445,  0.0923,  1.9375],
        [-3.3281,  1.7188,  0.2070,  2.5469]], requires_grad=True)
2024-10-08 15:06:54,854 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-2.8906,  1.4297,  1.1250, -0.0713],
        [ 3.5000, -0.9258, -0.8945,  2.1094],
        [ 0.9336, -0.2432, -0.2520,  0.5977],
        ...,
        [ 6.5312, -5.0000, -1.1641, -4.5000],
        [ 0.6797,  0.6367,  0.1206,  1.9141],
        [-2.9688,  1.5078,  0.2490,  2.4375]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-2.8906,  1.4297,  1.1250, -0.0713],
        [ 3.5000, -0.9258, -0.8945,  2.1094],
        [ 0.9336, -0.2432, -0.2520,  0.5977],
        ...,
        [ 6.5312, -5.0000, -1.1641, -4.5000],
        [ 0.6797,  0.6367,  0.1206,  1.9141],
        [-2.9688,  1.5078,  0.2490,  2.4375]], requires_grad=True)
2024-10-08 15:06:55,266 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-2.7969,  1.4844,  1.1562, -0.0342],
        [ 3.7188, -0.9648, -0.7930,  2.0156],
        [ 1.1250, -0.2812, -0.2363,  0.5742],
        ...,
        [ 6.7188, -5.1562, -1.1172, -4.6562],
        [ 0.5039,  0.7109,  0.1079,  1.9766],
        [-2.5938,  1.3125,  0.2871,  2.3438]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-2.7969,  1.4844,  1.1562, -0.0342],
        [ 3.7188, -0.9648, -0.7930,  2.0156],
        [ 1.1250, -0.2812, -0.2363,  0.5742],
        ...,
        [ 6.7188, -5.1562, -1.1172, -4.6562],
        [ 0.5039,  0.7109,  0.1079,  1.9766],
        [-2.5938,  1.3125,  0.2871,  2.3438]], requires_grad=True)
2024-10-08 15:06:55,522 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-2.5000,  1.4297,  1.2109, -0.1641],
        [ 3.7969, -0.9688, -0.7148,  1.9375],
        [ 1.0156, -0.2256, -0.2500,  0.6797],
        ...,
        [ 7.0625, -5.3750, -1.0391, -4.9062],
        [ 0.0635,  0.8984,  0.0574,  2.1719],
        [-2.4062,  1.2031,  0.3008,  2.3125]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-2.5000,  1.4297,  1.2109, -0.1641],
        [ 3.7969, -0.9688, -0.7148,  1.9375],
        [ 1.0156, -0.2256, -0.2500,  0.6797],
        ...,
        [ 7.0625, -5.3750, -1.0391, -4.9062],
        [ 0.0635,  0.8984,  0.0574,  2.1719],
        [-2.4062,  1.2031,  0.3008,  2.3125]], requires_grad=True)
2024-10-08 15:06:55,670 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-2.5312,  1.4688,  1.2344, -0.1689],
        [ 3.8750, -0.9844, -0.6445,  1.8516],
        [ 1.1875, -0.2393, -0.2480,  0.7148],
        ...,
        [ 7.1875, -5.5312, -0.9766, -5.0625],
        [-0.0977,  0.9922,  0.0270,  2.2969],
        [-2.1406,  1.0625,  0.3184,  2.2656]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-2.5312,  1.4688,  1.2344, -0.1689],
        [ 3.8750, -0.9844, -0.6445,  1.8516],
        [ 1.1875, -0.2393, -0.2480,  0.7148],
        ...,
        [ 7.1875, -5.5312, -0.9766, -5.0625],
        [-0.0977,  0.9922,  0.0270,  2.2969],
        [-2.1406,  1.0625,  0.3184,  2.2656]], requires_grad=True)
2024-10-08 15:06:55,927 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-2.9375e+00,  1.5859e+00,  1.2500e+00, -8.4961e-02],
        [ 4.2500e+00, -1.1172e+00, -5.7422e-01,  1.6953e+00],
        [ 1.7422e+00, -3.5742e-01, -2.4219e-01,  6.5234e-01],
        ...,
        [ 6.6875e+00, -5.4375e+00, -9.2578e-01, -5.0625e+00],
        [ 8.8379e-02,  9.5312e-01,  4.4556e-03,  2.3125e+00],
        [-1.7656e+00,  8.7891e-01,  3.3594e-01,  2.1719e+00]],
       requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-2.9375e+00,  1.5859e+00,  1.2500e+00, -8.4961e-02],
        [ 4.2500e+00, -1.1172e+00, -5.7422e-01,  1.6953e+00],
        [ 1.7422e+00, -3.5742e-01, -2.4219e-01,  6.5234e-01],
        ...,
        [ 6.6875e+00, -5.4375e+00, -9.2578e-01, -5.0625e+00],
        [ 8.8379e-02,  9.5312e-01,  4.4556e-03,  2.3125e+00],
        [-1.7656e+00,  8.7891e-01,  3.3594e-01,  2.1719e+00]],
       requires_grad=True)
2024-10-08 15:06:56,190 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-3.1875,  1.6562,  1.2500, -0.0435],
        [ 4.6875, -1.3203, -0.5312,  1.4844],
        [ 2.1719, -0.4355, -0.2314,  0.6211],
        ...,
        [ 6.0938, -5.2812, -0.8711, -5.0000],
        [ 0.3359,  0.8906, -0.0197,  2.2969],
        [-1.3984,  0.6836,  0.3438,  2.0469]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-3.1875,  1.6562,  1.2500, -0.0435],
        [ 4.6875, -1.3203, -0.5312,  1.4844],
        [ 2.1719, -0.4355, -0.2314,  0.6211],
        ...,
        [ 6.0938, -5.2812, -0.8711, -5.0000],
        [ 0.3359,  0.8906, -0.0197,  2.2969],
        [-1.3984,  0.6836,  0.3438,  2.0469]], requires_grad=True)
2024-10-08 15:06:56,441 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-3.3750,  1.7031,  1.2422, -0.0205],
        [ 4.9688, -1.4062, -0.4492,  1.3594],
        [ 2.4844, -0.4727, -0.2070,  0.6172],
        ...,
        [ 5.6250, -5.1562, -0.8320, -4.9375],
        [ 0.4531,  0.8828, -0.0192,  2.3125],
        [-1.1250,  0.5352,  0.3574,  1.9375]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-3.3750,  1.7031,  1.2422, -0.0205],
        [ 4.9688, -1.4062, -0.4492,  1.3594],
        [ 2.4844, -0.4727, -0.2070,  0.6172],
        ...,
        [ 5.6250, -5.1562, -0.8320, -4.9375],
        [ 0.4531,  0.8828, -0.0192,  2.3125],
        [-1.1250,  0.5352,  0.3574,  1.9375]], requires_grad=True)
2024-10-08 15:06:56,693 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-3.4375,  1.6875,  1.2031, -0.0366],
        [ 5.1250, -1.4375, -0.3438,  1.2734],
        [ 2.7344, -0.4922, -0.1768,  0.6250],
        ...,
        [ 5.2188, -5.0312, -0.8086, -4.8750],
        [ 0.4531,  0.9453,  0.0325,  2.3750],
        [-0.9297,  0.4512,  0.3984,  1.8672]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-3.4375,  1.6875,  1.2031, -0.0366],
        [ 5.1250, -1.4375, -0.3438,  1.2734],
        [ 2.7344, -0.4922, -0.1768,  0.6250],
        ...,
        [ 5.2188, -5.0312, -0.8086, -4.8750],
        [ 0.4531,  0.9453,  0.0325,  2.3750],
        [-0.9297,  0.4512,  0.3984,  1.8672]], requires_grad=True)
2024-10-08 15:06:56,851 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-3.4844,  1.6875,  1.1719, -0.0422],
        [ 5.2812, -1.4844, -0.2793,  1.1719],
        [ 2.9531, -0.5156, -0.1572,  0.6211],
        ...,
        [ 4.8438, -4.9062, -0.7852, -4.8125],
        [ 0.4531,  0.9961,  0.0732,  2.4062],
        [-0.7383,  0.3379,  0.4023,  1.7734]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-3.4844,  1.6875,  1.1719, -0.0422],
        [ 5.2812, -1.4844, -0.2793,  1.1719],
        [ 2.9531, -0.5156, -0.1572,  0.6211],
        ...,
        [ 4.8438, -4.9062, -0.7852, -4.8125],
        [ 0.4531,  0.9961,  0.0732,  2.4062],
        [-0.7383,  0.3379,  0.4023,  1.7734]], requires_grad=True)
2024-10-08 15:06:57,006 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-3.5000,  1.6641,  1.1328, -0.0481],
        [ 5.4062, -1.5703, -0.2754,  1.0781],
        [ 3.1250, -0.5547, -0.1592,  0.6133],
        ...,
        [ 4.4688, -4.7188, -0.6914, -4.7188],
        [ 0.4629,  0.9961,  0.0698,  2.4219],
        [-0.5547,  0.1924,  0.3613,  1.6797]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-3.5000,  1.6641,  1.1328, -0.0481],
        [ 5.4062, -1.5703, -0.2754,  1.0781],
        [ 3.1250, -0.5547, -0.1592,  0.6133],
        ...,
        [ 4.4688, -4.7188, -0.6914, -4.7188],
        [ 0.4629,  0.9961,  0.0698,  2.4219],
        [-0.5547,  0.1924,  0.3613,  1.6797]], requires_grad=True)
2024-10-08 15:06:57,166 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-3.5000,  1.6406,  1.0859, -0.0508],
        [ 5.5000, -1.6562, -0.2891,  0.9844],
        [ 3.2812, -0.5859, -0.1592,  0.6016],
        ...,
        [ 4.1562, -4.5625, -0.6211, -4.6250],
        [ 0.4727,  0.9766,  0.0452,  2.4219],
        [-0.3965,  0.0728,  0.3320,  1.5938]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-3.5000,  1.6406,  1.0859, -0.0508],
        [ 5.5000, -1.6562, -0.2891,  0.9844],
        [ 3.2812, -0.5859, -0.1592,  0.6016],
        ...,
        [ 4.1562, -4.5625, -0.6211, -4.6250],
        [ 0.4727,  0.9766,  0.0452,  2.4219],
        [-0.3965,  0.0728,  0.3320,  1.5938]], requires_grad=True)
2024-10-08 15:06:57,324 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-3.4844,  1.6016,  1.0312, -0.0574],
        [ 5.5625, -1.6875, -0.2480,  0.9062],
        [ 3.3906, -0.5938, -0.1328,  0.5938],
        ...,
        [ 3.8750, -4.4688, -0.5977, -4.5625],
        [ 0.4629,  0.9805,  0.0552,  2.4062],
        [-0.2451, -0.0415,  0.2969,  1.5156]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-3.4844,  1.6016,  1.0312, -0.0574],
        [ 5.5625, -1.6875, -0.2480,  0.9062],
        [ 3.3906, -0.5938, -0.1328,  0.5938],
        ...,
        [ 3.8750, -4.4688, -0.5977, -4.5625],
        [ 0.4629,  0.9805,  0.0552,  2.4062],
        [-0.2451, -0.0415,  0.2969,  1.5156]], requires_grad=True)
2024-10-08 15:06:57,588 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-3.4688,  1.5391,  0.9531, -0.0732],
        [ 5.5938, -1.6484, -0.1216,  0.8516],
        [ 3.4688, -0.5742, -0.0835,  0.5898],
        ...,
        [ 3.6406, -4.4062, -0.6289, -4.5000],
        [ 0.4336,  1.0156,  0.1025,  2.3906],
        [-0.1099, -0.1147,  0.2930,  1.4453]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-3.4688,  1.5391,  0.9531, -0.0732],
        [ 5.5938, -1.6484, -0.1216,  0.8516],
        [ 3.4688, -0.5742, -0.0835,  0.5898],
        ...,
        [ 3.6406, -4.4062, -0.6289, -4.5000],
        [ 0.4336,  1.0156,  0.1025,  2.3906],
        [-0.1099, -0.1147,  0.2930,  1.4453]], requires_grad=True)
2024-10-08 15:06:57,851 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-3.4375e+00,  1.4766e+00,  8.8672e-01, -8.5938e-02],
        [ 5.5938e+00, -1.6094e+00, -1.0010e-02,  7.9297e-01],
        [ 3.5156e+00, -5.6641e-01, -5.2246e-02,  5.8203e-01],
        ...,
        [ 3.4219e+00, -4.3125e+00, -6.1719e-01, -4.4062e+00],
        [ 3.9648e-01,  1.0078e+00,  1.0352e-01,  2.3594e+00],
        [-5.2795e-03, -2.1875e-01,  2.4707e-01,  1.3750e+00]],
       requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-3.4375e+00,  1.4766e+00,  8.8672e-01, -8.5938e-02],
        [ 5.5938e+00, -1.6094e+00, -1.0010e-02,  7.9297e-01],
        [ 3.5156e+00, -5.6641e-01, -5.2246e-02,  5.8203e-01],
        ...,
        [ 3.4219e+00, -4.3125e+00, -6.1719e-01, -4.4062e+00],
        [ 3.9648e-01,  1.0078e+00,  1.0352e-01,  2.3594e+00],
        [-5.2795e-03, -2.1875e-01,  2.4707e-01,  1.3750e+00]],
       requires_grad=True)
2024-10-08 15:06:58,113 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-3.3906,  1.4297,  0.8320, -0.0928],
        [ 5.5938, -1.6250,  0.0184,  0.7383],
        [ 3.5469, -0.5859, -0.0554,  0.5703],
        ...,
        [ 3.2188, -4.1562, -0.5391, -4.3125],
        [ 0.3574,  0.9688,  0.0723,  2.3281],
        [ 0.0806, -0.3359,  0.1777,  1.3125]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-3.3906,  1.4297,  0.8320, -0.0928],
        [ 5.5938, -1.6250,  0.0184,  0.7383],
        [ 3.5469, -0.5859, -0.0554,  0.5703],
        ...,
        [ 3.2188, -4.1562, -0.5391, -4.3125],
        [ 0.3574,  0.9688,  0.0723,  2.3281],
        [ 0.0806, -0.3359,  0.1777,  1.3125]], requires_grad=True)
2024-10-08 15:06:58,365 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-3.3281,  1.3750,  0.7656, -0.1055],
        [ 5.5312, -1.5469,  0.1416,  0.7031],
        [ 3.5625, -0.5938, -0.0508,  0.5625],
        ...,
        [ 3.0938, -4.0625, -0.5391, -4.2188],
        [ 0.2891,  0.9805,  0.0889,  2.3125],
        [ 0.1226, -0.3770,  0.1719,  1.2578]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-3.3281,  1.3750,  0.7656, -0.1055],
        [ 5.5312, -1.5469,  0.1416,  0.7031],
        [ 3.5625, -0.5938, -0.0508,  0.5625],
        ...,
        [ 3.0938, -4.0625, -0.5391, -4.2188],
        [ 0.2891,  0.9805,  0.0889,  2.3125],
        [ 0.1226, -0.3770,  0.1719,  1.2578]], requires_grad=True)
2024-10-08 15:06:58,623 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-3.2500,  1.2734,  0.6641, -0.1387],
        [ 5.3750, -1.3203,  0.4023,  0.7109],
        [ 3.5312, -0.5664, -0.0177,  0.5625],
        ...,
        [ 3.0312, -4.0625, -0.6250, -4.1562],
        [ 0.1865,  1.0391,  0.1543,  2.3125],
        [ 0.1147, -0.3340,  0.2324,  1.2266]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-3.2500,  1.2734,  0.6641, -0.1387],
        [ 5.3750, -1.3203,  0.4023,  0.7109],
        [ 3.5312, -0.5664, -0.0177,  0.5625],
        ...,
        [ 3.0312, -4.0625, -0.6250, -4.1562],
        [ 0.1865,  1.0391,  0.1543,  2.3125],
        [ 0.1147, -0.3340,  0.2324,  1.2266]], requires_grad=True)
2024-10-08 15:06:58,772 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-3.1875,  1.2109,  0.6016, -0.1553],
        [ 5.2188, -1.1562,  0.5977,  0.7031],
        [ 3.5156, -0.5742, -0.0150,  0.5547],
        ...,
        [ 2.9375, -3.9844, -0.6484, -4.0625],
        [ 0.1001,  1.0625,  0.1895,  2.2969],
        [ 0.1226, -0.3691,  0.2285,  1.1719]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-3.1875,  1.2109,  0.6016, -0.1553],
        [ 5.2188, -1.1562,  0.5977,  0.7031],
        [ 3.5156, -0.5742, -0.0150,  0.5547],
        ...,
        [ 2.9375, -3.9844, -0.6484, -4.0625],
        [ 0.1001,  1.0625,  0.1895,  2.2969],
        [ 0.1226, -0.3691,  0.2285,  1.1719]], requires_grad=True)
2024-10-08 15:06:58,936 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-3.1406,  1.1719,  0.5586, -0.1660],
        [ 5.0938, -1.0703,  0.7148,  0.6797],
        [ 3.4844, -0.5898, -0.0205,  0.5391],
        ...,
        [ 2.8125, -3.7969, -0.5820, -3.9531],
        [ 0.0601,  0.9805,  0.1445,  2.2500],
        [ 0.1377, -0.4629,  0.1807,  1.1094]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-3.1406,  1.1719,  0.5586, -0.1660],
        [ 5.0938, -1.0703,  0.7148,  0.6797],
        [ 3.4844, -0.5898, -0.0205,  0.5391],
        ...,
        [ 2.8125, -3.7969, -0.5820, -3.9531],
        [ 0.0601,  0.9805,  0.1445,  2.2500],
        [ 0.1377, -0.4629,  0.1807,  1.1094]], requires_grad=True)
2024-10-08 15:06:59,085 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-3.1875,  1.2578,  0.5859, -0.1230],
        [ 5.0000, -1.0703,  0.7656,  0.6328],
        [ 3.5156, -0.6719, -0.0698,  0.4961],
        ...,
        [ 2.6562, -3.5625, -0.4824, -3.8125],
        [ 0.0420,  0.8594,  0.0801,  2.1719],
        [ 0.1611, -0.5625,  0.1279,  1.0469]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-3.1875,  1.2578,  0.5859, -0.1230],
        [ 5.0000, -1.0703,  0.7656,  0.6328],
        [ 3.5156, -0.6719, -0.0698,  0.4961],
        ...,
        [ 2.6562, -3.5625, -0.4824, -3.8125],
        [ 0.0420,  0.8594,  0.0801,  2.1719],
        [ 0.1611, -0.5625,  0.1279,  1.0469]], requires_grad=True)
2024-10-08 15:06:59,349 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-3.2031e+00,  1.2891e+00,  5.9375e-01, -1.1426e-01],
        [ 4.8750e+00, -1.0234e+00,  8.3203e-01,  6.1328e-01],
        [ 3.5469e+00, -7.4219e-01, -1.0986e-01,  4.6289e-01],
        ...,
        [ 2.5156e+00, -3.3438e+00, -3.9062e-01, -3.6875e+00],
        [ 3.5858e-03,  7.6953e-01,  3.0029e-02,  2.1094e+00],
        [ 1.7969e-01, -6.3672e-01,  8.7402e-02,  9.9609e-01]],
       requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-3.2031e+00,  1.2891e+00,  5.9375e-01, -1.1426e-01],
        [ 4.8750e+00, -1.0234e+00,  8.3203e-01,  6.1328e-01],
        [ 3.5469e+00, -7.4219e-01, -1.0986e-01,  4.6289e-01],
        ...,
        [ 2.5156e+00, -3.3438e+00, -3.9062e-01, -3.6875e+00],
        [ 3.5858e-03,  7.6953e-01,  3.0029e-02,  2.1094e+00],
        [ 1.7969e-01, -6.3672e-01,  8.7402e-02,  9.9609e-01]],
       requires_grad=True)
2024-10-08 15:06:59,608 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-3.1250,  1.2578,  0.5742, -0.1484],
        [ 4.5938, -0.7383,  0.9844,  0.7227],
        [ 3.5312, -0.7578, -0.1289,  0.4648],
        ...,
        [ 2.6406, -3.3906, -0.4004, -3.6875],
        [-0.2197,  0.8828,  0.0549,  2.1562],
        [ 0.1348, -0.5469,  0.1025,  1.0469]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-3.1250,  1.2578,  0.5742, -0.1484],
        [ 4.5938, -0.7383,  0.9844,  0.7227],
        [ 3.5312, -0.7578, -0.1289,  0.4648],
        ...,
        [ 2.6406, -3.3906, -0.4004, -3.6875],
        [-0.2197,  0.8828,  0.0549,  2.1562],
        [ 0.1348, -0.5469,  0.1025,  1.0469]], requires_grad=True)
2024-10-08 15:06:59,865 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-2.9844,  1.1719,  0.5430, -0.2227],
        [ 4.2812, -0.3809,  1.1484,  0.8750],
        [ 3.4531, -0.7266, -0.1318,  0.4961],
        ...,
        [ 2.7969, -3.4844, -0.4316, -3.7188],
        [-0.4316,  0.9922,  0.0806,  2.2031],
        [ 0.0654, -0.4180,  0.1289,  1.1172]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-2.9844,  1.1719,  0.5430, -0.2227],
        [ 4.2812, -0.3809,  1.1484,  0.8750],
        [ 3.4531, -0.7266, -0.1318,  0.4961],
        ...,
        [ 2.7969, -3.4844, -0.4316, -3.7188],
        [-0.4316,  0.9922,  0.0806,  2.2031],
        [ 0.0654, -0.4180,  0.1289,  1.1172]], requires_grad=True)
2024-10-08 15:07:00,021 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-2.8750,  1.1016,  0.5156, -0.2871],
        [ 4.0312, -0.1226,  1.2734,  0.9805],
        [ 3.4219, -0.7266, -0.1416,  0.5039],
        ...,
        [ 2.8750, -3.5000, -0.4453, -3.7188],
        [-0.5156,  1.0000,  0.0825,  2.2031],
        [ 0.0276, -0.3398,  0.1436,  1.1641]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-2.8750,  1.1016,  0.5156, -0.2871],
        [ 4.0312, -0.1226,  1.2734,  0.9805],
        [ 3.4219, -0.7266, -0.1416,  0.5039],
        ...,
        [ 2.8750, -3.5000, -0.4453, -3.7188],
        [-0.5156,  1.0000,  0.0825,  2.2031],
        [ 0.0276, -0.3398,  0.1436,  1.1641]], requires_grad=True)
2024-10-08 15:07:00,272 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-2.6875,  1.0156,  0.4844, -0.3418],
        [ 3.7969,  0.0942,  1.3750,  1.0625],
        [ 3.5156, -0.8008, -0.1670,  0.4766],
        ...,
        [ 2.6094, -3.2969, -0.4062, -3.6406],
        [-0.3848,  0.8555,  0.0520,  2.1406],
        [ 0.0923, -0.3457,  0.1426,  1.1797]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-2.6875,  1.0156,  0.4844, -0.3418],
        [ 3.7969,  0.0942,  1.3750,  1.0625],
        [ 3.5156, -0.8008, -0.1670,  0.4766],
        ...,
        [ 2.6094, -3.2969, -0.4062, -3.6406],
        [-0.3848,  0.8555,  0.0520,  2.1406],
        [ 0.0923, -0.3457,  0.1426,  1.1797]], requires_grad=True)
2024-10-08 15:07:00,526 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-2.4844,  0.9453,  0.4629, -0.3613],
        [ 3.5469,  0.3223,  1.4688,  1.1406],
        [ 3.4688, -0.7773, -0.1680,  0.4980],
        ...,
        [ 2.6094, -3.2500, -0.4004, -3.5938],
        [-0.4258,  0.8164,  0.0420,  2.0938],
        [ 0.0742, -0.2236,  0.1680,  1.2500]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-2.4844,  0.9453,  0.4629, -0.3613],
        [ 3.5469,  0.3223,  1.4688,  1.1406],
        [ 3.4688, -0.7773, -0.1680,  0.4980],
        ...,
        [ 2.6094, -3.2500, -0.4004, -3.5938],
        [-0.4258,  0.8164,  0.0420,  2.0938],
        [ 0.0742, -0.2236,  0.1680,  1.2500]], requires_grad=True)
2024-10-08 15:07:00,782 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-2.2656,  0.8008,  0.4199, -0.4473],
        [ 3.2812,  0.5703,  1.5547,  1.2344],
        [ 3.3594, -0.7109, -0.1582,  0.5352],
        ...,
        [ 2.6719, -3.2500, -0.4062, -3.5469],
        [-0.5234,  0.8633,  0.0530,  2.0781],
        [ 0.0292, -0.0913,  0.1934,  1.3125]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-2.2656,  0.8008,  0.4199, -0.4473],
        [ 3.2812,  0.5703,  1.5547,  1.2344],
        [ 3.3594, -0.7109, -0.1582,  0.5352],
        ...,
        [ 2.6719, -3.2500, -0.4062, -3.5469],
        [-0.5234,  0.8633,  0.0530,  2.0781],
        [ 0.0292, -0.0913,  0.1934,  1.3125]], requires_grad=True)
2024-10-08 15:07:01,043 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.9531,  0.6406,  0.3770, -0.5117],
        [ 3.2031,  0.6797,  1.6016,  1.3047],
        [ 3.2812, -0.6523, -0.1494,  0.5703],
        ...,
        [ 2.3906, -3.0469, -0.3652, -3.4688],
        [-0.4629,  0.8008,  0.0393,  2.0469],
        [ 0.0435, -0.0342,  0.2021,  1.3516]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.9531,  0.6406,  0.3770, -0.5117],
        [ 3.2031,  0.6797,  1.6016,  1.3047],
        [ 3.2812, -0.6523, -0.1494,  0.5703],
        ...,
        [ 2.3906, -3.0469, -0.3652, -3.4688],
        [-0.4629,  0.8008,  0.0393,  2.0469],
        [ 0.0435, -0.0342,  0.2021,  1.3516]], requires_grad=True)
2024-10-08 15:07:01,202 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.6562,  0.4961,  0.3379, -0.5625],
        [ 3.1094,  0.7969,  1.6484,  1.3672],
        [ 3.1875, -0.5898, -0.1387,  0.6016],
        ...,
        [ 2.1562, -2.9062, -0.3379, -3.4062],
        [-0.4199,  0.7578,  0.0297,  2.0156],
        [ 0.0322,  0.0698,  0.2217,  1.3984]], requires_grad=True)
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - LOG: Parameter containing:
tensor([[-1.6562,  0.4961,  0.3379, -0.5625],
        [ 3.1094,  0.7969,  1.6484,  1.3672],
        [ 3.1875, -0.5898, -0.1387,  0.6016],
        ...,
        [ 2.1562, -2.9062, -0.3379, -3.4062],
        [-0.4199,  0.7578,  0.0297,  2.0156],
        [ 0.0322,  0.0698,  0.2217,  1.3984]], requires_grad=True)
2024-10-08 15:07:01,392 a61583b9-75eb-4f84-b103-dd6c1b3ba32a - COMPLETED: Your job has been completed.
INFO:nnsight_remote:a61583b9-75eb-4f84-b103-dd6c1b3ba32a - COMPLETED: Your job has been completed.
Downloading result: 100%|██████████| 911k/911k [00:00<00:00, 2.31MB/s]
[ ]:
print(model)
LlamaForCausalLM(
  (model): LlamaModel(
    (embed_tokens): Embedding(128256, 8192)
    (layers): ModuleList(
      (0-79): 80 x LlamaDecoderLayer(
        (self_attn): LlamaSdpaAttention(
          (q_proj): Linear(in_features=8192, out_features=8192, bias=False)
          (k_proj): Linear(in_features=8192, out_features=1024, bias=False)
          (v_proj): Linear(in_features=8192, out_features=1024, bias=False)
          (o_proj): Linear(in_features=8192, out_features=8192, bias=False)
          (rotary_emb): LlamaRotaryEmbedding()
        )
        (mlp): LlamaMLP(
          (gate_proj): Linear(in_features=8192, out_features=28672, bias=False)
          (up_proj): Linear(in_features=8192, out_features=28672, bias=False)
          (down_proj): Linear(in_features=28672, out_features=8192, bias=False)
          (act_fn): SiLU()
        )
        (input_layernorm): LlamaRMSNorm((8192,), eps=1e-05)
        (post_attention_layernorm): LlamaRMSNorm((8192,), eps=1e-05)
      )
    )
    (norm): LlamaRMSNorm((8192,), eps=1e-05)
    (rotary_emb): LlamaRotaryEmbedding()
  )
  (lm_head): Linear(in_features=8192, out_features=128256, bias=False)
  (generator): WrapperModule()
)

In addition to the weights changing, we know the LoRA has been applied because there is a difference in the model’s architecture. The 11th block of the model no longer has the standard MLP layer and instead contains the LoRA.

Now it is time to test out whether our fine tuned model is able to predict the sentiment of a given sentence.

[ ]:
# With lora. Will output "negative".
with model.generate("I'm upset", remote=True) as generator:
  lora()
  out = model.lm_head.output.save()

# The model outputs the sentiment as tokens first.
token_ids = out.argmax(dim=-1)

# Convert the tokens to either positive or negative
count_positive = (token_ids == 1).sum().item()
count_negative = (token_ids == 0).sum().item()

# Determine the overall sentiment of the entire sentence
if count_positive > count_negative:
  print("\nPrediction with LoRA: Positive\n")
else:
  print("\nPrediction with LoRA: Negative\n")

# Then without. It will try to complete the sentence rather than output the
# sentiment analysis.

with model.generate("I'm upset", remote=True) as generator:
    out = model.lm_head.output.save()

print("\nPrediction without LoRA:", model.tokenizer.decode(out.argmax(dim=-1)[0]))
2024-10-08 15:16:19,547 1e738b58-e05d-47f9-93c4-fb9ae84602b9 - RECEIVED: Your job has been received and is waiting approval.
INFO:nnsight_remote:1e738b58-e05d-47f9-93c4-fb9ae84602b9 - RECEIVED: Your job has been received and is waiting approval.
2024-10-08 15:16:19,586 1e738b58-e05d-47f9-93c4-fb9ae84602b9 - RUNNING: Your job has started running.
INFO:nnsight_remote:1e738b58-e05d-47f9-93c4-fb9ae84602b9 - RUNNING: Your job has started running.
2024-10-08 15:16:19,598 1e738b58-e05d-47f9-93c4-fb9ae84602b9 - APPROVED: Your job was approved and is waiting to be run.
INFO:nnsight_remote:1e738b58-e05d-47f9-93c4-fb9ae84602b9 - APPROVED: Your job was approved and is waiting to be run.
2024-10-08 15:16:20,109 1e738b58-e05d-47f9-93c4-fb9ae84602b9 - COMPLETED: Your job has been completed.
INFO:nnsight_remote:1e738b58-e05d-47f9-93c4-fb9ae84602b9 - COMPLETED: Your job has been completed.
Downloading result: 100%|██████████| 1.03M/1.03M [00:00<00:00, 1.98MB/s]

Prediction with LoRA: Negative

2024-10-08 15:16:22,933 1ad601ee-b03b-4e6c-9f43-6dcd3cc9a02f - RECEIVED: Your job has been received and is waiting approval.
INFO:nnsight_remote:1ad601ee-b03b-4e6c-9f43-6dcd3cc9a02f - RECEIVED: Your job has been received and is waiting approval.
2024-10-08 15:16:25,291 1ad601ee-b03b-4e6c-9f43-6dcd3cc9a02f - APPROVED: Your job was approved and is waiting to be run.
INFO:nnsight_remote:1ad601ee-b03b-4e6c-9f43-6dcd3cc9a02f - APPROVED: Your job was approved and is waiting to be run.
2024-10-08 15:16:25,302 1ad601ee-b03b-4e6c-9f43-6dcd3cc9a02f - RUNNING: Your job has started running.
INFO:nnsight_remote:1ad601ee-b03b-4e6c-9f43-6dcd3cc9a02f - RUNNING: Your job has started running.
2024-10-08 15:16:25,478 1ad601ee-b03b-4e6c-9f43-6dcd3cc9a02f - COMPLETED: Your job has been completed.
INFO:nnsight_remote:1ad601ee-b03b-4e6c-9f43-6dcd3cc9a02f - COMPLETED: Your job has been completed.
Downloading result: 100%|██████████| 1.03M/1.03M [00:00<00:00, 2.59MB/s]

Prediction without LoRA: Question have a that