Dataset Viewer
The dataset viewer is not available for this split.
Cannot load the dataset split (in streaming mode) to extract the first rows.
Error code:   StreamingRowsError
Exception:    CastError
Message:      Couldn't cast
type: string
version: int64
id: string
timestamp: string
cwd: string
parentId: string
message: struct<role: string, content: list<item: struct<type: string, text: string>>>
  child 0, role: string
  child 1, content: list<item: struct<type: string, text: string>>
      child 0, item: struct<type: string, text: string>
          child 0, type: string
          child 1, text: string
api: string
provider: string
model: string
usage: struct<input: int64, output: int64>
  child 0, input: int64
  child 1, output: int64
stopReason: string
to
{'type': Value('string'), 'version': Value('int64'), 'id': Value('string'), 'timestamp': Value('string'), 'cwd': Value('string'), 'parentId': Value('null'), 'message': {'role': Value('string'), 'content': List({'type': Value('string'), 'text': Value('string')})}}
because column names don't match
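The message above reports a column-name mismatch between the data being read and the schema it is cast to. A minimal sketch of that comparison follows, using only the column names printed in the message; the helper name `diff_columns` is hypothetical and is not part of the `datasets` library.

```python
# Column names copied from the "Couldn't cast" side of the message (the data being read).
source_columns = {
    "type", "version", "id", "timestamp", "cwd", "parentId", "message",
    "api", "provider", "model", "usage", "stopReason",
}

# Column names copied from the "to" side of the message (the expected schema).
target_columns = {
    "type", "version", "id", "timestamp", "cwd", "parentId", "message",
}


def diff_columns(source, target):
    """Hypothetical helper: return (names only in source, names only in target)."""
    return sorted(source - target), sorted(target - source)


extra, missing = diff_columns(source_columns, target_columns)
print("only in the data:", extra)
print("only in the expected schema:", missing)
```

With the names listed in the message, the five columns `api`, `provider`, `model`, `usage`, and `stopReason` appear only in the data. The message also shows `parentId` as `string` in the data and as `null` in the expected schema, which this name-only comparison does not cover.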
Traceback:    Traceback (most recent call last):
                File "/src/services/worker/src/worker/utils.py", line 99, in get_rows_or_raise
                  return get_rows(
                         ^^^^^^^^^
                File "/src/libs/libcommon/src/libcommon/utils.py", line 272, in decorator
                  return func(*args, **kwargs)
                         ^^^^^^^^^^^^^^^^^^^^^
                File "/src/services/worker/src/worker/utils.py", line 77, in get_rows
                  rows_plus_one = list(itertools.islice(ds, rows_max_number + 1))
                                  ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
                File "/usr/local/lib/python3.12/site-packages/datasets/iterable_dataset.py", line 2690, in __iter__
                  for key, example in ex_iterable:
                                      ^^^^^^^^^^^
                File "/usr/local/lib/python3.12/site-packages/datasets/iterable_dataset.py", line 2227, in __iter__
                  for key, pa_table in self._iter_arrow():
                                       ^^^^^^^^^^^^^^^^^^
                File "/usr/local/lib/python3.12/site-packages/datasets/iterable_dataset.py", line 2251, in _iter_arrow
                  for key, pa_table in self.ex_iterable._iter_arrow():
                                       ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
                File "/usr/local/lib/python3.12/site-packages/datasets/iterable_dataset.py", line 494, in _iter_arrow
                  for key, pa_table in iterator:
                                       ^^^^^^^^
                File "/usr/local/lib/python3.12/site-packages/datasets/iterable_dataset.py", line 384, in _iter_arrow
                  for key, pa_table in self.generate_tables_fn(**gen_kwags):
                                       ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
                File "/usr/local/lib/python3.12/site-packages/datasets/packaged_modules/json/json.py", line 289, in _generate_tables
                  self._cast_table(pa_table, json_field_paths=json_field_paths),
                  ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
                File "/usr/local/lib/python3.12/site-packages/datasets/packaged_modules/json/json.py", line 124, in _cast_table
                  pa_table = table_cast(pa_table, self.info.features.arrow_schema)
                             ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
                File "/usr/local/lib/python3.12/site-packages/datasets/table.py", line 2272, in table_cast
                  return cast_table_to_schema(table, schema)
                         ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
                File "/usr/local/lib/python3.12/site-packages/datasets/table.py", line 2218, in cast_table_to_schema
                  raise CastError(
              datasets.table.CastError: Couldn't cast
              type: string
              version: int64
              id: string
              timestamp: string
              cwd: string
              parentId: string
              message: struct<role: string, content: list<item: struct<type: string, text: string>>>
                child 0, role: string
                child 1, content: list<item: struct<type: string, text: string>>
                    child 0, item: struct<type: string, text: string>
                        child 0, type: string
                        child 1, text: string
              api: string
              provider: string
              model: string
              usage: struct<input: int64, output: int64>
                child 0, input: int64
                child 1, output: int64
              stopReason: string
              to
              {'type': Value('string'), 'version': Value('int64'), 'id': Value('string'), 'timestamp': Value('string'), 'cwd': Value('string'), 'parentId': Value('null'), 'message': {'role': Value('string'), 'content': List({'type': Value('string'), 'text': Value('string')})}}
              because column names don't match
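One possible workaround, sketched here under assumptions, is to rewrite the JSON Lines records so that every row carries the same set of keys, filling absent ones with `null`. The traceback shows the rows going through the `datasets` JSON loader, but whether this change is enough for the viewer is not confirmed by the log; in particular it may not settle the `parentId` type. The function names and the sample rows below are hypothetical.

```python
import json

# Union of the column names that appear in the error message.
ALL_COLUMNS = [
    "type", "version", "id", "timestamp", "cwd", "parentId", "message",
    "api", "provider", "model", "usage", "stopReason",
]


def normalize_record(record, columns=ALL_COLUMNS):
    """Return a copy of record with exactly the given keys; absent keys become None."""
    return {name: record.get(name) for name in columns}


def normalize_jsonl(lines, columns=ALL_COLUMNS):
    """Yield one normalized JSON string per non-empty input line."""
    for line in lines:
        line = line.strip()
        if line:
            yield json.dumps(normalize_record(json.loads(line), columns))


# Hypothetical sample rows: the first lacks the five extra columns, the second has them.
sample = [
    '{"type": "session", "version": 1, "id": "a", "parentId": null}',
    '{"type": "message", "id": "b", "parentId": "a", "model": "m", "stopReason": "stop"}',
]
normalized = [json.loads(row) for row in normalize_jsonl(sample)]
for row in normalized:
    print(sorted(row))
```

After normalization every row exposes the same twelve keys, so a reader that infers the schema from the first rows sees the same column names that it meets later in the file.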

