New York · Senate Bill · 2025-2026 Regular Sessions
SB6955
New York Senate Bill 6955-A — An Act to amend the general business law, in relation to establishing the artificial intelligence training data transparency act

Status ● Introduced Effective N/A Passage Likelihood L

WHAT THIS BILL REGULATES · 1 REQUIREMENT TYPE

How Is This Bill Enforced

Enforcement Authority
No enforcement mechanism, private right of action, or designated enforcement agency is specified in the bill text. The bill imposes a disclosure obligation but does not prescribe penalties or an enforcement pathway.
Private Right of Action
No private right of action. Enforcement is exclusive to the designated authority.
Penalties
The bill does not specify any monetary penalties, remedies, injunctive relief, or attorney's fees for noncompliance.

What This Bill Requires

Verbatim statutory text on the left; plain-language analysis and a per-section checklist on the right. Numbered markers cross-link to the matching checklist row.

Statutory Text
Analysis & Obligations
Gen. Bus. Law § 1430
Short title

This act shall be known and may be cited as the "artificial intelligenceArtificial intelligence"Artificial intelligence" or "artificial intelligence technology" means a machine-based system that can, for a given set of human-defined objectives, make predictions, recommendations, or decisions influencing real or virtual environments, and that uses machine- and human-based inputs to perceive real and virtual environments, abstract such perceptions into models through analysis in an automated manner, and use model inference to formulate options for information or action.Gen. Bus. Law § 1431(1) training data transparency act".

Establishes the short title of the new article as the "artificial intelligence training data transparency act." This is a naming provision and creates no compliance obligation.

Gen. Bus. Law § 1431
Definitions

(1) "Artificial intelligenceArtificial intelligence"Artificial intelligence" or "artificial intelligence technology" means a machine-based system that can, for a given set of human-defined objectives, make predictions, recommendations, or decisions influencing real or virtual environments, and that uses machine- and human-based inputs to perceive real and virtual environments, abstract such perceptions into models through analysis in an automated manner, and use model inference to formulate options for information or action.Gen. Bus. Law § 1431(1)" or "artificial intelligenceArtificial intelligence"Artificial intelligence" or "artificial intelligence technology" means a machine-based system that can, for a given set of human-defined objectives, make predictions, recommendations, or decisions influencing real or virtual environments, and that uses machine- and human-based inputs to perceive real and virtual environments, abstract such perceptions into models through analysis in an automated manner, and use model inference to formulate options for information or action.Gen. Bus. Law § 1431(1) technology" means a machine-based system that can, for a given set of human-defined objectives, make predictions, recommendations, or decisions influencing real or virtual environments, and that uses machine- and human-based inputs to perceive real and virtual environments, abstract such perceptions into models through analysis in an automated manner, and use model inference to formulate options for information or action.

(2) "DeveloperDeveloper"Developer" means a person, partnership, state or local government agency, or corporation that designs, codes, produces, or substantially modifies an artificial intelligence model or service for use by members of the public.Gen. Bus. Law § 1431(2)" means a person, partnership, state or local government agency, or corporation that designs, codes, produces, or substantially modifiesSubstantially modifies"Substantially modifies" or "substantial modification" means a new version, new release, or other update to a generative artificial intelligence model or service that materially changes its functionality or performance, including the results of retraining or fine tuning.Gen. Bus. Law § 1431(4) an artificial intelligenceArtificial intelligence"Artificial intelligence" or "artificial intelligence technology" means a machine-based system that can, for a given set of human-defined objectives, make predictions, recommendations, or decisions influencing real or virtual environments, and that uses machine- and human-based inputs to perceive real and virtual environments, abstract such perceptions into models through analysis in an automated manner, and use model inference to formulate options for information or action.Gen. Bus. Law § 1431(1) model or service for use by members of the public.

(3) "Generative artificial intelligenceGenerative artificial intelligence"Generative artificial intelligence" means a class of AI models that emulate the structure and characteristics of input data to generate derived synthetic content, including, but not limited to, images, videos, audio, text, and other digital content.Gen. Bus. Law § 1431(3)" means a class of AI modelsAI model"AI model" means an information system or component of an information system that implements artificial intelligence technology and uses computational, statistical, or machine-learning techniques to produce outputs from a given set of inputs.Gen. Bus. Law § 1431(8) that emulate the structure and characteristics of input data to generate derived synthetic content, including, but not limited to, images, videos, audio, text, and other digital content.

(4) "Substantially modifiesSubstantially modifies"Substantially modifies" or "substantial modification" means a new version, new release, or other update to a generative artificial intelligence model or service that materially changes its functionality or performance, including the results of retraining or fine tuning.Gen. Bus. Law § 1431(4)" or "substantial modification" means a new version, new release, or other update to a generative artificial intelligenceGenerative artificial intelligence"Generative artificial intelligence" means a class of AI models that emulate the structure and characteristics of input data to generate derived synthetic content, including, but not limited to, images, videos, audio, text, and other digital content.Gen. Bus. Law § 1431(3) model or service that materially changes its functionality or performance, including the results of retraining or fine tuning.

(5) "Synthetic data generationSynthetic data generation"Synthetic data generation" means a process in which seed data is used to create artificial data that have some of the statistical characteristics of the seed data.Gen. Bus. Law § 1431(5)" means a process in which seed data is used to create artificial data that have some of the statistical characteristics of the seed data.

(6) "Train a generative artificial intelligence model or serviceTrain a generative artificial intelligence model or service"Train a generative artificial intelligence model or service" includes testing, validating, or fine tuning by the developer of the artificial intelligence model or service.Gen. Bus. Law § 1431(6)" includes testing, validating, or fine tuning by the developer of the artificial intelligenceArtificial intelligence"Artificial intelligence" or "artificial intelligence technology" means a machine-based system that can, for a given set of human-defined objectives, make predictions, recommendations, or decisions influencing real or virtual environments, and that uses machine- and human-based inputs to perceive real and virtual environments, abstract such perceptions into models through analysis in an automated manner, and use model inference to formulate options for information or action.Gen. Bus. Law § 1431(1) model or service.

(7) "Aggregate consumer informationAggregate consumer information"Aggregate consumer information" means information that relates to a group of consumers, from which individual consumer identities have been removed, that is not linked or reasonably linkable to any consumer or household, including via a device. Aggregate consumer information does not mean one or more individual consumer records that have been de-identified.Gen. Bus. Law § 1431(7)" means information that relates to a group of consumers, from which individual consumer identities have been removed, that is not linked or reasonably linkable to any consumer or household, including via a device. Aggregate consumer informationAggregate consumer information"Aggregate consumer information" means information that relates to a group of consumers, from which individual consumer identities have been removed, that is not linked or reasonably linkable to any consumer or household, including via a device. Aggregate consumer information does not mean one or more individual consumer records that have been de-identified.Gen. Bus. Law § 1431(7) does not mean one or more individual consumer records that have been de-identified.

(8) "AI modelAI model"AI model" means an information system or component of an information system that implements artificial intelligence technology and uses computational, statistical, or machine-learning techniques to produce outputs from a given set of inputs.Gen. Bus. Law § 1431(8)" means an information system or component of an information system that implements artificial intelligenceArtificial intelligence"Artificial intelligence" or "artificial intelligence technology" means a machine-based system that can, for a given set of human-defined objectives, make predictions, recommendations, or decisions influencing real or virtual environments, and that uses machine- and human-based inputs to perceive real and virtual environments, abstract such perceptions into models through analysis in an automated manner, and use model inference to formulate options for information or action.Gen. Bus. Law § 1431(1) technology and uses computational, statistical, or machine-learning techniques to produce outputs from a given set of inputs.

Defines eight key terms used throughout the article, including "artificial intelligence," "developer," "generative artificial intelligence," "substantial modification," "synthetic data generation," "train," "aggregate consumer information," and "AI model." Notably, the definition of "developer" encompasses persons, partnerships, state or local government agencies, and corporations, extending the disclosure obligation beyond private-sector actors. The definition of "train" is broadly drawn to include testing, validating, and fine tuning.

Gen. Bus. Law § 1432
Data used to train generative artificial intelligence models or services
Developer

(1)(a)–(l) 1 On or before January first, two thousand twenty-seven, and prior to each time thereafter that a generative artificial intelligenceGenerative artificial intelligence"Generative artificial intelligence" means a class of AI models that emulate the structure and characteristics of input data to generate derived synthetic content, including, but not limited to, images, videos, audio, text, and other digital content.Gen. Bus. Law § 1431(3) model or service, or a substantial modification to a generative artificial intelligenceGenerative artificial intelligence"Generative artificial intelligence" means a class of AI models that emulate the structure and characteristics of input data to generate derived synthetic content, including, but not limited to, images, videos, audio, text, and other digital content.Gen. Bus. Law § 1431(3) model or service, released on or after January first, two thousand twenty-two, is made publicly available to New Yorkers for use, regardless of whether the terms of such use include compensation, the developer of such model or service shall post on the developerDeveloper"Developer" means a person, partnership, state or local government agency, or corporation that designs, codes, produces, or substantially modifies an artificial intelligence model or service for use by members of the public.Gen. Bus. Law § 1431(2)'s website documentation regarding the data used by the developerDeveloper"Developer" means a person, partnership, state or local government agency, or corporation that designs, codes, produces, or substantially modifies an artificial intelligence model or service for use by members of the public.Gen. Bus. Law § 1431(2) to train the generative artificial intelligenceGenerative artificial intelligence"Generative artificial intelligence" means a class of AI models that emulate the structure and characteristics of input data to generate derived synthetic content, including, but not limited to, images, videos, audio, text, and other digital content.Gen. Bus. Law § 1431(3) model or service, including a high-level summary of the datasets used in the development of the generative artificial intelligenceGenerative artificial intelligence"Generative artificial intelligence" means a class of AI models that emulate the structure and characteristics of input data to generate derived synthetic content, including, but not limited to, images, videos, audio, text, and other digital content.Gen. Bus. Law § 1431(3) model or service, including, but not limited to: (a) the sources or owners of the datasets; (b) a description of how the datasets further the intended purpose of the artificial intelligenceArtificial intelligence"Artificial intelligence" or "artificial intelligence technology" means a machine-based system that can, for a given set of human-defined objectives, make predictions, recommendations, or decisions influencing real or virtual environments, and that uses machine- and human-based inputs to perceive real and virtual environments, abstract such perceptions into models through analysis in an automated manner, and use model inference to formulate options for information or action.Gen. Bus. Law § 1431(1) model or service; (c) the number of data points included in the datasets, which may be in general ranges, and with estimated figures for dynamic datasets; (d) a description of the types of data points within the datasets. For purposes of this paragraph, the following definitions apply: (i) as applied to datasets that include labels, "types of data points" means the types of labels used; and (ii) as applied to datasets without labeling, "types of data points" refers to the general characteristics; (e) whether the datasets include any data protected by copyright, trademark, or patent, or whether the datasets are entirely in the public domain; (f) whether the datasets were purchased or licensed by the developerDeveloper"Developer" means a person, partnership, state or local government agency, or corporation that designs, codes, produces, or substantially modifies an artificial intelligence model or service for use by members of the public.Gen. Bus. Law § 1431(2); (g) whether the datasets include personal information or personal identifying information, as defined in section eight hundred ninety-nine-aaa of this chapter; (h) whether the datasets include aggregate consumer informationAggregate consumer information"Aggregate consumer information" means information that relates to a group of consumers, from which individual consumer identities have been removed, that is not linked or reasonably linkable to any consumer or household, including via a device. Aggregate consumer information does not mean one or more individual consumer records that have been de-identified.Gen. Bus. Law § 1431(7); (i) whether there was any cleaning, processing, or other modification to the datasets by the developerDeveloper"Developer" means a person, partnership, state or local government agency, or corporation that designs, codes, produces, or substantially modifies an artificial intelligence model or service for use by members of the public.Gen. Bus. Law § 1431(2), including the intended purpose of those efforts in relation to the artificial intelligenceArtificial intelligence"Artificial intelligence" or "artificial intelligence technology" means a machine-based system that can, for a given set of human-defined objectives, make predictions, recommendations, or decisions influencing real or virtual environments, and that uses machine- and human-based inputs to perceive real and virtual environments, abstract such perceptions into models through analysis in an automated manner, and use model inference to formulate options for information or action.Gen. Bus. Law § 1431(1) model or service; (j) the time period during which the data in the datasets were collected, including a notice if the data collection is ongoing; (k) the dates the datasets were first used during the development of the artificial intelligenceArtificial intelligence"Artificial intelligence" or "artificial intelligence technology" means a machine-based system that can, for a given set of human-defined objectives, make predictions, recommendations, or decisions influencing real or virtual environments, and that uses machine- and human-based inputs to perceive real and virtual environments, abstract such perceptions into models through analysis in an automated manner, and use model inference to formulate options for information or action.Gen. Bus. Law § 1431(1) model or service; and (l) whether the generative artificial intelligenceGenerative artificial intelligence"Generative artificial intelligence" means a class of AI models that emulate the structure and characteristics of input data to generate derived synthetic content, including, but not limited to, images, videos, audio, text, and other digital content.Gen. Bus. Law § 1431(3) model or service used or continuously uses synthetic data generationSynthetic data generation"Synthetic data generation" means a process in which seed data is used to create artificial data that have some of the statistical characteristics of the seed data.Gen. Bus. Law § 1431(5) in its development. A developerDeveloper"Developer" means a person, partnership, state or local government agency, or corporation that designs, codes, produces, or substantially modifies an artificial intelligence model or service for use by members of the public.Gen. Bus. Law § 1431(2) may include a description of the functional need or desired purpose of the synthetic data in relation to the intended purpose of the model or service.

(2)(a)–(b) 1 A developerDeveloper"Developer" means a person, partnership, state or local government agency, or corporation that designs, codes, produces, or substantially modifies an artificial intelligence model or service for use by members of the public.Gen. Bus. Law § 1431(2) shall not be required to post documentation regarding the data used to train a generative artificial intelligence model or serviceTrain a generative artificial intelligence model or service"Train a generative artificial intelligence model or service" includes testing, validating, or fine tuning by the developer of the artificial intelligence model or service.Gen. Bus. Law § 1431(6) for any of the following: (a) A generative artificial intelligenceGenerative artificial intelligence"Generative artificial intelligence" means a class of AI models that emulate the structure and characteristics of input data to generate derived synthetic content, including, but not limited to, images, videos, audio, text, and other digital content.Gen. Bus. Law § 1431(3) model or service whose sole purpose is the operation of aircraft in the national airspace; or (b) A generative artificial intelligenceGenerative artificial intelligence"Generative artificial intelligence" means a class of AI models that emulate the structure and characteristics of input data to generate derived synthetic content, including, but not limited to, images, videos, audio, text, and other digital content.Gen. Bus. Law § 1431(3) model or service developed for national security, military, or defense purposes that is made available only to a federal entity.

Section 1432 is the sole operative provision of the bill. It requires developers of generative AI models or services to post on their website detailed documentation about the data used to train their models, covering twelve enumerated categories of information. The obligation applies to any generative AI model or service made publicly available to New Yorkers — whether free or compensated — that was released on or after January 1, 2022. The initial posting deadline is January 1, 2027, with ongoing disclosure required before each subsequent release or substantial modification.

Subsection 2 carves out two narrow exemptions: generative AI models whose sole purpose is aircraft operation in the national airspace, and models developed for national security, military, or defense purposes made available only to a federal entity. The disclosure requirement is a public website posting obligation — there is no regulator submission requirement and no enforcement mechanism specified in the bill.

Compliance actions 1 item
1
DevelopersDeveloper"Developer" means a person, partnership, state or local government agency, or corporation that designs, codes, produces, or substantially modifies an artificial intelligence model or service for use by members of the public.Gen. Bus. Law § 1431(2) must post on their website, on or before January 1, 2027, and before each subsequent public release or substantial modification of a generative AI modelAI model"AI model" means an information system or component of an information system that implements artificial intelligence technology and uses computational, statistical, or machine-learning techniques to produce outputs from a given set of inputs.Gen. Bus. Law § 1431(8) or service released on or after January 1, 2022 and made available to New Yorkers, documentation regarding the data used to train the model or service. The documentation must include a high-level summary of the training datasets covering at least twelve categories: (1) sources or owners of the datasets; (2) how the datasets further the model's intended purpose; (3) the number of data points (in general ranges, with estimates for dynamic datasets); (4) the types of data points (label types for labeled datasets, general characteristics for unlabeled datasets); (5) whether the datasets include data protected by copyright, trademark, or patent, or are entirely in the public domain; (6) whether the datasets were purchased or licensed; (7) whether the datasets include personal information or personal identifying information; (8) whether the datasets include aggregate consumer informationAggregate consumer information"Aggregate consumer information" means information that relates to a group of consumers, from which individual consumer identities have been removed, that is not linked or reasonably linkable to any consumer or household, including via a device. Aggregate consumer information does not mean one or more individual consumer records that have been de-identified.Gen. Bus. Law § 1431(7); (9) whether there was any cleaning, processing, or other modification, and the purpose of those efforts; (10) the time period during which data was collected, including notice if collection is ongoing; (11) the dates datasets were first used in development; and (12) whether the model uses or continuously uses synthetic data generationSynthetic data generation"Synthetic data generation" means a process in which seed data is used to create artificial data that have some of the statistical characteristics of the seed data.Gen. Bus. Law § 1431(5). Exemptions apply for generative AI modelsAI model"AI model" means an information system or component of an information system that implements artificial intelligence technology and uses computational, statistical, or machine-learning techniques to produce outputs from a given set of inputs.Gen. Bus. Law § 1431(8) whose sole purpose is aircraft operation in the national airspace, and for models developed for national security, military, or defense purposes available only to a federal entity.
T-03.2

Passage Likelihood

Low
Status Introduced
Chamber No passage
Committee No action
Majority party Yes
Bipartisan No
Prior session None

Legislative History

2025-03-27 REFERRED TO INTERNET AND TECHNOLOGY
2026-01-07 REFERRED TO INTERNET AND TECHNOLOGY
2026-02-18 AMEND AND RECOMMIT TO INTERNET AND TECHNOLOGY
2026-02-18 PRINT NUMBER 6955A
2026-02-25 1ST REPORT CAL.433
2026-02-26 2ND REPORT CAL.
2026-03-04 ADVANCED TO THIRD READING

Entry Last Reviewed

2026-05-20
AI generated