Axiological Foundations of Generative AI Systems: An alignment beyond Human Values

  • Del Carpio Castro, Luis Alfonso (Investigador principal)
  • Fernandez Concha, Rafael Alejandro (Coinvestigador)
  • Marchena Sekli, Giulio Franz (Coinvestigador)
  • Solorzano Muñante, Maria Del Carmen (Otro)

Proyecto: Investigación

Detalles del proyecto

Descripción

SIN RESUMEN

Objetivo General

The main objective of this research is to critically examine and strengthen the axiological (values-based) foundations of GAI systems. In particular, the project aims to identify the values (explicit or implicit) that current state-of-the-art generative models operate under, evaluate the alignment of these values with widely endorsed human and societal values, and develop a framework or set of guidelines to improve value alignment in future generative AI systems.

Objetivos Especificos

OE1: Determine and map out the explicit design principles or value guidelines that have been used in the development of systems like ChatGPT, Gemini, and Claude (e.g., safety rules, content policies, RLHF reward models). OE2: Empirically evaluate the behavior of these AI systems in ethically salient scenarios to infer the values that appear to guide their outputs. OE3: Compare the models’ responses and underlying policies with established human values and ethical frameworks. Identify areas where the AI’s values seem misaligned or insufficient OE4: Gather insights from key stakeholders –including philosophers, linguists, management scholars and sociologists– on what values they believe GAI should embody.

Nivel de Investigación

Investigacion basica

Enfoque de Investigación

Disciplinario

Tipo de Proyecto

CONCURSO ANUAL DE INVESTIGACIÓN

Líneas de Investigación

  • 2 — Administración
  • 10 — Ciencia computacional
  • 52 — Filosofía práctica (moral, social y política)

Áreas de conocimiento OCDE

Ciencias sociales - Otras ciencias sociales - Otras ciencias sociales

Entidad Financiadora

PONTIFICIA UNIVERSIDAD CATÓLICA DEL PERÚ
Título cortoAXIOLOG FOUNDAT GENERA AI SYST
EstadoActivo
Fecha de inicio/Fecha fin1/09/2531/08/27