pub enum EvaluationResults {
Show 23 variants ExactMatchResults(ExactMatchResults), BleuResults(BleuResults), RougeResults(RougeResults), FluencyResult(FluencyResult), CoherenceResult(CoherenceResult), SafetyResult(SafetyResult), GroundednessResult(GroundednessResult), FulfillmentResult(FulfillmentResult), SummarizationQualityResult(SummarizationQualityResult), PairwiseSummarizationQualityResult(PairwiseSummarizationQualityResult), SummarizationHelpfulnessResult(SummarizationHelpfulnessResult), SummarizationVerbosityResult(SummarizationVerbosityResult), QuestionAnsweringQualityResult(QuestionAnsweringQualityResult), PairwiseQuestionAnsweringQualityResult(PairwiseQuestionAnsweringQualityResult), QuestionAnsweringRelevanceResult(QuestionAnsweringRelevanceResult), QuestionAnsweringHelpfulnessResult(QuestionAnsweringHelpfulnessResult), QuestionAnsweringCorrectnessResult(QuestionAnsweringCorrectnessResult), PointwiseMetricResult(PointwiseMetricResult), PairwiseMetricResult(PairwiseMetricResult), ToolCallValidResults(ToolCallValidResults), ToolNameMatchResults(ToolNameMatchResults), ToolParameterKeyMatchResults(ToolParameterKeyMatchResults), ToolParameterKvMatchResults(ToolParameterKvMatchResults),
}
Expand description

Evaluation results will be served in the same order as presented in EvaluationRequest.instances.

Variants§

§

ExactMatchResults(ExactMatchResults)

Auto metric evaluation results. Results for exact match metric.

§

BleuResults(BleuResults)

Results for bleu metric.

§

RougeResults(RougeResults)

Results for rouge metric.

§

FluencyResult(FluencyResult)

LLM-based metric evaluation result. General text generation metrics, applicable to other categories. Result for fluency metric.

§

CoherenceResult(CoherenceResult)

Result for coherence metric.

§

SafetyResult(SafetyResult)

Result for safety metric.

§

GroundednessResult(GroundednessResult)

Result for groundedness metric.

§

FulfillmentResult(FulfillmentResult)

Result for fulfillment metric.

§

SummarizationQualityResult(SummarizationQualityResult)

Summarization only metrics. Result for summarization quality metric.

§

PairwiseSummarizationQualityResult(PairwiseSummarizationQualityResult)

Result for pairwise summarization quality metric.

§

SummarizationHelpfulnessResult(SummarizationHelpfulnessResult)

Result for summarization helpfulness metric.

§

SummarizationVerbosityResult(SummarizationVerbosityResult)

Result for summarization verbosity metric.

§

QuestionAnsweringQualityResult(QuestionAnsweringQualityResult)

Question answering only metrics. Result for question answering quality metric.

§

PairwiseQuestionAnsweringQualityResult(PairwiseQuestionAnsweringQualityResult)

Result for pairwise question answering quality metric.

§

QuestionAnsweringRelevanceResult(QuestionAnsweringRelevanceResult)

Result for question answering relevance metric.

§

QuestionAnsweringHelpfulnessResult(QuestionAnsweringHelpfulnessResult)

Result for question answering helpfulness metric.

§

QuestionAnsweringCorrectnessResult(QuestionAnsweringCorrectnessResult)

Result for question answering correctness metric.

§

PointwiseMetricResult(PointwiseMetricResult)

Generic metrics. Result for pointwise metric.

§

PairwiseMetricResult(PairwiseMetricResult)

Result for pairwise metric.

§

ToolCallValidResults(ToolCallValidResults)

Tool call metrics. Results for tool call valid metric.

§

ToolNameMatchResults(ToolNameMatchResults)

Results for tool name match metric.

§

ToolParameterKeyMatchResults(ToolParameterKeyMatchResults)

Results for tool parameter key match metric.

§

ToolParameterKvMatchResults(ToolParameterKvMatchResults)

Results for tool parameter key value match metric.

Implementations§

source§

impl EvaluationResults

source

pub fn encode(&self, buf: &mut impl BufMut)

Encodes the message to a buffer.

source

pub fn merge( field: &mut Option<EvaluationResults>, tag: u32, wire_type: WireType, buf: &mut impl Buf, ctx: DecodeContext, ) -> Result<(), DecodeError>

Decodes an instance of the message from a buffer, and merges it into self.

source

pub fn encoded_len(&self) -> usize

Returns the encoded length of the message without a length delimiter.

Trait Implementations§

source§

impl Clone for EvaluationResults

source§

fn clone(&self) -> EvaluationResults

Returns a copy of the value. Read more
1.0.0 · source§

fn clone_from(&mut self, source: &Self)

Performs copy-assignment from source. Read more
source§

impl Debug for EvaluationResults

source§

fn fmt(&self, f: &mut Formatter<'_>) -> Result

Formats the value using the given formatter. Read more
source§

impl PartialEq for EvaluationResults

source§

fn eq(&self, other: &EvaluationResults) -> bool

This method tests for self and other values to be equal, and is used by ==.
1.0.0 · source§

fn ne(&self, other: &Rhs) -> bool

This method tests for !=. The default implementation is almost always sufficient, and should not be overridden without very good reason.
source§

impl StructuralPartialEq for EvaluationResults

Auto Trait Implementations§

Blanket Implementations§

source§

impl<T> Any for T
where T: 'static + ?Sized,

source§

fn type_id(&self) -> TypeId

Gets the TypeId of self. Read more
source§

impl<T> Borrow<T> for T
where T: ?Sized,

source§

fn borrow(&self) -> &T

Immutably borrows from an owned value. Read more
source§

impl<T> BorrowMut<T> for T
where T: ?Sized,

source§

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value. Read more
source§

impl<T> From<T> for T

source§

fn from(t: T) -> T

Returns the argument unchanged.

§

impl<T> FromRef<T> for T
where T: Clone,

§

fn from_ref(input: &T) -> T

Converts to this type from a reference to the input type.
§

impl<T> Instrument for T

§

fn instrument(self, span: Span) -> Instrumented<Self>

Instruments this type with the provided [Span], returning an Instrumented wrapper. Read more
§

fn in_current_span(self) -> Instrumented<Self>

Instruments this type with the current Span, returning an Instrumented wrapper. Read more
source§

impl<T, U> Into<U> for T
where U: From<T>,

source§

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

source§

impl<T> IntoRequest<T> for T

source§

fn into_request(self) -> Request<T>

Wrap the input message T in a tonic::Request
source§

impl<T> ToOwned for T
where T: Clone,

§

type Owned = T

The resulting type after obtaining ownership.
source§

fn to_owned(&self) -> T

Creates owned data from borrowed data, usually by cloning. Read more
source§

fn clone_into(&self, target: &mut T)

Uses borrowed data to replace owned data, usually by cloning. Read more
source§

impl<T, U> TryFrom<U> for T
where U: Into<T>,

§

type Error = Infallible

The type returned in the event of a conversion error.
source§

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

Performs the conversion.
source§

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

§

type Error = <U as TryFrom<T>>::Error

The type returned in the event of a conversion error.
source§

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

Performs the conversion.
§

impl<V, T> VZip<V> for T
where V: MultiLane<T>,

§

fn vzip(self) -> V

§

impl<T> WithSubscriber for T

§

fn with_subscriber<S>(self, subscriber: S) -> WithDispatch<Self>
where S: Into<Dispatch>,

Attaches the provided Subscriber to this type, returning a [WithDispatch] wrapper. Read more
§

fn with_current_subscriber(self) -> WithDispatch<Self>

Attaches the current default Subscriber to this type, returning a [WithDispatch] wrapper. Read more