PickupNet / gen30
Viewer
!new Driver('driver56')
!driver56.id := 'D056'
!driver56.name := 'Scott Lang'
!new Driver('driver57')
!driver57.id := 'D057'
!driver57.name := 'Hope van Dyne'
!new Shipment('shipment59')
!shipment59.id := 'SHP12403'
!shipment59.status := #ASSIGNED
!new Shipment('shipment60')
!shipment60.id := 'SHP12404'
!shipment60.status := #UNDERWAY
!new Address('address59')
!address59.text := '5757 Quantum Realm Ln, San Francisco'
!new Address('address60')
!address60.text := '5858 Pym Tech Dr, San Francisco'
!new GeoLocation('geoLocation59')
!geoLocation59.latitude := 37.7749
!geoLocation59.longitude := -122.4194
!new GeoLocation('geoLocation60')
!geoLocation60.latitude := 37.7840
!geoLocation60.longitude := -122.4080
!insert (address59, geoLocation59) into AddressContainsGeoLocation
!insert (address60, geoLocation60) into AddressContainsGeoLocation
!new Customer('customer59')
!customer59.id := 'CUST059'
!customer59.name := 'Hank Pym'
!customer59.twitterUserName := '@originalantman'
!new Customer('customer60')
!customer60.id := 'CUST060'
!customer60.name := 'Janet van Dyne'
!customer60.twitterUserName := '@waspqueen'
!new Station('station30')
!insert (station30, customer59) into StationContainsCustomer
!insert (station30, customer60) into StationContainsCustomer
!insert (station30, shipment59) into StationShipment
!insert (station30, shipment60) into StationShipment
!insert (station30, driver56) into StationContainsDriver
!insert (station30, driver57) into StationContainsDriver
!insert (shipment59, address59) into ShipmentContainsPickUpAddress
!insert (shipment59, address60) into ShipmentContainsDeliveryAddress
!insert (shipment60, address60) into ShipmentContainsPickUpAddress
!insert (shipment60, address59) into ShipmentContainsDeliveryAddress
!insert (customer59, shipment59) into CustomerConsistsOfShipment
!insert (customer60, shipment60) into CustomerConsistsOfShipment
!insert (driver57, shipment59) into DriverShipment model PickupNet
enum ShipmentStatus {
NEW,
ASSIGNED,
UNDERWAY,
DELIVERED
}
class Driver
attributes
id : String
name : String
end
class Shipment
attributes
id : String
status : ShipmentStatus
end
class Address
attributes
text : String
end
class GeoLocation
attributes
latitude : Real
longitude : Real
end
class Station
end
class Customer
attributes
id : String
name : String
twitterUserName : String
end
association DriverShipment between
Driver [0..1] role driver
Shipment [0..*] role assignments
end
association ShipmentContainsPickUpAddress between
Shipment [0..*] role hasPickUpShipment
Address [1] role pickUpAddress
end
association ShipmentContainsDeliveryAddress between
Shipment [0..*] role hasDeliveryShipment
Address [1] role shipToAddress
end
composition AddressContainsGeoLocation between
Address [1]
GeoLocation [1] role geoLocation
end
composition CustomerConsistsOfShipment between
Customer [1] role orderer
Shipment [0..*] role orders
end
composition StationContainsCustomer between
Station [1]
Customer [0..*] role customers
end
association StationShipment between
Station [1]
Shipment [0..*] role shipments
end
composition StationContainsDriver between
Station [1]
Driver [0..*] role drivers
end
constraints
context Shipment inv uniqueShipmentId:
Shipment.allInstances->isUnique(s | s.id)
context Driver inv uniqueDriverId:
Driver.allInstances->isUnique(d | d.id)
context Customer inv uniqueCustomerId:
Customer.allInstances->isUnique(c | c.id)
context Shipment inv differentPickupAndDeliveryAddress:
self.pickUpAddress <> self.shipToAddress Given a conceptual model expressed in the UML-based Specification Environment (USE), your task is to generate valid and realistic instances that conform to the provided model. <requirements> - Instances must be syntactically correct according to the USE syntax_reference. - Avoid unnecessary comments and output the instance in plain text (i.e., not markdown). - Make sure instances fulfill all the model's constraints, and that multiplicities, relationships, and attributes are valid and realistic. - Provide multiple instances with diverse data values and structure. </requirements> <syntax_reference> Here there is a snippet showing how to create objects and set values in the specific .soil language required: -- This is a comment example -- Primitive data types: -- Integer i.e. 1, 2, 3, etc. -- Real i.e. 1.0, 21.89, 322.05556, etc. -- Boolean i.e. true or false -- String i.e. 'Hello World' -- You can create instances with the following syntax: !new <instance type>('<instance name>') -- Example: !new Client('client1') !new Store('store4') -- You can assign values to attributes for a created instance with the following syntax: !<instance name>.<attribute name> := <value> -- Example for different data types: !client1.clientId := 1 -- For Integer !client1.balance := 1123.45 -- For Real !client1.name := 'John' -- For Strings !store4.available := true -- For Boolean -- You can create associations between instances with the following syntax: !insert (<instance name1>, <instance name2>) into <association name> -- Example: !insert (client1, store4) into ClientStore -- Custom data types usage: -- dataType Location -- operations -- Location(x : Real, y : Real) -- some other operations -- end -- You can create custom data types by calling the constructor directly; in this case, the constructor of Location requires two arguments: x and y of type Real. So it can be used as follows: !store4.location := Location(14.0, 289.0) -- Enums usage: -- enum Type { Clothes, Shoes } -- Can be used as follows: !store4.type := #Clothes </syntax_reference> Please generate another instance that is structurally and semantically different from the previous ones. <role>
You are an expert software and system modeler. You are able to assess the semantic quality of object models that have been created to conform to a domain model. The models are defined in USE (UML-based Specification Environment) and OCL (Object Constraint Language).
Your primary capability is "Semantic Reality Checking". You do not just check for syntactic correctness; you check for real-world plausibility and logical consistency within a given domain.
</role>
<context>
The user will provide two types of content:
1. **Domain Model (.use)**: A class diagram definition including classes, attributes, enums, relationships, multiplicities and roles.
2. **Object Model (.soil)**: An object model. This object model can be seen as a script composed of instructions for the creation of objects, relationships and setting attribute values (snapshot).
Your goal is to act as a judge to determine if the object model represents a **REALISTIC** scenario based on the domain model and common sense real-world logic.
</context>
<definitions>
- **Realistic**: The object model is syntactically correct AND semantically plausible (e.g., A 'Person' has an age between 0 and 120; a 'Car' has a positive price).
- **Unrealistic**: The object model contains contradictions, impossible physical values, or nonsensical relationships (e.g., A 'Person' is their own father; a 'Product' has a negative weight).
- **Doubtful**: You cannot determine whether the object model is realistic or not.
</definitions>
<instructions>
Follow this thinking process strictly before generating the final output:
1. **Analyze the Domain (.use)**: Understand the classes and what they represent in the real world.
2. **Analyze the Instances (.soil)**: Map the created objects to their classes. Look at the specific values assigned to attributes and the relationships created between objects.
3. **Evaluate Semantics**:
- Apply "Common Sense Knowledge" to the attribute values.
- Check cardinality and relationship logic beyond simple OCL constraints.
- Identify any outliers or logical fallacies.
4. **Determine Verdict**: Select one of the defined labels (Realistic/Unrealistic/Doubtful).
</instructions>
<constraints>
- **Tone**: Objective, Analytical, Technical.
- **Verbosity**: Low. Be direct.
- **Reasoning**: The "Why" section must be concise and specific, citing variable names, objects, or relationships when possible.
- Do not output the internal thinking process. Only output the final formatted result.
</constraints>
<output_format>
Structure your response exactly as follows:
**Response**: [Realistic | Unrealistic | Doubtful]
**Why**: [Concise explanation of your reasoning. If Unrealistic, specify the exact objects, values or relationships that break realism.]
</output_format>
<examples>
Example 1:
Input:
<domain_model>
class Person
attributes
age: Integer
end
class Pet
attributes
name: String
end
association Ownership between
Person [1] role owner
Pet [*] role pets
end
</domain_model>
<object_model>
!new Person('p1')
!p1.age := 250
!new Pet('pet1')
!pet1.name := 'Luna'
… 1.000 more pets creation …
!pet1000.name := 'Max'
!insert (p1, pet1) into Ownership
…1.000 more pets associated with p1 …
!insert (p1, pet1000) into Ownership
</object_model>
Output:
**Response**: Unrealistic
**Why**: The object 'p1' of class 'Person' has an age of 250, which exceeds the biologically plausible lifespan of a human. Although it is not plausible that 1 same person owns 1.000 pets.
Example 2:
Input:
<domain_model>
class Car
attributes
brand: String
end
class Person
attributes
name: String
end
association Ownership between
Person [1] role owner
Car [*] role cars
end
</domain_model>
<object_model>
!new Person('p1')
!p1.age := 19
!new Car('c1')
!c1.brand := 'Toyota'
!insert (p1, c1) into Ownership
</object_model>
Output:
**Response**: Realistic
**Why**: The object 'c1' has a valid, recognized real-world car brand assigned, and its plausible that a teenager has only one car.
Example 3:
Input:
<domain_model>
class Component
attributes
setting_val: Integer
config_mode: String
end
</domain_model>
<object_model>
!new Component('c1')
!c1.setting_val := 8080
!c1.config_mode := 'Legacy'
</object_model>
Output:
**Response**: Doubtful
**Why**: The class 'Component' and attribute 'setting_val' are generic and lack specific real-world semantic context. Without knowing what specific physical or software system this represents, it is impossible to determine if '8080' is a realistic value or an outlier.
</examples> <domain_model>
model PickupNet
enum ShipmentStatus {
NEW,
ASSIGNED,
UNDERWAY,
DELIVERED
}
class Driver
attributes
id : String
name : String
end
class Shipment
attributes
id : String
status : ShipmentStatus
end
class Address
attributes
text : String
end
class GeoLocation
attributes
latitude : Real
longitude : Real
end
class Station
end
class Customer
attributes
id : String
name : String
twitterUserName : String
end
association DriverShipment between
Driver [0..1] role driver
Shipment [0..*] role assignments
end
association ShipmentContainsPickUpAddress between
Shipment [0..*] role hasPickUpShipment
Address [1] role pickUpAddress
end
association ShipmentContainsDeliveryAddress between
Shipment [0..*] role hasDeliveryShipment
Address [1] role shipToAddress
end
composition AddressContainsGeoLocation between
Address [1]
GeoLocation [1] role geoLocation
end
composition CustomerConsistsOfShipment between
Customer [1] role orderer
Shipment [0..*] role orders
end
composition StationContainsCustomer between
Station [1]
Customer [0..*] role customers
end
association StationShipment between
Station [1]
Shipment [0..*] role shipments
end
composition StationContainsDriver between
Station [1]
Driver [0..*] role drivers
end
constraints
context Shipment inv uniqueShipmentId:
Shipment.allInstances->isUnique(s | s.id)
context Driver inv uniqueDriverId:
Driver.allInstances->isUnique(d | d.id)
context Customer inv uniqueCustomerId:
Customer.allInstances->isUnique(c | c.id)
context Shipment inv differentPickupAndDeliveryAddress:
self.pickUpAddress <> self.shipToAddress
</domain_model>
<object_model>
!new Driver('driver56')
!driver56.id := 'D056'
!driver56.name := 'Scott Lang'
!new Driver('driver57')
!driver57.id := 'D057'
!driver57.name := 'Hope van Dyne'
!new Shipment('shipment59')
!shipment59.id := 'SHP12403'
!shipment59.status := #ASSIGNED
!new Shipment('shipment60')
!shipment60.id := 'SHP12404'
!shipment60.status := #UNDERWAY
!new Address('address59')
!address59.text := '5757 Quantum Realm Ln, San Francisco'
!new Address('address60')
!address60.text := '5858 Pym Tech Dr, San Francisco'
!new GeoLocation('geoLocation59')
!geoLocation59.latitude := 37.7749
!geoLocation59.longitude := -122.4194
!new GeoLocation('geoLocation60')
!geoLocation60.latitude := 37.7840
!geoLocation60.longitude := -122.4080
!insert (address59, geoLocation59) into AddressContainsGeoLocation
!insert (address60, geoLocation60) into AddressContainsGeoLocation
!new Customer('customer59')
!customer59.id := 'CUST059'
!customer59.name := 'Hank Pym'
!customer59.twitterUserName := '@originalantman'
!new Customer('customer60')
!customer60.id := 'CUST060'
!customer60.name := 'Janet van Dyne'
!customer60.twitterUserName := '@waspqueen'
!new Station('station30')
!insert (station30, customer59) into StationContainsCustomer
!insert (station30, customer60) into StationContainsCustomer
!insert (station30, shipment59) into StationShipment
!insert (station30, shipment60) into StationShipment
!insert (station30, driver56) into StationContainsDriver
!insert (station30, driver57) into StationContainsDriver
!insert (shipment59, address59) into ShipmentContainsPickUpAddress
!insert (shipment59, address60) into ShipmentContainsDeliveryAddress
!insert (shipment60, address60) into ShipmentContainsPickUpAddress
!insert (shipment60, address59) into ShipmentContainsDeliveryAddress
!insert (customer59, shipment59) into CustomerConsistsOfShipment
!insert (customer60, shipment60) into CustomerConsistsOfShipment
!insert (driver57, shipment59) into DriverShipment
</object_model> Shipment.status
Evenness (active groups) =
1.0000
Evenness (all groups) = 0.5000
LLM as a Judge
Unrealistic
The shipment 'shipment60' has its status set to '#UNDERWAY' (indicating it is currently in transit), but there is no 'Driver' assigned to it via the 'DriverShipment' association. In a real-world delivery system, an active physical transit requires an assigned driver or carrier.
Metrics
Stats
Stats Breakdown of the total cost and elapsed time for generating the instances. - Elapsed Time = Console Time (ie. Processing Time + API Calls)
- Cost = (input tokens * input price) + (output tokens * output price)
Stats
Breakdown of the total cost and elapsed time for generating the instances.
- Elapsed Time = Console Time (ie. Processing Time + API Calls)
- Cost = (input tokens * input price) + (output tokens * output price)
| Total Cost | $0.05 |
Validation
Validation Measures the correctness of the instantiation using the USE check function. - Syntax = 1 - (Total Number of syntax errors [use check] / Total Number of lines [instance])
- Multiplicities = 1 - (Total Number of multiplicities errors [use check] / Total Number of relationships ([instance] !insert))
- Invariants = 1 - (Total Number of invariants errors [use check] / Total Number of invariants ([model] constraints))
Validation
Measures the correctness of the instantiation using the USE check function.
- Syntax = 1 - (Total Number of syntax errors [use check] / Total Number of lines [instance])
- Multiplicities = 1 - (Total Number of multiplicities errors [use check] / Total Number of relationships ([instance] !insert))
- Invariants = 1 - (Total Number of invariants errors [use check] / Total Number of invariants ([model] constraints))
| Syntax | 0/46 |
| Multiplicities | 0/15 |
| Invariants | 0/4 |
Diversity
Diversity Measures the variability of the generated instances. Attributes (NumericEquals, StringEquals, StringLv): It identifies how much the LLM repeats specific values versus generating unique data points across instances (100%: Diverse, 0%: Repetitive). We group all generated attributes into bags (numeric and string) and then perform pairwise comparisons between every element to obtain. Structure (GED): Measures the Graph Edit Distance (GED) similarity between instances. Distribution (Shannon): Measures the entropy and evenness (balanced distribution) of the generated enum values. - NumericEquals = Total number of numeric attribute pairs with different values / Total number of possible pairs (n * (n - 1) / 2)
- StringEquals = Total number of string attribute pairs that are NOT exactly identical / Total number of possible pairs (n * (n - 1) / 2)
- StringLv = Sum of (Levenshtein Distance(a, b) / max(length(a), length(b))) for all string pairs / Total number of possible pairs (n * (n - 1) / 2)
- GED = Similarity = 1 - (GED / (0.5 * (GED_to_empty_A + GED_to_empty_B))). 1 = red = identical graphs, <=0.5 = green = different graphs. We consider as edit operations: Nodes, Edges, Node_Labels and Edge_Labels [https://github.com/a-coman/ged]
- Shannon (Active) = Entropy / log2(Number of unique groups actually generated). Measures how evenly the generated values are distributed, considering only the categories the LLM actually used.
- Shannon (All) = Entropy / log2(Total number of valid groups defined in the model). Measures how evenly the generated values are distributed against the full spectrum of all possible valid options defined in the .use file.
Diversity
Measures the variability of the generated instances. Attributes (NumericEquals, StringEquals, StringLv): It identifies how much the LLM repeats specific values versus generating unique data points across instances (100%: Diverse, 0%: Repetitive). We group all generated attributes into bags (numeric and string) and then perform pairwise comparisons between every element to obtain. Structure (GED): Measures the Graph Edit Distance (GED) similarity between instances. Distribution (Shannon): Measures the entropy and evenness (balanced distribution) of the generated enum values.
- NumericEquals = Total number of numeric attribute pairs with different values / Total number of possible pairs (n * (n - 1) / 2)
- StringEquals = Total number of string attribute pairs that are NOT exactly identical / Total number of possible pairs (n * (n - 1) / 2)
- StringLv = Sum of (Levenshtein Distance(a, b) / max(length(a), length(b))) for all string pairs / Total number of possible pairs (n * (n - 1) / 2)
- GED = Similarity = 1 - (GED / (0.5 * (GED_to_empty_A + GED_to_empty_B))). 1 = red = identical graphs, <=0.5 = green = different graphs. We consider as edit operations: Nodes, Edges, Node_Labels and Edge_Labels [https://github.com/a-coman/ged]
- Shannon (Active) = Entropy / log2(Number of unique groups actually generated). Measures how evenly the generated values are distributed, considering only the categories the LLM actually used.
- Shannon (All) = Entropy / log2(Total number of valid groups defined in the model). Measures how evenly the generated values are distributed against the full spectrum of all possible valid options defined in the .use file.
| Numeric | 100.0% |
| String Equals | 100.0% |
| String LV | 89.4% |
| Shannon (Active) | 1.000 ± 0.000 |
| Shannon (All) | 0.500 ± 0.000 |
Coverage
Model Coverage Measures the breadth of the instantiation. It answers: "How much of the structural blueprint (the model) was used?" - Classes = Total Unique Classes instantiated (!new) in the .soil / Total Number of classes (class) in the model .use
- Attributes = Total Unique Attributes instantiated (!Class.Attribute or !set) in the .soil / Total Number of attributes (attributes) in the model .use
- Relationships = Total Unique Relationships instantiated (!insert) in the .soil / Total Number of relationships (association, composition, aggregation) in the model .use
Model Coverage
Measures the breadth of the instantiation. It answers: "How much of the structural blueprint (the model) was used?"
- Classes = Total Unique Classes instantiated (!new) in the .soil / Total Number of classes (class) in the model .use
- Attributes = Total Unique Attributes instantiated (!Class.Attribute or !set) in the .soil / Total Number of attributes (attributes) in the model .use
- Relationships = Total Unique Relationships instantiated (!insert) in the .soil / Total Number of relationships (association, composition, aggregation) in the model .use
| Classes | 100.0% |
| Attributes | 100.0% |
| Relationships | 100.0% |
Instantiation
Instance Instantiation Measures the depth or density of the data. It answers: "Of the objects the LLM decided to create, how many of their available 'slots' did it fill?" - Classes = Total Number of classes (!new) in the instance / Total possible that could have been instantiated (infinity)
- Attributes = Total Number of attributes (!Class.Attribute or !set) in the instance / Total possible that could have been instantiated (sum(number of classes instantiated of that type * Class.Attributes))
- Relationships = Total Number of relationships (!insert) in the instance / Total possible that could have been instantiated (infinity)
Instance Instantiation
Measures the depth or density of the data. It answers: "Of the objects the LLM decided to create, how many of their available 'slots' did it fill?"
- Classes = Total Number of classes (!new) in the instance / Total possible that could have been instantiated (infinity)
- Attributes = Total Number of attributes (!Class.Attribute or !set) in the instance / Total possible that could have been instantiated (sum(number of classes instantiated of that type * Class.Attributes))
- Relationships = Total Number of relationships (!insert) in the instance / Total possible that could have been instantiated (infinity)
| Classes | 11/∞ |
| Attributes | 20/20 |
| Relationships | 15/∞ |