AlignPose is a multi-view 6D object pose estimation method. It accepts a set of RGB images captured from multiple viewpoints with known camera poses and a set of 3D object models as input. Its primary goal is to estimate the 6D poses of all object instances within the scene. The approach consists of three steps: first, it extracts single-view pose candidates from all viewpoints using a single-view estimator. These candidates are then aggregated into a common coordinate system, where non-maximum suppression (NMS) removes redundancies. Finally, a novel multi-view feature-metric refinement is applied to ensure the resulting poses are consistent across all views.