---------------------------------------------
The current Pfam-to-SCOP mapping file is based on a minimum agreement of 0.75 between SCOP and Pfam (intersection/union),
    excluding multi-chain SCOP entries. SCOP domains covering complete chains were assumed to always be above the threshold (if on the right chain).


The current Pfam-to-Superfamily (SSF) mapping file is ~/p/uniref90_keywords/uniprot_xml_keywords/pfam__annotation_stats/pfam2ssf_agreement_thresh0.5.txt, based on a minimum agreement of 0.5 between Pfam and SSF (intersection/union).
   Agreement was calculated for each protein having both the Pfam and the SSF signature (based on the protein2ipr.dat file downloaded in Feb 2008) and averaged across all instances of each Pfam-SSF pair.
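
The intersection/union agreement described above can be sketched as follows. This is only an illustration, not the script that produced the mapping files: it assumes each signature is a single contiguous, inclusive residue range (start, end) on the same protein, and the function names are hypothetical.

```python
def interval_agreement(a, b):
    """Intersection over union of two inclusive residue ranges (start, end).

    Assumption: each domain is one contiguous range; discontinuous
    domains would need a set-based variant.
    """
    (a_start, a_end), (b_start, b_end) = a, b
    # Number of residues shared by both ranges (0 if they do not overlap).
    overlap = max(0, min(a_end, b_end) - max(a_start, b_start) + 1)
    # Residues covered by at least one of the two ranges.
    union = (a_end - a_start + 1) + (b_end - b_start + 1) - overlap
    return overlap / union


def mean_agreement(instances):
    """Average the agreement over all (range_a, range_b) instances of one pair."""
    scores = [interval_agreement(a, b) for a, b in instances]
    return sum(scores) / len(scores)


def passes_threshold(instances, threshold):
    """True if the averaged agreement reaches the minimum threshold (e.g. 0.5 or 0.75)."""
    return mean_agreement(instances) >= threshold
```

For example, two ranges (1, 100) and (51, 150) share 50 residues out of 150 covered, giving an agreement of 1/3, which would fall below both thresholds mentioned above.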


Legend:
 1.  relation [ XXXXXXX ] : YYYYYYY %LINKAGE (=|existing_edges|/(|cluster1|*|cluster2|)) ProtoLevel
     where relation is the tree relatedness of YYYYYYY to the best cluster XXXXXXX (sibling, parent, or given),
     %LINKAGE is the proportion of existing BLAST edges out of those possible (i.e. the sparsity level of the cluster),
     and the size of the cluster is |cluster1| + |cluster2|.
 2.  best keyword for cluster C is K with Jaccard = J [ TP FP TN FN ] Specificity Sensitivity
---------------------------------------------
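
The legend quantities can be recomputed from the numbers printed in each record. The sketch below is an assumption-laden illustration rather than the original code: the function names are hypothetical, and the formulas for the last two columns are inferred from the printed values. In particular, the column labelled "Specificity" matches TP/(TP+FP) in the records (what is usually called precision), and TN does not enter any of the three scores.

```python
def linkage(existing_edges, size1, size2):
    """Fraction of possible edges between the two merged sub-clusters that exist."""
    return existing_edges / (size1 * size2)


def keyword_scores(tp, fp, tn, fn):
    """Return (Jaccard, 'Specificity' as printed, Sensitivity) for a keyword/cluster pair.

    tn is accepted only to mirror the [ TP FP TN FN ] layout; it is not used.
    """
    jaccard = tp / (tp + fp + fn)
    printed_specificity = tp / (tp + fp)  # inferred: equals precision
    sensitivity = tp / (tp + fn)
    return jaccard, printed_specificity, sensitivity


# Values taken from record ( 1 ): parent 6690234, =8709/(293*305)
parent_linkage = linkage(8709, 293, 305)

# Values taken from record ( 7 ): cluster 6735540, keyword PF00023
j, spec, sens = keyword_scores(3381, 1032, 1095101, 697)
```

With the record ( 7 ) counts this reproduces the printed Jaccard of 0.6616 and the two trailing columns 0.7661 and 0.8291, which is what suggests the interpretation of the "Specificity" column given above.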


-------------------====== ( 1 ) 6690234_PF00186_PF00303            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00303  is 6548420 with Jaccard = 1.0000	|PF00303|=265 [ 265	0	1099946	0 ]
 parent [ 6548420 ] : 6690234 0.0974543 (=8709/(293*305)) 90.3238
  given [ 6548420 ] : 6548420 0.645833 (=930/(5*288)) 37.7485
	best keyword for cluster 6548420 is PF00303 with Jaccard = 1.0000	[ 265	0	1099946	0 ]	1.0000 1.0000
sibling [ 6548420 ] : 6635680 0.299342 (=91/(1*304)) 75.9263
	best keyword for cluster 6635680 is PF00186 with Jaccard = 0.8836	[ 281	0	1099893	37 ]	1.0000 0.8836

	SUGGESTING RELATEDNESS OF:   
	A> PF00303 ( PF00303   Thymidylate synthase )
	B> PF00186 ( PF00186   Dihydrofolate reductase )

	Neither A nor B are assigned a clan.

	the two keywords coincide on UniRef90 proteins: |PF00186| = 318 ,	 |PF00303| = 265 , 	|PF00186^PF00303| = 32	( 10.1% and 12.1% )

	both PF00303 and PF00186 have PDB structures
		PF00186 	   c.71.1.1

	SUPERFAM mapping significantly overlapping:
     1	PF00303 SSF55831 0.980 (average over 946 mutual instances, PF00303 949 appearances, SSF55831 1043 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 2 ) 6734619_PF00342_PF00923            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00342  is 6723612 with Jaccard = 1.0000	|PF00342|=340 [ 340	0	1099871	0 ]
 parent [ 6723612 ] : 6734619 0.0281211 (=3859/(364*377)) 97.2335
  given [ 6723612 ] : 6723612 0.0415525 (=182/(365*12)) 95.9425
	best keyword for cluster 6723612 is PF00342 with Jaccard = 1.0000	[ 340	0	1099871	0 ]	1.0000 1.0000
sibling [ 6723612 ] : 6692021 0.0958333 (=138/(360*4)) 90.6723
	best keyword for cluster 6692021 is PF00923 with Jaccard = 0.9701	[ 325	0	1099876	10 ]	1.0000 0.9701

	SUGGESTING RELATEDNESS OF:   
	A> PF00342 ( PF00342   Phosphoglucose isomerase )
	B> PF00923 ( PF00923   Transaldolase )

	Only A has a clan ( CL0067.7 ).

	the two keywords coincide on UniRef90 proteins: |PF00342| = 340 ,	 |PF00923| = 335 , 	|PF00342^PF00923| = 10	( 2.9% and 3.0% )

	both PF00342 and PF00923 have PDB structures
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 3 ) 6561262_PF00434_PF05868            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00434  is 6528497 with Jaccard = 1.0000	|PF00434|=45 [ 45	0	1100166	0 ]
 parent [ 6528497 ] : 6561262 0.613043 (=141/(5*46)) 47.982
  given [ 6528497 ] : 6528497 0.755556 (=34/(1*45)) 25.0067
	best keyword for cluster 6528497 is PF00434 with Jaccard = 1.0000	[ 45	0	1100166	0 ]	1.0000 1.0000
sibling [ 6528497 ] : 6284839 1 (=4/(1*4)) 1.00235e-10
	best keyword for cluster 6284839 is PF05868 with Jaccard = 1.0000	[ 5	0	1100206	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF00434 ( PF00434   Glycoprotein VP7 )
	B> PF05868 ( PF05868   Rotavirus major outer capsid protein VP7 )

	they come from the same clan: CL0217.4 : PF05868	PF00434
	the two keywords do not coincide on UniRef90 proteins

	Neither PF00434 nor PF05868 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 4 ) 6735955_PF00509_PF04369            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00509  is 6533060 with Jaccard = 1.0000	|PF00509|=86 [ 86	0	1100125	0 ]
 parent [ 6533060 ] : 6735955 0.0492424 (=26/(88*6)) 97.3689
  given [ 6533060 ] : 6533060 0.737255 (=188/(3*85)) 27.7465
	best keyword for cluster 6533060 is PF00509 with Jaccard = 1.0000	[ 86	0	1100125	0 ]	1.0000 1.0000
sibling [ 6533060 ] : 6698780 0.125 (=1/(2*4)) 92
	best keyword for cluster 6698780 is PF04369 with Jaccard = 1.0000	[ 3	0	1100208	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF00509 ( PF00509   Hemagglutinin )
	B> PF04369 ( PF04369   Lactococcin-like family )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF00509 has a PDB structure (may not be up to date)
	PF00509 	   b.19.1.2  h.3.1.1  j.79.1.1
	
	SUPERFAM mapping significantly overlapping:
     1	PF00509 SSF49818 0.772 (average over 12960 mutual instances, PF00509 12960 appearances, SSF49818 13846 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 5 ) 6746863_PF00527_PF02703            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00527  is 6670369 with Jaccard = 1.0000	|PF00527|=114 [ 114	0	1100097	0 ]
 parent [ 6670369 ] : 6746863 0.0252395 (=137/(118*46)) 98.3783
  given [ 6670369 ] : 6670369 0.157895 (=72/(114*4)) 85.6465
	best keyword for cluster 6670369 is PF00527 with Jaccard = 1.0000	[ 114	0	1100097	0 ]	1.0000 1.0000
sibling [ 6670369 ] : 6720998 0.0444444 (=2/(1*45)) 95.5556
	best keyword for cluster 6720998 is PF02703 with Jaccard = 0.9744	[ 38	0	1100172	1 ]	1.0000 0.9744

	SUGGESTING RELATEDNESS OF:   
	A> PF00527 ( PF00527   E7 protein, Early protein )
	B> PF02703 ( PF02703   Early E1A protein )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF00527 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 6 ) 6764184_PF00677_PF06534            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00677  is 6469508 with Jaccard = 1.0000	|PF00677|=292 [ 292	0	1099919	0 ]
 parent [ 6469508 ] : 6764184 0.00781752 (=73/(322*29)) 99.4743
  given [ 6469508 ] : 6469508 0.967607 (=926/(3*319)) 3.6831
	best keyword for cluster 6469508 is PF00677 with Jaccard = 1.0000	[ 292	0	1099919	0 ]	1.0000 1.0000
sibling [ 6469508 ] : 6759748 0.0357143 (=1/(1*28)) 99.25
	best keyword for cluster 6759748 is PF06534 with Jaccard = 1.0000	[ 19	0	1100192	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF00677 ( PF00677   Lumazine binding domain )
	B> PF06534 ( PF06534   Repulsive guidance molecule (RGM) C-terminus )

	Only A has a clan ( CL0076.7 ).
	the two keywords do not coincide on UniRef90 proteins

	only  PF00677 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 7 ) 6737306_PF00023_PF00710            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00710  is 6733339 with Jaccard = 1.0000	|PF00710|=234 [ 234	0	1099977	0 ]
 parent [ 6733339 ] : 6737306 0.0271625 (=37869/(263*5301)) 97.5146
  given [ 6733339 ] : 6733339 0.0350195 (=54/(257*6)) 97.0906
	best keyword for cluster 6733339 is PF00710 with Jaccard = 1.0000	[ 234	0	1099977	0 ]	1.0000 1.0000
sibling [ 6733339 ] : 6735540 0.0283899 (=40585/(285*5016)) 97.3283
	best keyword for cluster 6735540 is PF00023 with Jaccard = 0.6616	[ 3381	1032	1095101	697 ]	0.7661 0.8291

	SUGGESTING RELATEDNESS OF:   
	A> PF00710 ( PF00710   Asparaginase )
	B> PF00023 ( PF00023   Ankyrin repeat )

	Neither A nor B are assigned a clan.

	the two keywords coincide on UniRef90 proteins: |PF00023| = 4078 ,	 |PF00710| = 234 , 	|PF00023^PF00710| = 17	( 0.4% and 7.3% )

	both PF00710 and PF00023 have PDB structures
	PF00710 	   c.88.1.1
	PF00023 	   d.211.1.1  i.11.1.1

	SUPERFAM mapping significantly overlapping:
     1	PF00710 SSF53774 0.964 (average over 850 mutual instances, PF00710 892 appearances, SSF53774 893 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 8 ) 6739214_PF00747_PF06261            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00747  is 6546329 with Jaccard = 1.0000	|PF00747|=37 [ 37	0	1100174	0 ]
 parent [ 6546329 ] : 6739214 0.0304054 (=9/(37*8)) 97.7051
  given [ 6546329 ] : 6546329 0.694444 (=25/(1*36)) 36.0094
	best keyword for cluster 6546329 is PF00747 with Jaccard = 1.0000	[ 37	0	1100174	0 ]	1.0000 1.0000
sibling [ 6546329 ] : 6714599 0.0666667 (=1/(3*5)) 94.6667
	best keyword for cluster 6714599 is PF06261 with Jaccard = 1.0000	[ 3	0	1100208	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF00747 ( PF00747   ssDNA binding protein )
	B> PF06261 ( PF06261   Actinobacillus actinomycetemcomitans leukotoxin activator LktC )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF00747 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 9 ) 6715673_PF00815_PF01502            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00815  is 6472857 with Jaccard = 1.0000	|PF00815|=275 [ 275	0	1099936	0 ]
 parent [ 6472857 ] : 6715673 0.0524554 (=5815/(298*372)) 94.8236
  given [ 6472857 ] : 6472857 0.959596 (=285/(1*297)) 4.23058
	best keyword for cluster 6472857 is PF00815 with Jaccard = 1.0000	[ 275	0	1099936	0 ]	1.0000 1.0000
sibling [ 6472857 ] : 6597974 0.396393 (=12090/(250*122)) 60.5218
	best keyword for cluster 6597974 is PF01502 with Jaccard = 0.6435	[ 231	110	1099852	18 ]	0.6774 0.9277

	SUGGESTING RELATEDNESS OF:   
	A> PF00815 ( PF00815   Histidinol dehydrogenase )
	B> PF01502 ( PF01502   Phosphoribosyl-AMP cyclohydrolase )

	Only A has a clan ( CL0099.8 ).

	the two keywords coincide on UniRef90 proteins: |PF00815| = 275 ,	 |PF01502| = 249 , 	|PF00815^PF01502| = 18	( 6.5% and 7.2% )

	both PF00815 and PF01502 have PDB structures
	PF00815 	   c.82.1.2
	
	SUPERFAM mapping significantly overlapping:
     1	PF00815 SSF53720 0.961 (average over 870 mutual instances, PF00815 872 appearances, SSF53720 10501 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 10 ) 6676446_PF00252_PF00826            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00826  is 6512604 with Jaccard = 1.0000	|PF00826|=81 [ 81	0	1100130	0 ]
 parent [ 6512604 ] : 6676446 0.145481 (=3818/(81*324)) 87.3159
  given [ 6512604 ] : 6512604 0.835443 (=132/(2*79)) 16.6684
	best keyword for cluster 6512604 is PF00826 with Jaccard = 1.0000	[ 81	0	1100130	0 ]	1.0000 1.0000
sibling [ 6512604 ] : 6536208 0.762422 (=491/(2*322)) 29.7211
	best keyword for cluster 6536208 is PF00252 with Jaccard = 0.9967	[ 298	0	1099912	1 ]	1.0000 0.9967

	SUGGESTING RELATEDNESS OF:   
	A> PF00826 (  )
	B> PF00252 ( PF00252   Ribosomal protein L16p/L10e )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF00826 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:
     1	PF00252 SSF54686 0.963 (average over 1521 mutual instances, PF00252 1523 appearances, SSF54686 1907 appearances)
     2	PF00826 SSF54686 0.973 (average over 383 mutual instances, PF00826 384 appearances, SSF54686 1907 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 11 ) 6753863_PF00576_PF01014            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01014  is 6397254 with Jaccard = 1.0000	|PF01014|=56 [ 56	0	1100155	0 ]
 parent [ 6397254 ] : 6753863 0.0110909 (=110/(57*174)) 98.8959
  given [ 6397254 ] : 6397254 1 (=260/(5*52)) 0.00586315
	best keyword for cluster 6397254 is PF01014 with Jaccard = 1.0000	[ 56	0	1100155	0 ]	1.0000 1.0000
sibling [ 6397254 ] : 6727672 0.0365636 (=246/(58*116)) 96.444
	best keyword for cluster 6727672 is PF00576 with Jaccard = 0.9904	[ 103	0	1100107	1 ]	1.0000 0.9904

	SUGGESTING RELATEDNESS OF:   
	A> PF01014 ( PF01014   Uricase )
	B> PF00576 ( PF00576   HIUase/Transthyretin family )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	both PF01014 and PF00576 have PDB structures
		
	SUPERFAM mapping significantly overlapping:
     1	PF00576 SSF49472 0.961 (average over 322 mutual instances, PF00576 322 appearances, SSF49472 324 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 12 ) 6760276_PF01019_PF01112            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01019  is 6640942 with Jaccard = 1.0000	|PF01019|=345 [ 345	0	1099866	0 ]
 parent [ 6640942 ] : 6760276 0.0118467 (=787/(384*173)) 99.2777
  given [ 6640942 ] : 6640942 0.228947 (=348/(4*380)) 77.3229
	best keyword for cluster 6640942 is PF01019 with Jaccard = 1.0000	[ 345	0	1099866	0 ]	1.0000 1.0000
sibling [ 6640942 ] : 6754308 0.0116279 (=2/(1*172)) 98.9256
	best keyword for cluster 6754308 is PF01112 with Jaccard = 0.9872	[ 154	0	1100055	2 ]	1.0000 0.9872

	SUGGESTING RELATEDNESS OF:   
	A> PF01019 ( PF01019   Gamma-glutamyltranspeptidase )
	B> PF01112 ( PF01112   Asparaginase )

	they come from the same clan: CL0052.11 : PF00227	PF03577	PF01804	PF01019	PF00310	PF02275	PF01112	PF03417
	the two keywords do not coincide on UniRef90 proteins

	only  PF01019 has a PDB structure (may not be up to date)
		PF01112 	   d.153.1.5

	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 13 ) 6733689_PF01117_PF03318            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01117  is 6713880 with Jaccard = 1.0000	|PF01117|=22 [ 22	0	1100189	0 ]
 parent [ 6713880 ] : 6733689 0.0365226 (=71/(27*72)) 97.1339
  given [ 6713880 ] : 6713880 0.0545455 (=6/(22*5)) 94.5455
	best keyword for cluster 6713880 is PF01117 with Jaccard = 1.0000	[ 22	0	1100189	0 ]	1.0000 1.0000
sibling [ 6713880 ] : 6729888 0.0378378 (=49/(37*35)) 96.7126
	best keyword for cluster 6729888 is PF03318 with Jaccard = 0.7143	[ 5	2	1100204	0 ]	0.7143 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF01117 ( PF01117   Aerolysin toxin )
	B> PF03318 ( PF03318   Clostridium epsilon toxin ETX/Bacillus mosquitocidal toxin MTX2 )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	both PF01117 and PF03318 have PDB structures
	PF01117 	   f.8.1.1
	PF03318 	   f.8.1.2

	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 14 ) 6735800_PF01194_PF05864            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01194  is 6638420 with Jaccard = 1.0000	|PF01194|=56 [ 56	0	1100155	0 ]
 parent [ 6638420 ] : 6735800 0.05 (=30/(10*60)) 97.3523
  given [ 6638420 ] : 6638420 0.336207 (=39/(2*58)) 76.5962
	best keyword for cluster 6638420 is PF01194 with Jaccard = 1.0000	[ 56	0	1100155	0 ]	1.0000 1.0000
sibling [ 6638420 ] : 6249523 1 (=16/(2*8)) 2.3064e-13
	best keyword for cluster 6249523 is PF05864 with Jaccard = 1.0000	[ 10	0	1100201	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF01194 ( PF01194   RNA polymerases N / 8 kDa subunit )
	B> PF05864 ( PF05864   Chordopoxvirus DNA-directed RNA polymerase 7 kDa polypeptide (RPO7) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF01194 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:
     1	PF01194 SSF46924 0.927 (average over 147 mutual instances, PF01194 149 appearances, SSF46924 149 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 15 ) 6749383_PF01219_PF01569            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01219  is 6610135 with Jaccard = 1.0000	|PF01219|=164 [ 164	0	1100047	0 ]
 parent [ 6610135 ] : 6749383 0.0174943 (=3306/(186*1016)) 98.5736
  given [ 6610135 ] : 6610135 0.367568 (=68/(1*185)) 66.8222
	best keyword for cluster 6610135 is PF01219 with Jaccard = 1.0000	[ 164	0	1100047	0 ]	1.0000 1.0000
sibling [ 6610135 ] : 6737017 0.0366534 (=1189/(33*983)) 97.486
	best keyword for cluster 6737017 is PF01569 with Jaccard = 0.8878	[ 831	2	1099275	103 ]	0.9976 0.8897

	SUGGESTING RELATEDNESS OF:   
	A> PF01219 ( PF01219   Prokaryotic diacylglycerol kinase )
	B> PF01569 ( PF01569   PAP2 superfamily )

	Neither A nor B are assigned a clan.

	the two keywords coincide on UniRef90 proteins: |PF01219| = 164 ,	 |PF01569| = 934 , 	|PF01219^PF01569| = 14	( 8.5% and 1.5% )

	only  PF01219 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:
     1	PF01569 SSF48317 0.620 (average over 2481 mutual instances, PF01569 2537 appearances, SSF48317 2657 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 16 ) 6703534_PF00539_PF01254            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01254  is 6350147 with Jaccard = 1.0000	|PF01254|=8 [ 8	0	1100203	0 ]
 parent [ 6350147 ] : 6703534 0.0990763 (=665/(8*839)) 92.8234
  given [ 6350147 ] : 6350147 1 (=12/(2*6)) 5.00009e-06
	best keyword for cluster 6350147 is PF01254 with Jaccard = 1.0000	[ 8	0	1100203	0 ]	1.0000 1.0000
sibling [ 6350147 ] : 6699902 0.11223 (=468/(5*834)) 92.1622
	best keyword for cluster 6699902 is PF00539 with Jaccard = 0.9987	[ 743	0	1099467	1 ]	1.0000 0.9987

	SUGGESTING RELATEDNESS OF:   
	A> PF01254 ( PF01254   Nuclear transition protein 2 )
	B> PF00539 ( PF00539   Transactivating regulatory protein (Tat) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF01254 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 17 ) 6754889_PF01102_PF01401            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01401  is 6713308 with Jaccard = 1.0000	|PF01401|=62 [ 62	0	1100149	0 ]
 parent [ 6713308 ] : 6754889 0.0124533 (=30/(73*33)) 98.9624
  given [ 6713308 ] : 6713308 0.0555556 (=4/(1*72)) 94.456
	best keyword for cluster 6713308 is PF01401 with Jaccard = 1.0000	[ 62	0	1100149	0 ]	1.0000 1.0000
sibling [ 6713308 ] : 6747216 0.03125 (=1/(1*32)) 98.4062
	best keyword for cluster 6747216 is PF01102 with Jaccard = 0.9643	[ 27	0	1100183	1 ]	1.0000 0.9643

	SUGGESTING RELATEDNESS OF:   
	A> PF01401 ( PF01401   Angiotensin-converting enzyme )
	B> PF01102 ( PF01102   Glycophorin A )

	Only A has a clan ( CL0126.12 ).
	the two keywords do not coincide on UniRef90 proteins

	both PF01401 and PF01102 have PDB structures
		PF01102 	   j.35.1.1

	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 18 ) 6737568_PF00844_PF01489            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01489  is 6607303 with Jaccard = 1.0000	|PF01489|=72 [ 72	0	1100139	0 ]
 parent [ 6607303 ] : 6737568 0.0346004 (=365/(77*137)) 97.5423
  given [ 6607303 ] : 6607303 0.394737 (=30/(1*76)) 65.4621
	best keyword for cluster 6607303 is PF01489 with Jaccard = 1.0000	[ 72	0	1100139	0 ]	1.0000 1.0000
sibling [ 6607303 ] : 6729233 0.0882353 (=12/(1*136)) 96.6397
	best keyword for cluster 6729233 is PF00844 with Jaccard = 1.0000	[ 116	0	1100095	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF01489 ( PF01489   Geminivirus nuclear export factor BR1 )
	B> PF00844 ( PF00844   Geminivirus coat protein )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF01489 nor PF00844 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 19 ) 6619151_PF00429_PF01611            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01611  is 6038248 with Jaccard = 1.0000	|PF01611|=10 [ 10	0	1100201	0 ]
 parent [ 6038248 ] : 6619151 0.365891 (=472/(10*129)) 70.0204
  given [ 6038248 ] : 6038248 1 (=16/(2*8)) 8.59438e-31
	best keyword for cluster 6038248 is PF01611 with Jaccard = 1.0000	[ 10	0	1100201	0 ]	1.0000 1.0000
sibling [ 6038248 ] : 6611450 0.343548 (=213/(124*5)) 67.2264
	best keyword for cluster 6611450 is PF00429 with Jaccard = 0.7042	[ 100	14	1100069	28 ]	0.8772 0.7812

	SUGGESTING RELATEDNESS OF:   
	A> PF01611 ( PF01611   Filovirus glycoprotein )
	B> PF00429 ( PF00429   ENV polyprotein (coat polyprotein) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF01611 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 20 ) 6707990_PF00693_PF01712            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01712  is 6700567 with Jaccard = 1.0000	|PF01712|=135 [ 135	0	1100076	0 ]
 parent [ 6700567 ] : 6707990 0.0793176 (=716/(51*177)) 93.6278
  given [ 6700567 ] : 6700567 0.0976331 (=132/(169*8)) 92.2806
	best keyword for cluster 6700567 is PF01712 with Jaccard = 1.0000	[ 135	0	1100076	0 ]	1.0000 1.0000
sibling [ 6700567 ] : 6515153 0.848837 (=292/(8*43)) 17.8121
	best keyword for cluster 6515153 is PF00693 with Jaccard = 0.8600	[ 43	7	1100161	0 ]	0.8600 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF01712 ( PF01712   Deoxynucleoside kinase )
	B> PF00693 ( PF00693   Thymidine kinase from herpesvirus )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	both PF01712 and PF00693 have PDB structures
	PF01712 	   c.37.1.1
	PF00693 	   c.37.1.1

	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 21 ) 6775595_PF01785_PF04269            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01785  is 6761627 with Jaccard = 1.0000	|PF01785|=37 [ 37	0	1100174	0 ]
 parent [ 6761627 ] : 6775595 0.00214707 (=8/(69*54)) 99.8706
  given [ 6761627 ] : 6761627 0.00649351 (=5/(14*55)) 99.3507
	best keyword for cluster 6761627 is PF01785 with Jaccard = 1.0000	[ 37	0	1100174	0 ]	1.0000 1.0000
sibling [ 6761627 ] : 6765753 0.00601504 (=4/(19*35)) 99.5457
	best keyword for cluster 6765753 is PF04269 with Jaccard = 0.9333	[ 14	1	1100196	0 ]	0.9333 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF01785 ( PF01785   Closterovirus coat protein )
	B> PF04269 ( PF04269   Protein of unknown function, DUF440 )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF01785 has a PDB structure (may not be up to date)
		PF04269 	   d.17.7.1

	SUPERFAM mapping significantly overlapping:
     1	PF04269 SSF102816 0.992 (average over 88 mutual instances, PF04269 88 appearances, SSF102816 88 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 22 ) 6751324_PF01819_PF07279            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01819  is 6668358 with Jaccard = 1.0000	|PF01819|=8 [ 8	0	1100203	0 ]
 parent [ 6668358 ] : 6751324 0.0153846 (=2/(10*13)) 98.72
  given [ 6668358 ] : 6668358 0.16 (=4/(5*5)) 85.0497
	best keyword for cluster 6668358 is PF01819 with Jaccard = 1.0000	[ 8	0	1100203	0 ]	1.0000 1.0000
sibling [ 6668358 ] : 6733487 0.047619 (=2/(7*6)) 97.1072
	best keyword for cluster 6733487 is PF07279 with Jaccard = 1.0000	[ 6	0	1100205	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF01819 ( PF01819   Levivirus coat protein )
	B> PF07279 ( PF07279   Protein of unknown function (DUF1442) )

	Only B has a clan ( CL0102.14 ).
	the two keywords do not coincide on UniRef90 proteins

	only  PF01819 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:
     1	PF01819 SSF55405 0.991 (average over 31 mutual instances, PF01819 31 appearances, SSF55405 34 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 23 ) 6700595_PF01874_PF03802            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01874  is 6637228 with Jaccard = 1.0000	|PF01874|=102 [ 102	0	1100109	0 ]
 parent [ 6637228 ] : 6700595 0.0785047 (=210/(25*107)) 92.2838
  given [ 6637228 ] : 6637228 0.276003 (=674/(33*74)) 76.3039
	best keyword for cluster 6637228 is PF01874 with Jaccard = 1.0000	[ 102	0	1100109	0 ]	1.0000 1.0000
sibling [ 6637228 ] : 6464752 0.974026 (=150/(11*14)) 2.96668
	best keyword for cluster 6464752 is PF03802 with Jaccard = 0.7419	[ 23	0	1100180	8 ]	1.0000 0.7419

	SUGGESTING RELATEDNESS OF:   
	A> PF01874 ( PF01874   ATP:dephospho-CoA triphosphoribosyl transferase )
	B> PF03802 ( PF03802   Apo-citrate lyase phosphoribosyl-dephospho-CoA transferase )

	Neither A nor B are assigned a clan.

	the two keywords coincide on UniRef90 proteins: |PF01874| = 102 ,	 |PF03802| = 31 , 	|PF01874^PF03802| = 8	( 7.8% and 25.8% )
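
Note on the two percentages: they appear to be the shared protein count divided by each family size in turn. The sketch below is an assumption inferred from the numbers on the line above (the helper name `coincidence_percent` is hypothetical), not the code that generated this file:

```python
def coincidence_percent(size_a: int, size_b: int, shared: int) -> tuple[float, float]:
    """Share of each family's proteins that also carry the other keyword, in percent."""
    return 100.0 * shared / size_a, 100.0 * shared / size_b

# |PF01874| = 102, |PF03802| = 31, proteins carrying both = 8
pct_a, pct_b = coincidence_percent(102, 31, 8)
print(f"{pct_a:.1f}% and {pct_b:.1f}%")
```

Under this reading, a small overlap relative to the larger family can still be a sizeable fraction of the smaller one, which is why both percentages are listed.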

	Neither PF01874 nor PF03802 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 24 ) 6708998_PF01907_PF06869            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01907  is 6473568 with Jaccard = 1.0000	|PF01907|=66 [ 66	0	1100145	0 ]
 parent [ 6473568 ] : 6708998 0.0963365 (=71/(67*11)) 93.7936
  given [ 6473568 ] : 6473568 0.959574 (=902/(47*20)) 4.39396
	best keyword for cluster 6473568 is PF01907 with Jaccard = 1.0000	[ 66	0	1100145	0 ]	1.0000 1.0000
sibling [ 6473568 ] : 6541382 0.666667 (=20/(5*6)) 33.3334
	best keyword for cluster 6541382 is PF06869 with Jaccard = 1.0000	[ 9	0	1100202	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF01907 ( PF01907   Ribosomal protein L37e )
	B> PF06869 ( PF06869   Protein of unknown function (DUF1258) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF01907 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:
     1	PF01907 SSF57829 0.973 (average over 178 mutual instances, PF01907 178 appearances, SSF57829 837 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 25 ) 6734323_PF01910_PF07615            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01910  is 6549099 with Jaccard = 1.0000	|PF01910|=106 [ 106	0	1100105	0 ]
 parent [ 6549099 ] : 6734323 0.038214 (=95/(113*22)) 97.204
  given [ 6549099 ] : 6549099 0.643151 (=1878/(40*73)) 38.1115
	best keyword for cluster 6549099 is PF01910 with Jaccard = 1.0000	[ 106	0	1100105	0 ]	1.0000 1.0000
sibling [ 6549099 ] : 6703912 0.142857 (=16/(14*8)) 92.8925
	best keyword for cluster 6703912 is PF07615 with Jaccard = 1.0000	[ 5	0	1100206	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF01910 ( PF01910   Domain of unknown function DUF77 )
	B> PF07615 ( PF07615   YKOF-related Family )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	both PF01910 and PF07615 have PDB structures
	PF01910 	   d.58.48.1
	
	SUPERFAM mapping significantly overlapping:
     1	PF01910 SSF89957 0.936 (average over 259 mutual instances, PF01910 259 appearances, SSF89957 271 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 26 ) 6663981_PF01775_PF01911            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01911  is 6491337 with Jaccard = 1.0000	|PF01911|=17 [ 17	0	1100194	0 ]
 parent [ 6491337 ] : 6663981 0.206437 (=186/(17*53)) 84.0597
  given [ 6491337 ] : 6491337 0.939394 (=62/(6*11)) 8.66675
	best keyword for cluster 6491337 is PF01911 with Jaccard = 1.0000	[ 17	0	1100194	0 ]	1.0000 1.0000
sibling [ 6491337 ] : 6625242 0.286667 (=43/(50*3)) 72.577
	best keyword for cluster 6625242 is PF01775 with Jaccard = 1.0000	[ 49	0	1100162	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF01911 ( PF01911   Ribosomal LX protein )
	B> PF01775 ( PF01775   Ribosomal L18ae protein family )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF01911 nor PF01775 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 27 ) 6676966_PF01917_PF04975            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01917  is 6652451 with Jaccard = 1.0000	|PF01917|=60 [ 60	0	1100151	0 ]
 parent [ 6652451 ] : 6676966 0.135742 (=139/(16*64)) 87.4912
  given [ 6652451 ] : 6652451 0.220275 (=176/(47*17)) 80.81
	best keyword for cluster 6652451 is PF01917 with Jaccard = 1.0000	[ 60	0	1100151	0 ]	1.0000 1.0000
sibling [ 6652451 ] : 6555978 0.615385 (=24/(3*13)) 43.5642
	best keyword for cluster 6555978 is PF04975 with Jaccard = 1.0000	[ 13	0	1100198	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF01917 ( PF01917   Archaebacterial flagellin )
	B> PF04975 ( PF04975   Archaeal flagellar protein G )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF01917 nor PF04975 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 28 ) 6760000_PF01960_PF03576            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01960  is 6316795 with Jaccard = 1.0000	|PF01960|=204 [ 204	0	1100007	0 ]
 parent [ 6316795 ] : 6760000 0.0108387 (=267/(218*113)) 99.2638
  given [ 6316795 ] : 6316795 1 (=217/(1*217)) 2.30416e-08
	best keyword for cluster 6316795 is PF01960 with Jaccard = 1.0000	[ 204	0	1100007	0 ]	1.0000 1.0000
sibling [ 6316795 ] : 6759753 0.00892857 (=1/(1*112)) 99.25
	best keyword for cluster 6759753 is PF03576 with Jaccard = 1.0000	[ 96	0	1100115	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF01960 ( PF01960   ArgJ family )
	B> PF03576 ( PF03576   Peptidase family S58 )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	both PF01960 and PF03576 have PDB structures
	PF01960 	   d.154.1.2
	
	SUPERFAM mapping significantly overlapping:
     1	PF03576 SSF56266 0.924 (average over 295 mutual instances, PF03576 295 appearances, SSF56266 883 appearances)
     2	PF01960 SSF56266 0.979 (average over 568 mutual instances, PF01960 572 appearances, SSF56266 883 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 29 ) 6618498_PF01343_PF01972            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01972  is 6465557 with Jaccard = 1.0000	|PF01972|=33 [ 33	0	1100178	0 ]
 parent [ 6465557 ] : 6618498 0.332616 (=4634/(36*387)) 69.8682
  given [ 6465557 ] : 6465557 0.969697 (=96/(3*33)) 3.09155
	best keyword for cluster 6465557 is PF01972 with Jaccard = 1.0000	[ 33	0	1100178	0 ]	1.0000 1.0000
sibling [ 6465557 ] : 6606401 0.377922 (=291/(2*385)) 64.8742
	best keyword for cluster 6606401 is PF01343 with Jaccard = 0.9508	[ 348	0	1099845	18 ]	1.0000 0.9508

	SUGGESTING RELATEDNESS OF:   
	A> PF01972 ( PF01972   Protein of unknown function DUF114 )
	B> PF01343 ( PF01343   Peptidase family S49 )

	they come from the same clan: CL0127.6 : PF03255	PF01039	PF00574	PF01972	PF00378	PF06833	PF03572	PF01343
	the two keywords do not coincide on UniRef90 proteins

	Neither PF01972 nor PF01343 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 30 ) 6760043_PF01998_PF05805            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01998  is 6645338 with Jaccard = 1.0000	|PF01998|=25 [ 25	0	1100186	0 ]
 parent [ 6645338 ] : 6760043 0.00788177 (=8/(29*35)) 99.2662
  given [ 6645338 ] : 6645338 0.230769 (=18/(26*3)) 78.5426
	best keyword for cluster 6645338 is PF01998 with Jaccard = 1.0000	[ 25	0	1100186	0 ]	1.0000 1.0000
sibling [ 6645338 ] : 6717386 0.06 (=9/(30*5)) 95.0668
	best keyword for cluster 6717386 is PF05805 with Jaccard = 1.0000	[ 29	0	1100182	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF01998 ( PF01998   Protein of unknown function DUF131 )
	B> PF05805 ( PF05805   L6 membrane protein )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF01998 nor PF05805 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 31 ) 6561248_PF02031_PF05547            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02031  is 6164337 with Jaccard = 1.0000	|PF02031|=7 [ 7	0	1100204	0 ]
 parent [ 6164337 ] : 6561248 0.62406 (=166/(7*38)) 47.9608
  given [ 6164337 ] : 6164337 1 (=6/(1*6)) 3.63333e-20
	best keyword for cluster 6164337 is PF02031 with Jaccard = 1.0000	[ 7	0	1100204	0 ]	1.0000 1.0000
sibling [ 6164337 ] : 6509585 0.858238 (=224/(29*9)) 15.11
	best keyword for cluster 6509585 is PF05547 with Jaccard = 0.6444	[ 29	2	1100166	14 ]	0.9355 0.6744
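
Note on the keyword scores: the bracketed counts are [ TP FP TN FN ] per the legend. The sketch below assumes Jaccard = TP/(TP+FP+FN). The column the legend calls Specificity is reproduced here by TP/(TP+FP) (i.e. precision), and Sensitivity by TP/(TP+FN). These formulas are inferred from the numbers in this record; the function name `keyword_scores` is hypothetical and this is not the generating code.

```python
def keyword_scores(tp: int, fp: int, tn: int, fn: int) -> tuple[float, float, float]:
    """Return (jaccard, precision, sensitivity) from confusion-matrix counts.

    tn is accepted so the call mirrors the [ TP FP TN FN ] layout,
    but none of the three scores uses it.
    """
    jaccard = tp / (tp + fp + fn)
    precision = tp / (tp + fp)      # printed under "Specificity" in this report
    sensitivity = tp / (tp + fn)
    return jaccard, precision, sensitivity

# counts for cluster 6509585 / keyword PF05547
jaccard, precision, sensitivity = keyword_scores(29, 2, 1100166, 14)
print(f"{jaccard:.4f} {precision:.4f} {sensitivity:.4f}")
```

With these counts the sketch reproduces the 0.6444, 0.9355 and 0.6744 shown on the line above, which is the basis for the assumed formulas.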

	SUGGESTING RELATEDNESS OF:   
	A> PF02031 ( PF02031   Streptomyces extracellular neutral proteinase (M7) family )
	B> PF05547 ( PF05547   Immune inhibitor A peptidase M6 )

	they come from the same clan: CL0126.12 : PF08325	PF01421	PF01752	PF01457	PF02031	PF09471	PF05299	PF05547	PF05572	PF01434	PF01447	PF02128	PF02102	PF02074	PF01432	PF01742	PF01401	PF01431	PF05548	PF00413	PF01433	PF01863	PF07998	PF01400
	the two keywords do not coincide on UniRef90 proteins

	only  PF02031 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 32 ) 6616117_PF02055_PF02057            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02057  is 6536926 with Jaccard = 1.0000	|PF02057|=11 [ 11	0	1100200	0 ]
 parent [ 6536926 ] : 6616117 0.359566 (=265/(67*11)) 68.9668
  given [ 6536926 ] : 6536926 0.7 (=7/(1*10)) 30.0036
	best keyword for cluster 6536926 is PF02057 with Jaccard = 1.0000	[ 11	0	1100200	0 ]	1.0000 1.0000
sibling [ 6536926 ] : 6486390 0.942353 (=801/(50*17)) 7.20224
	best keyword for cluster 6486390 is PF02055 with Jaccard = 0.8689	[ 53	5	1100150	3 ]	0.9138 0.9464

	SUGGESTING RELATEDNESS OF:   
	A> PF02057 ( PF02057   Glycosyl hydrolase family 59 )
	B> PF02055 ( PF02055   O-Glycosyl hydrolase family 30 )

	they come from the same clan: CL0058.10 : PF07971	PF02446	PF03198	PF02324	PF02057	PF01630	PF07745	PF02449	PF01229	PF01301	PF01055	PF02055	PF00933	PF02836	PF02156	PF01183	PF00728	PF00704	PF00332	PF01373	PF00331	PF00232	PF02638	PF00150	PF00128	PF02065
	the two keywords do not coincide on UniRef90 proteins

	only  PF02057 has a PDB structure (may not be up to date)
		PF02055 	   c.1.8.3

	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 33 ) 6767530_PF02083_PF03303            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02083  is 6734368 with Jaccard = 1.0000	|PF02083|=17 [ 17	0	1100194	0 ]
 parent [ 6734368 ] : 6767530 0.00408163 (=5/(25*49)) 99.6199
  given [ 6734368 ] : 6734368 0.0441176 (=6/(17*8)) 97.211
	best keyword for cluster 6734368 is PF02083 with Jaccard = 1.0000	[ 17	0	1100194	0 ]	1.0000 1.0000
sibling [ 6734368 ] : 6757855 0.0117647 (=6/(15*34)) 99.1449
	best keyword for cluster 6757855 is PF03303 with Jaccard = 1.0000	[ 15	0	1100196	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF02083 ( PF02083   Urotensin II )
	B> PF03303 ( PF03303   WTF protein )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF02083 nor PF03303 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 34 ) 6746226_PF00705_PF02144            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02144  is 6724671 with Jaccard = 1.0000	|PF02144|=23 [ 23	0	1100188	0 ]
 parent [ 6724671 ] : 6746226 0.0206774 (=116/(110*51)) 98.3284
  given [ 6724671 ] : 6724671 0.0415225 (=24/(34*17)) 96.0645
	best keyword for cluster 6724671 is PF02144 with Jaccard = 1.0000	[ 23	0	1100188	0 ]	1.0000 1.0000
sibling [ 6724671 ] : 6649498 0.247706 (=27/(1*109)) 79.9756
	best keyword for cluster 6649498 is PF00705 with Jaccard = 0.9515	[ 98	5	1100108	0 ]	0.9515 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF02144 ( PF02144   Repair protein Rad1/Rec1/Rad17 )
	B> PF00705 ( PF00705   Proliferating cell nuclear antigen, N-terminal domain )

	Only B has a clan ( CL0060.7 ).
	the two keywords do not coincide on UniRef90 proteins

	only  PF02144 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 35 ) 6568023_PF00096_PF02200            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02200  is 6030065 with Jaccard = 1.0000	|PF02200|=27 [ 27	0	1100184	0 ]
 parent [ 6030065 ] : 6568023 0.505667 (=55771/(28*3939)) 50.5762
  given [ 6030065 ] : 6030065 1 (=27/(1*27)) 1.63293e-31
	best keyword for cluster 6030065 is PF02200 with Jaccard = 1.0000	[ 27	0	1100184	0 ]	1.0000 1.0000
sibling [ 6030065 ] : 6565745 0.579583 (=13677/(6*3933)) 50.1386
	best keyword for cluster 6565745 is PF00096 with Jaccard = 0.7430	[ 3636	8	1095317	1250 ]	0.9978 0.7442

	SUGGESTING RELATEDNESS OF:   
	A> PF02200 ( PF02200   STE like transcription factor )
	B> PF00096 ( PF00096   Zinc finger, C2H2 type )

	Neither A nor B are assigned a clan.

	the two keywords coincide on UniRef90 proteins: |PF00096| = 4886 ,	 |PF02200| = 27 , 	|PF00096^PF02200| = 15	( 0.3% and 55.6% )

	only  PF02200 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 36 ) 6632161_PF02263_PF05879            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02263  is 6600235 with Jaccard = 1.0000	|PF02263|=91 [ 91	0	1100120	0 ]
 parent [ 6600235 ] : 6632161 0.279375 (=1341/(96*50)) 75.1732
  given [ 6600235 ] : 6600235 0.398148 (=215/(90*6)) 61.7589
	best keyword for cluster 6600235 is PF02263 with Jaccard = 1.0000	[ 91	0	1100120	0 ]	1.0000 1.0000
sibling [ 6600235 ] : 6618709 0.330357 (=111/(42*8)) 69.9763
	best keyword for cluster 6618709 is PF05879 with Jaccard = 0.8936	[ 42	0	1100164	5 ]	1.0000 0.8936

	SUGGESTING RELATEDNESS OF:   
	A> PF02263 ( PF02263   Guanylate-binding protein, N-terminal domain )
	B> PF05879 ( PF05879   Root hair defective 3 GTP-binding protein (RHD3) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF02263 has a PDB structure (may not be up to date)
	PF02263 	   c.37.1.8
	
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 37 ) 6745265_PF01160_PF02315            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02315  is 6425638 with Jaccard = 1.0000	|PF02315|=12 [ 12	0	1100199	0 ]
 parent [ 6425638 ] : 6745265 0.0311355 (=34/(12*91)) 98.2522
  given [ 6425638 ] : 6425638 1 (=11/(1*11)) 0.188364
	best keyword for cluster 6425638 is PF02315 with Jaccard = 1.0000	[ 12	0	1100199	0 ]	1.0000 1.0000
sibling [ 6425638 ] : 6706795 0.0666667 (=6/(1*90)) 93.4249
	best keyword for cluster 6706795 is PF01160 with Jaccard = 1.0000	[ 38	0	1100173	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF02315 ( PF02315   Methanol dehydrogenase beta subunit )
	B> PF01160 ( PF01160   Vertebrate endogenous opioids neuropeptide )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF02315 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:
     1	PF02315 SSF48666 0.766 (average over 20 mutual instances, PF02315 20 appearances, SSF48666 20 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 38 ) 6749298_PF02350_PF04007            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02350  is 6621592 with Jaccard = 1.0000	|PF02350|=268 [ 268	0	1099943	0 ]
 parent [ 6621592 ] : 6749298 0.0184441 (=202/(296*37)) 98.5682
  given [ 6621592 ] : 6621592 0.29932 (=176/(2*294)) 71.0835
	best keyword for cluster 6621592 is PF02350 with Jaccard = 1.0000	[ 268	0	1099943	0 ]	1.0000 1.0000
sibling [ 6621592 ] : 6738192 0.0314685 (=9/(26*11)) 97.6077
	best keyword for cluster 6738192 is PF04007 with Jaccard = 0.9259	[ 25	1	1100184	1 ]	0.9615 0.9615

	SUGGESTING RELATEDNESS OF:   
	A> PF02350 ( PF02350   UDP-N-acetylglucosamine 2-epimerase )
	B> PF04007 ( PF04007   Protein of unknown function (DUF354) )

	they come from the same clan: CL0113.8 : PF06925	PF02684	PF04464	PF04101	PF01075	PF03033	PF00982	PF00534	PF05693	PF02350	PF04007	PF06722	PF05159	PF08660	PF00343	PF00201
	the two keywords do not coincide on UniRef90 proteins

	only  PF02350 has a PDB structure (may not be up to date)
	PF02350 	   c.87.1.3
	
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 39 ) 6749540_PF02386_PF03814            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02386  is 6559302 with Jaccard = 1.0000	|PF02386|=326 [ 326	0	1099885	0 ]
 parent [ 6559302 ] : 6749540 0.0191244 (=747/(372*105)) 98.5865
  given [ 6559302 ] : 6559302 0.567523 (=19470/(203*169)) 46.1176
	best keyword for cluster 6559302 is PF02386 with Jaccard = 1.0000	[ 326	0	1099885	0 ]	1.0000 1.0000
sibling [ 6559302 ] : 6743108 0.026 (=13/(100*5)) 98.0632
	best keyword for cluster 6743108 is PF03814 with Jaccard = 1.0000	[ 94	0	1100117	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF02386 ( PF02386   Cation transport protein )
	B> PF03814 ( PF03814   Potassium-transporting ATPase A subunit )

	Only A has a clan ( CL0030.10 ).
	the two keywords do not coincide on UniRef90 proteins

	Neither PF02386 nor PF03814 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 40 ) 6637846_PF02439_PF05393            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02439  is 6414549 with Jaccard = 1.0000	|PF02439|=17 [ 17	0	1100194	0 ]
 parent [ 6414549 ] : 6637846 0.270588 (=69/(17*15)) 76.4309
  given [ 6414549 ] : 6414549 1 (=72/(9*8)) 0.0556366
	best keyword for cluster 6414549 is PF02439 with Jaccard = 1.0000	[ 17	0	1100194	0 ]	1.0000 1.0000
sibling [ 6414549 ] : 6546736 0.694444 (=25/(3*12)) 36.4043
	best keyword for cluster 6546736 is PF05393 with Jaccard = 0.7500	[ 3	1	1100207	0 ]	0.7500 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF02439 ( PF02439   Adenovirus E3 region protein CR2 )
	B> PF05393 ( PF05393   Human adenovirus early E3A glycoprotein )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF02439 nor PF05393 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 41 ) 6726696_PF02443_PF07305            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02443  is 6506052 with Jaccard = 1.0000	|PF02443|=21 [ 21	0	1100190	0 ]
 parent [ 6506052 ] : 6726696 0.046875 (=9/(24*8)) 96.3226
  given [ 6506052 ] : 6506052 0.861111 (=93/(6*18)) 13.9414
	best keyword for cluster 6506052 is PF02443 with Jaccard = 1.0000	[ 21	0	1100190	0 ]	1.0000 1.0000
sibling [ 6506052 ] : 6672808 0.142857 (=1/(1*7)) 86.2857
	best keyword for cluster 6672808 is PF07305 with Jaccard = 1.0000	[ 7	0	1100204	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF02443 ( PF02443   Circovirus ORF-2 protein )
	B> PF07305 ( PF07305   Protein of unknown function (DUF1454) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF02443 nor PF07305 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 42 ) 6775898_PF00482_PF02529            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02529  is 6512765 with Jaccard = 1.0000	|PF02529|=27 [ 27	0	1100184	0 ]
 parent [ 6512765 ] : 6775898 0.00229885 (=54/(27*870)) 99.8776
  given [ 6512765 ] : 6512765 0.9 (=45/(2*25)) 16.6988
	best keyword for cluster 6512765 is PF02529 with Jaccard = 1.0000	[ 27	0	1100184	0 ]	1.0000 1.0000
sibling [ 6512765 ] : 6773386 0.00243759 (=79/(831*39)) 99.8154
	best keyword for cluster 6773386 is PF00482 with Jaccard = 0.9794	[ 712	3	1099484	12 ]	0.9958 0.9834

	SUGGESTING RELATEDNESS OF:   
	A> PF02529 ( PF02529   Cytochrome B6-F complex subunit 5 )
	B> PF00482 ( PF00482   Bacterial type II secretion system protein F domain )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF02529 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:
     1	PF02529 SSF103446 0.807 (average over 170 mutual instances, PF02529 170 appearances, SSF103446 172 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 43 ) 6729102_PF02632_PF07155            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02632  is 6574799 with Jaccard = 1.0000	|PF02632|=156 [ 156	0	1100055	0 ]
 parent [ 6574799 ] : 6729102 0.0455752 (=2336/(172*298)) 96.6228
  given [ 6574799 ] : 6574799 0.485294 (=165/(2*170)) 52.1508
	best keyword for cluster 6574799 is PF02632 with Jaccard = 1.0000	[ 156	0	1100055	0 ]	1.0000 1.0000
sibling [ 6574799 ] : 6712034 0.0725709 (=717/(38*260)) 94.2517
	best keyword for cluster 6712034 is PF07155 with Jaccard = 0.8462	[ 22	4	1100185	0 ]	0.8462 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF02632 ( PF02632   BioY family )
	B> PF07155 ( PF07155   Protein of unknown function (DUF1393) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF02632 nor PF07155 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 44 ) 6666590_PF02621_PF02642            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02642  is 6529533 with Jaccard = 1.0000	|PF02642|=45 [ 45	0	1100166	0 ]
 parent [ 6529533 ] : 6666590 0.178286 (=491/(51*54)) 84.6206
  given [ 6529533 ] : 6529533 0.747826 (=172/(5*46)) 25.607
	best keyword for cluster 6529533 is PF02642 with Jaccard = 1.0000	[ 45	0	1100166	0 ]	1.0000 1.0000
sibling [ 6529533 ] : 6598018 0.413462 (=43/(2*52)) 60.5863
	best keyword for cluster 6598018 is PF02621 with Jaccard = 0.9762	[ 41	0	1100169	1 ]	1.0000 0.9762

	SUGGESTING RELATEDNESS OF:   
	A> PF02642 ( PF02642   Uncharacterized ACR, COG2107 )
	B> PF02621 ( PF02621   Uncharacterized ACR, COG1427 )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF02642 has a PDB structure (may not be up to date)
	PF02642 	   c.94.1.1
	
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 45 ) 6756280_PF02659_PF03596            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02659  is 6560928 with Jaccard = 1.0000	|PF02659|=78 [ 78	0	1100133	0 ]
 parent [ 6560928 ] : 6756280 0.013824 (=71/(107*48)) 99.0487
  given [ 6560928 ] : 6560928 0.57875 (=1389/(32*75)) 47.5867
	best keyword for cluster 6560928 is PF02659 with Jaccard = 1.0000	[ 78	0	1100133	0 ]	1.0000 1.0000
sibling [ 6560928 ] : 6717408 0.0592334 (=17/(41*7)) 95.0708
	best keyword for cluster 6717408 is PF03596 with Jaccard = 0.9667	[ 29	0	1100181	1 ]	1.0000 0.9667

	SUGGESTING RELATEDNESS OF:   
	A> PF02659 ( PF02659   Domain of unknown function DUF )
	B> PF03596 ( PF03596   Cadmium resistance transporter )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF02659 nor PF03596 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 46 ) 6560926_PF02667_PF03806            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02667  is 6536414 with Jaccard = 1.0000	|PF02667|=37 [ 37	0	1100174	0 ]
 parent [ 6536414 ] : 6560926 0.557653 (=1625/(62*47)) 47.5842
  given [ 6536414 ] : 6536414 0.717391 (=33/(1*46)) 29.927
	best keyword for cluster 6536414 is PF02667 with Jaccard = 1.0000	[ 37	0	1100174	0 ]	1.0000 1.0000
sibling [ 6536414 ] : 6482805 0.937853 (=166/(59*3)) 6.3391
	best keyword for cluster 6482805 is PF03806 with Jaccard = 0.9286	[ 52	4	1100155	0 ]	0.9286 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF02667 ( PF02667   Short chain fatty acid transporter )
	B> PF03806 ( PF03806   AbgT putative transporter family )

	they come from the same clan: CL0182.8 : PF06450	PF00939	PF03553	PF07158	PF02652	PF02447	PF04165	PF07854	PF07399	PF03606	PF03605	PF06808	PF03600	PF02040	PF00873	PF03806	PF02667
	the two keywords do not coincide on UniRef90 proteins

	Neither PF02667 nor PF03806 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:
     1	PF03806 SSF103473 0.746 (average over 1 mutual instances, PF03806 1 appearances, SSF103473 39293 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 47 ) 6757239_PF00696_PF02670            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02670  is 6456649 with Jaccard = 1.0000	|PF02670|=230 [ 230	0	1099981	0 ]
 parent [ 6456649 ] : 6757239 0.0127026 (=5532/(249*1749)) 99.1056
  given [ 6456649 ] : 6456649 0.985095 (=727/(3*246)) 1.92559
	best keyword for cluster 6456649 is PF02670 with Jaccard = 1.0000	[ 230	0	1099981	0 ]	1.0000 1.0000
sibling [ 6456649 ] : 6750126 0.0200229 (=35/(1*1748)) 98.6267
	best keyword for cluster 6750126 is PF00696 with Jaccard = 0.8169	[ 1298	265	1098622	26 ]	0.8305 0.9804

	SUGGESTING RELATEDNESS OF:   
	A> PF02670 ( PF02670   1-deoxy-D-xylulose 5-phosphate reductoisomerase )
	B> PF00696 ( PF00696   Amino acid kinase family )

	Only A has a clan ( CL0063.17 ).
	the two keywords do not coincide on UniRef90 proteins

	both PF02670 and PF00696 have PDB structures
	PF02670 	   c.2.1.3
	PF00696 	   c.73.1.1  c.73.1.2  c.73.1.3

	SUPERFAM mapping significantly overlapping:
     1	PF02670 SSF51735 0.860 (average over 693 mutual instances, PF02670 693 appearances, SSF51735 164772 appearances)
     2	PF00696 SSF53633 0.922 (average over 4687 mutual instances, PF00696 5933 appearances, SSF53633 7277 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 48 ) 6707567_PF02118_PF02688            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02688  is 6360931 with Jaccard = 1.0000	|PF02688|=62 [ 62	0	1100149	0 ]
 parent [ 6360931 ] : 6707567 0.0743728 (=664/(62*144)) 93.5584
  given [ 6360931 ] : 6360931 1 (=561/(11*51)) 2.72794e-05
	best keyword for cluster 6360931 is PF02688 with Jaccard = 1.0000	[ 62	0	1100149	0 ]	1.0000 1.0000
sibling [ 6360931 ] : 6683987 0.130022 (=466/(32*112)) 89.0859
	best keyword for cluster 6683987 is PF02118 with Jaccard = 0.6087	[ 42	26	1100142	1 ]	0.6176 0.9767

	SUGGESTING RELATEDNESS OF:   
	A> PF02688 ( PF02688   Domain of unknown function DUF215 )
	B> PF02118 ( PF02118   C.elegans Srg family integral membrane protein )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF02688 nor PF02118 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 49 ) 6645831_PF02701_PF05344            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02701  is 6301493 with Jaccard = 1.0000	|PF02701|=95 [ 95	0	1100116	0 ]
 parent [ 6301493 ] : 6645831 0.253506 (=235/(103*9)) 78.7358
  given [ 6301493 ] : 6301493 1 (=396/(4*99)) 1.9965e-09
	best keyword for cluster 6301493 is PF02701 with Jaccard = 1.0000	[ 95	0	1100116	0 ]	1.0000 1.0000
sibling [ 6301493 ] : 6606497 0.5 (=7/(2*7)) 64.995
	best keyword for cluster 6606497 is PF05344 with Jaccard = 0.8000	[ 4	0	1100206	1 ]	1.0000 0.8000

	SUGGESTING RELATEDNESS OF:   
	A> PF02701 ( PF02701   Dof domain, zinc finger )
	B> PF05344 ( PF05344   Domain of Unknown Function (DUF746) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF02701 nor PF05344 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 50 ) 6690965_PF01375_PF02917            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02917  is 6488710 with Jaccard = 1.0000	|PF02917|=8 [ 8	0	1100203	0 ]
 parent [ 6488710 ] : 6690965 0.1125 (=9/(8*10)) 90.4783
  given [ 6488710 ] : 6488710 1 (=12/(2*6)) 7.89477
	best keyword for cluster 6488710 is PF02917 with Jaccard = 1.0000	[ 8	0	1100203	0 ]	1.0000 1.0000
sibling [ 6488710 ] : 6685327 0.222222 (=2/(1*9)) 89.3333
	best keyword for cluster 6685327 is PF01375 with Jaccard = 1.0000	[ 4	0	1100207	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF02917 ( PF02917   Pertussis toxin, subunit 1 )
	B> PF01375 ( PF01375   Heat-labile enterotoxin alpha chain )

	they come from the same clan: CL0084.8 : PF02917	PF00644	PF01375	PF02763	PF03496	PF01129
	the two keywords do not coincide on UniRef90 proteins

	both PF02917 and PF01375 have PDB structures
		PF01375 	   d.166.1.1

	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 51 ) 6754260_PF02935_PF04762            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02935  is 6604280 with Jaccard = 1.0000	|PF02935|=20 [ 20	0	1100191	0 ]
 parent [ 6604280 ] : 6754260 0.0117845 (=14/(27*44)) 98.9231
  given [ 6604280 ] : 6604280 0.5 (=13/(1*26)) 63.7
	best keyword for cluster 6604280 is PF02935 with Jaccard = 1.0000	[ 20	0	1100191	0 ]	1.0000 1.0000
sibling [ 6604280 ] : 6751636 0.0232558 (=1/(1*43)) 98.7442
	best keyword for cluster 6751636 is PF04762 with Jaccard = 1.0000	[ 32	0	1100179	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF02935 ( PF02935   Cytochrome c oxidase subunit VIIc )
	B> PF04762 ( PF04762   IKI3 family )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF02935 has a PDB structure (may not be up to date)
	PF02935 	   f.23.6.1
	
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 52 ) 6496729_PF02977_PF06801            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02977  is 6101122 with Jaccard = 1.0000	|PF02977|=2 [ 2	0	1100209	0 ]
 parent [ 6101122 ] : 6496729 1 (=8/(2*4)) 10.2116
  given [ 6101122 ] : 6101122 1 (=1/(1*1)) 2e-25
	best keyword for cluster 6101122 is PF02977 with Jaccard = 1.0000	[ 2	0	1100209	0 ]	1.0000 1.0000
sibling [ 6101122 ] : 6387427 1 (=3/(1*3)) 0.0014
	best keyword for cluster 6387427 is PF06801 with Jaccard = 1.0000	[ 4	0	1100207	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF02977 ( PF02977   Carboxypeptidase A inhibitor )
	B> PF06801 ( PF06801   Protein of unknown function, DUF1532 )

	Only A has a clan ( CL0096.7 ).
	the two keywords do not coincide on UniRef90 proteins

	only  PF02977 has a PDB structure (may not be up to date)
	PF02977 	   g.3.2.1
	
	SUPERFAM mapping significantly overlapping:
     1	PF02977 SSF57027 0.991 (average over 3 mutual instances, PF02977 3 appearances, SSF57027 43 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 53 ) 6545097_PF03019_PF05744            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF03019  is 5862882 with Jaccard = 1.0000	|PF03019|=2 [ 2	0	1100209	0 ]
 parent [ 5862882 ] : 6545097 0.75 (=3/(2*2)) 35.25
  given [ 5862882 ] : 5862882 1 (=1/(1*1)) 2e-47
	best keyword for cluster 5862882 is PF03019 with Jaccard = 1.0000	[ 2	0	1100209	0 ]	1.0000 1.0000
sibling [ 5862882 ] : 5848995 1 (=1/(1*1)) 7e-49
	best keyword for cluster 5848995 is PF05744 with Jaccard = 0.6667	[ 2	0	1100208	1 ]	1.0000 0.6667

	SUGGESTING RELATEDNESS OF:   
	A> PF03019 ( PF03019   Furovirus P26 )
	B> PF05744 ( PF05744   Benyvirus P25 protein )

	Neither A nor B are assigned a clan.

	the two keywords coincide on UniRef90 proteins: |PF03019| = 2 ,	 |PF05744| = 3 , 	|PF03019^PF05744| = 1	( 50.0% and 33.3% )

	Neither PF03019 nor PF05744 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 54 ) 6753592_PF01784_PF03091            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF03091  is 6576312 with Jaccard = 1.0000	|PF03091|=146 [ 146	0	1100065	0 ]
 parent [ 6576312 ] : 6753592 0.0164445 (=808/(155*317)) 98.876
  given [ 6576312 ] : 6576312 0.477124 (=146/(2*153)) 52.6496
	best keyword for cluster 6576312 is PF03091 with Jaccard = 1.0000	[ 146	0	1100065	0 ]	1.0000 1.0000
sibling [ 6576312 ] : 6724392 0.0411765 (=210/(300*17)) 96.0361
	best keyword for cluster 6724392 is PF01784 with Jaccard = 0.9913	[ 229	0	1099980	2 ]	1.0000 0.9913

	SUGGESTING RELATEDNESS OF:   
	A> PF03091 ( PF03091   CutA1 divalent ion tolerance protein )
	B> PF01784 ( PF01784   NIF3 (NGG1p interacting factor 3) )

	Only A has a clan ( CL0089.8 ).
	the two keywords do not coincide on UniRef90 proteins

	both PF03091 and PF01784 have PDB structures
	PF03091 	   d.58.5.2
	
	SUPERFAM mapping significantly overlapping:
     1	PF01784 SSF102705 0.943 (average over 691 mutual instances, PF01784 822 appearances, SSF102705 692 appearances)
     2	PF03091 SSF54913 0.947 (average over 389 mutual instances, PF03091 390 appearances, SSF54913 2763 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 55 ) 6721673_PF00793_PF03102            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF03102  is 6497825 with Jaccard = 1.0000	|PF03102|=146 [ 146	0	1100065	0 ]
 parent [ 6497825 ] : 6721673 0.0524902 (=4437/(158*535)) 95.6569
  given [ 6497825 ] : 6497825 0.901075 (=419/(3*155)) 10.8848
	best keyword for cluster 6497825 is PF03102 with Jaccard = 1.0000	[ 146	0	1100065	0 ]	1.0000 1.0000
sibling [ 6497825 ] : 6655682 0.227403 (=16167/(246*289)) 81.9951
	best keyword for cluster 6655682 is PF00793 with Jaccard = 0.9895	[ 473	0	1099733	5 ]	1.0000 0.9895

	SUGGESTING RELATEDNESS OF:   
	A> PF03102 ( PF03102   NeuB family )
	B> PF00793 ( PF00793   DAHP synthetase I family )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	both PF03102 and PF00793 have PDB structures
		PF00793 	   c.1.10.4

	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 56 ) 6703844_PF03014_PF03115            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF03115  is 6662633 with Jaccard = 1.0000	|PF03115|=23 [ 23	0	1100188	0 ]
 parent [ 6662633 ] : 6703844 0.094086 (=35/(31*12)) 92.8751
  given [ 6662633 ] : 6662633 0.161905 (=34/(21*10)) 83.813
	best keyword for cluster 6662633 is PF03115 with Jaccard = 1.0000	[ 23	0	1100188	0 ]	1.0000 1.0000
sibling [ 6662633 ] : 6668281 0.15 (=3/(2*10)) 85.0074
	best keyword for cluster 6668281 is PF03014 with Jaccard = 0.9000	[ 9	1	1100201	0 ]	0.9000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF03115 ( PF03115   Astrovirus capsid protein precursor )
	B> PF03014 ( PF03014   Structural protein 2 )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF03115 nor PF03014 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 57 ) 6612540_PF01671_PF03158            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF03158  is 6323662 with Jaccard = 1.0000	|PF03158|=11 [ 11	0	1100200	0 ]
 parent [ 6323662 ] : 6612540 0.35 (=77/(11*20)) 67.5518
  given [ 6323662 ] : 6323662 1 (=18/(2*9)) 7.22961e-08
	best keyword for cluster 6323662 is PF03158 with Jaccard = 1.0000	[ 11	0	1100200	0 ]	1.0000 1.0000
sibling [ 6323662 ] : 6604515 0.361111 (=13/(2*18)) 63.9222
	best keyword for cluster 6604515 is PF01671 with Jaccard = 1.0000	[ 17	0	1100194	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF03158 ( PF03158   Multigene family 530 protein )
	B> PF01671 ( PF01671   African swine fever virus multigene family 360 protein )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF03158 nor PF01671 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:
     1	PF03158 SSF48403 0.558 (average over 1 mutual instances, PF03158 1 appearances, SSF48403 17044 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 58 ) 6766171_PF03192_PF03628            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF03192  is 6660260 with Jaccard = 1.0000	|PF03192|=25 [ 25	0	1100186	0 ]
 parent [ 6660260 ] : 6766171 0.00725953 (=4/(29*19)) 99.5626
  given [ 6660260 ] : 6660260 0.191667 (=23/(24*5)) 83.399
	best keyword for cluster 6660260 is PF03192 with Jaccard = 1.0000	[ 25	0	1100186	0 ]	1.0000 1.0000
sibling [ 6660260 ] : 6721474 0.0641026 (=5/(6*13)) 95.6315
	best keyword for cluster 6721474 is PF03628 with Jaccard = 1.0000	[ 5	0	1100206	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF03192 ( PF03192   Pyrococcus protein of unknown function, DUF257 )
	B> PF03628 ( PF03628   PapG chaperone-binding domain )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF03192 nor PF03628 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 59 ) 6739274_PF03194_PF04659            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF03194  is 6493721 with Jaccard = 1.0000	|PF03194|=60 [ 60	0	1100151	0 ]
 parent [ 6493721 ] : 6739274 0.0302154 (=94/(61*51)) 97.7126
  given [ 6493721 ] : 6493721 0.940678 (=111/(2*59)) 9.36073
	best keyword for cluster 6493721 is PF03194 with Jaccard = 1.0000	[ 60	0	1100151	0 ]	1.0000 1.0000
sibling [ 6493721 ] : 6726960 0.0464286 (=26/(16*35)) 96.3518
	best keyword for cluster 6726960 is PF04659 with Jaccard = 0.9500	[ 19	1	1100191	0 ]	0.9500 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF03194 ( PF03194   LUC7 N_terminus )
	B> PF04659 ( PF04659   Archaeal flagella protein )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF03194 nor PF04659 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 60 ) 6718989_PF02153_PF03201            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF03201  is 6353238 with Jaccard = 1.0000	|PF03201|=14 [ 14	0	1100197	0 ]
 parent [ 6353238 ] : 6718989 0.0564436 (=226/(14*286)) 95.2773
  given [ 6353238 ] : 6353238 1 (=48/(6*8)) 8.34744e-06
	best keyword for cluster 6353238 is PF03201 with Jaccard = 1.0000	[ 14	0	1100197	0 ]	1.0000 1.0000
sibling [ 6353238 ] : 6705140 0.0912281 (=26/(1*285)) 93.13
	best keyword for cluster 6705140 is PF02153 with Jaccard = 0.8755	[ 232	20	1099946	13 ]	0.9206 0.9469

	SUGGESTING RELATEDNESS OF:   
	A> PF03201 ( PF03201   H2-forming N5,N10-methylenetetrahydromethanopterin dehydrogenase )
	B> PF02153 ( PF02153   Prephenate dehydrogenase )

	Only B has a clan ( CL0063.17 ).
	the two keywords do not coincide on UniRef90 proteins

	Neither PF03201 nor PF02153 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:
     1	PF02153 SSF51735 0.580 (average over 737 mutual instances, PF02153 915 appearances, SSF51735 164772 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 61 ) 6716237_PF03220_PF08095            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF03220  is 5890004 with Jaccard = 1.0000	|PF03220|=12 [ 12	0	1100199	0 ]
 parent [ 5890004 ] : 6716237 0.0769231 (=6/(13*6)) 94.9231
  given [ 5890004 ] : 5890004 1 (=12/(1*12)) 1.1491e-44
	best keyword for cluster 5890004 is PF03220 with Jaccard = 1.0000	[ 12	0	1100199	0 ]	1.0000 1.0000
sibling [ 5890004 ] : 6697977 0.125 (=1/(4*2)) 91.875
	best keyword for cluster 6697977 is PF08095 with Jaccard = 1.0000	[ 2	0	1100209	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF03220 ( PF03220   Tombusvirus P19 core protein )
	B> PF08095 ( PF08095   Hefutoxin family )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	both PF03220 and PF08095 have PDB structures
		
	SUPERFAM mapping significantly overlapping:
     1	PF03220 SSF103145 0.842 (average over 54 mutual instances, PF03220 54 appearances, SSF103145 54 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 62 ) 6740829_PF03240_PF06981            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF03240  is 6685134 with Jaccard = 1.0000	|PF03240|=123 [ 123	0	1100088	0 ]
 parent [ 6685134 ] : 6740829 0.0287168 (=762/(145*183)) 97.859
  given [ 6685134 ] : 6685134 0.120567 (=68/(141*4)) 89.3038
	best keyword for cluster 6685134 is PF03240 with Jaccard = 1.0000	[ 123	0	1100088	0 ]	1.0000 1.0000
sibling [ 6685134 ] : 6729729 0.0408685 (=64/(9*174)) 96.6955
	best keyword for cluster 6729729 is PF06981 with Jaccard = 0.9813	[ 105	2	1100104	0 ]	0.9813 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF03240 (  )
	B> PF06981 (  )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF03240 nor PF06981 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 63 ) 6765561_PF02237_PF03309            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF03309  is 6592249 with Jaccard = 1.0000	|PF03309|=164 [ 164	0	1100047	0 ]
 parent [ 6592249 ] : 6765561 0.00467695 (=367/(190*413)) 99.5374
  given [ 6592249 ] : 6592249 0.428571 (=81/(1*189)) 58.0669
	best keyword for cluster 6592249 is PF03309 with Jaccard = 1.0000	[ 164	0	1100047	0 ]	1.0000 1.0000
sibling [ 6592249 ] : 6731294 0.0325159 (=419/(34*379)) 96.8599
	best keyword for cluster 6731294 is PF02237 with Jaccard = 0.6176	[ 210	128	1099871	2 ]	0.6213 0.9906

	SUGGESTING RELATEDNESS OF:   
	A> PF03309 ( PF03309   Bordetella pertussis Bvg accessory factor family )
	B> PF02237 ( PF02237   Biotin protein ligase C terminal domain )

	Only B has a clan ( CL0206.5 ).

	the two keywords coincide on UniRef90 proteins: |PF02237| = 212 ,	 |PF03309| = 164 , 	|PF02237^PF03309| = 1	( 0.5% and 0.6% )

	only  PF03309 has a PDB structure (may not be up to date)
		PF02237 	   b.34.1.1

	SUPERFAM mapping significantly overlapping:
     1	PF02237 SSF50037 0.995 (average over 385 mutual instances, PF02237 387 appearances, SSF50037 2023 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 64 ) 6718725_PF03314_PF05637            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF03314  is 6116985 with Jaccard = 1.0000	|PF03314|=18 [ 18	0	1100193	0 ]
 parent [ 6116985 ] : 6718725 0.0723514 (=112/(18*86)) 95.248
  given [ 6116985 ] : 6116985 1 (=56/(4*14)) 4.56494e-24
	best keyword for cluster 6116985 is PF03314 with Jaccard = 1.0000	[ 18	0	1100193	0 ]	1.0000 1.0000
sibling [ 6116985 ] : 6677423 0.152792 (=145/(73*13)) 87.5603
	best keyword for cluster 6677423 is PF05637 with Jaccard = 0.9706	[ 66	1	1100143	1 ]	0.9851 0.9851

	SUGGESTING RELATEDNESS OF:   
	A> PF03314 ( PF03314   Protein of unknown function, DUF273 )
	B> PF05637 ( PF05637   galactosyl transferase GMA12/MNN10 family )

	Only B has a clan ( CL0110.6 ).
	the two keywords do not coincide on UniRef90 proteins

	Neither PF03314 nor PF05637 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 65 ) 6698651_PF03331_PF07977            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF03331  is 6468662 with Jaccard = 1.0000	|PF03331|=135 [ 135	0	1100076	0 ]
 parent [ 6468662 ] : 6698651 0.0818869 (=5265/(152*423)) 91.9929
  given [ 6468662 ] : 6468662 0.966887 (=146/(1*151)) 3.51342
	best keyword for cluster 6468662 is PF03331 with Jaccard = 1.0000	[ 135	0	1100076	0 ]	1.0000 1.0000
sibling [ 6468662 ] : 6683841 0.143705 (=121/(2*421)) 89.05
	best keyword for cluster 6683841 is PF07977 with Jaccard = 0.8889	[ 312	0	1099860	39 ]	1.0000 0.8889

	SUGGESTING RELATEDNESS OF:   
	A> PF03331 ( PF03331   UDP-3-O-acyl N-acetylglycosamine deacetylase )
	B> PF07977 ( PF07977   FabA-like domain )

	Only B has a clan ( CL0050.7 ).

	the two keywords coincide on UniRef90 proteins: |PF03331| = 135 ,	 |PF07977| = 351 , 	|PF03331^PF07977| = 12	( 8.9% and 3.4% )

	both PF03331 and PF07977 have PDB structures
		PF07977 	   d.38.1.2  d.38.1.6

	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 66 ) 6730442_PF03337_PF07868            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF03337  is 6203794 with Jaccard = 1.0000	|PF03337|=12 [ 12	0	1100199	0 ]
 parent [ 6203794 ] : 6730442 0.0384615 (=1/(13*2)) 96.7692
  given [ 6203794 ] : 6203794 1 (=22/(2*11)) 5.95465e-17
	best keyword for cluster 6203794 is PF03337 with Jaccard = 1.0000	[ 12	0	1100199	0 ]	1.0000 1.0000
sibling [ 6203794 ] : 6398744 1 (=1/(1*1)) 0.007
	best keyword for cluster 6398744 is PF07868 with Jaccard = 1.0000	[ 2	0	1100209	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF03337 ( PF03337   Poxvirus F12L protein )
	B> PF07868 ( PF07868   Protein of unknown function (DUF1655) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF03337 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 67 ) 6769752_PF03369_PF04759            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF03369  is 6379454 with Jaccard = 1.0000	|PF03369|=19 [ 19	0	1100192	0 ]
 parent [ 6379454 ] : 6769752 0.00679117 (=4/(19*31)) 99.7029
  given [ 6379454 ] : 6379454 1 (=18/(1*18)) 0.000444445
	best keyword for cluster 6379454 is PF03369 with Jaccard = 1.0000	[ 19	0	1100192	0 ]	1.0000 1.0000
sibling [ 6379454 ] : 6764022 0.0333333 (=1/(1*30)) 99.4667
	best keyword for cluster 6764022 is PF04759 with Jaccard = 1.0000	[ 24	0	1100187	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF03369 ( PF03369   Herpesvirus UL3 protein )
	B> PF04759 ( PF04759   Protein of unknown function, DUF617 )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF03369 nor PF04759 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 68 ) 6738682_PF01681_PF03380            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF03380  is 6395617 with Jaccard = 1.0000	|PF03380|=16 [ 16	0	1100195	0 ]
 parent [ 6395617 ] : 6738682 0.0263158 (=8/(16*19)) 97.6541
  given [ 6395617 ] : 6395617 1 (=63/(7*9)) 0.00457322
	best keyword for cluster 6395617 is PF03380 with Jaccard = 1.0000	[ 16	0	1100195	0 ]	1.0000 1.0000
sibling [ 6395617 ] : 6695651 0.0857143 (=6/(14*5)) 91.4874
	best keyword for cluster 6695651 is PF01681 with Jaccard = 0.6667	[ 8	2	1100199	2 ]	0.8000 0.8000

	SUGGESTING RELATEDNESS OF:   
	A> PF03380 ( PF03380   Caenorhabditis protein of unknown function, DUF282 )
	B> PF01681 ( PF01681   C6 domain )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF03380 nor PF01681 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 69 ) 6714930_PF00115_PF03390            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF03390  is 6545427 with Jaccard = 1.0000	|PF03390|=36 [ 36	0	1100175	0 ]
 parent [ 6545427 ] : 6714930 0.075415 (=11459/(41*3706)) 94.7136
  given [ 6545427 ] : 6545427 0.648649 (=96/(37*4)) 35.5371
	best keyword for cluster 6545427 is PF03390 with Jaccard = 1.0000	[ 36	0	1100175	0 ]	1.0000 1.0000
sibling [ 6545427 ] : 6702546 0.110931 (=411/(1*3705)) 92.6389
	best keyword for cluster 6702546 is PF00115 with Jaccard = 0.9964	[ 3288	0	1096911	12 ]	1.0000 0.9964

	SUGGESTING RELATEDNESS OF:   
	A> PF03390 ( PF03390   Bacterial sodium:citrate symporter )
	B> PF00115 ( PF00115   Cytochrome C and Quinol oxidase polypeptide I )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF03390 has a PDB structure (may not be up to date)
		PF00115 	   f.24.1.1

	SUPERFAM mapping significantly overlapping:
     1	PF00115 SSF81442 0.952 (average over 60836 mutual instances, PF00115 60939 appearances, SSF81442 60941 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






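The %LINKAGE value on each parent / given / sibling line can be checked against the (=edges/(size1*size2)) expression printed next to it. The sketch below is only an illustration: the regular expression and the function name linkage are hypothetical and are written against the line layout seen in this file. The cluster size is taken as |cluster1| + |cluster2|, as stated in the legend.

```python
import re

# Matches the "(=existing_edges/(size1*size2))" part of a relation line.
LINKAGE_RE = re.compile(r"\(=(\d+)/\((\d+)\*(\d+)\)\)")


def linkage(line):
    """Return (linkage, cluster_size) recomputed from one relation line."""
    match = LINKAGE_RE.search(line)
    if match is None:
        raise ValueError("no linkage expression found")
    edges, size1, size2 = map(int, match.groups())
    return edges / (size1 * size2), size1 + size2


# Parent line of record 69, copied from above.
value, cluster_size = linkage(
    " parent [ 6545427 ] : 6714930 0.075415 (=11459/(41*3706)) 94.7136"
)
print(f"{value:.6f} {cluster_size}")  # 0.075415 3747
```
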
-------------------====== ( 70 ) 6682295_PF03418_PF06866            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF03418  is 6104243 with Jaccard = 1.0000	|PF03418|=23 [ 23	0	1100188	0 ]
 parent [ 6104243 ] : 6682295 0.148026 (=135/(24*38)) 88.7803
  given [ 6104243 ] : 6104243 1 (=23/(1*23)) 3.91787e-25
	best keyword for cluster 6104243 is PF03418 with Jaccard = 1.0000	[ 23	0	1100188	0 ]	1.0000 1.0000
sibling [ 6104243 ] : 6450664 0.986111 (=71/(2*36)) 1.39873
	best keyword for cluster 6450664 is PF06866 with Jaccard = 1.0000	[ 36	0	1100175	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF03418 ( PF03418   Germination protease )
	B> PF06866 ( PF06866   Protein of unknown function (DUF1256) )

	they come from the same clan: CL0095.8 : PF01750	PF06866	PF03418
	the two keywords do not coincide on UniRef90 proteins

	only  PF03418 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 71 ) 6726044_PF00290_PF03437            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF03437  is 6441796 with Jaccard = 1.0000	|PF03437|=37 [ 37	0	1100174	0 ]
 parent [ 6441796 ] : 6726044 0.0521127 (=592/(40*284)) 96.2404
  given [ 6441796 ] : 6441796 0.9925 (=397/(20*20)) 0.750139
	best keyword for cluster 6441796 is PF03437 with Jaccard = 1.0000	[ 37	0	1100174	0 ]	1.0000 1.0000
sibling [ 6441796 ] : 6467471 0.971467 (=2145/(8*276)) 3.33629
	best keyword for cluster 6467471 is PF00290 with Jaccard = 0.9059	[ 260	2	1099924	25 ]	0.9924 0.9123

	SUGGESTING RELATEDNESS OF:   
	A> PF03437 ( PF03437   BtpA family )
	B> PF00290 ( PF00290   Tryptophan synthase alpha chain )

	they come from the same clan: CL0036.17 : PF05690	PF01680	PF00834	PF01729	PF00697	PF03740	PF01884	PF00724	PF00215	PF03060	PF04095	PF04131	PF00478	PF00218	PF00977	PF01645	PF04309	PF01070	PF01207	PF04481	PF04476	PF01180	PF00701	PF01791	PF03932	PF03437	PF01081	PF00121	PF09370	PF02581	PF00290
	the two keywords do not coincide on UniRef90 proteins

	only  PF03437 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:
     1	PF03437 SSF51366 0.881 (average over 36 mutual instances, PF03437 64 appearances, SSF51366 8168 appearances)
     2	PF00290 SSF51366 0.965 (average over 971 mutual instances, PF00290 1015 appearances, SSF51366 8168 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 72 ) 6624418_PF03531_PF08512            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF03531  is 6505640 with Jaccard = 1.0000	|PF03531|=46 [ 46	0	1100165	0 ]
 parent [ 6505640 ] : 6624418 0.301333 (=226/(15*50)) 72.328
  given [ 6505640 ] : 6505640 0.87234 (=123/(3*47)) 13.6458
	best keyword for cluster 6505640 is PF03531 with Jaccard = 1.0000	[ 46	0	1100165	0 ]	1.0000 1.0000
sibling [ 6505640 ] : 6388117 1 (=56/(7*8)) 0.00158046
	best keyword for cluster 6388117 is PF08512 with Jaccard = 1.0000	[ 14	0	1100197	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF03531 ( PF03531   Structure-specific recognition protein (SSRP1) )
	B> PF08512 ( PF08512   Histone chaperone Rttp106-like )

	they come from the same clan: CL0215.5 : PF03531	PF08512
	the two keywords do not coincide on UniRef90 proteins

	Neither A nor B has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 73 ) 6718707_PF03554_PF05702            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF03554  is 6477431 with Jaccard = 1.0000	|PF03554|=29 [ 29	0	1100182	0 ]
 parent [ 6477431 ] : 6718707 0.0598911 (=33/(29*19)) 95.2443
  given [ 6477431 ] : 6477431 0.957143 (=201/(15*14)) 5.16296
	best keyword for cluster 6477431 is PF03554 with Jaccard = 1.0000	[ 29	0	1100182	0 ]	1.0000 1.0000
sibling [ 6477431 ] : 6699135 0.153846 (=12/(6*13)) 92.0256
	best keyword for cluster 6699135 is PF05702 with Jaccard = 1.0000	[ 13	0	1100198	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF03554 ( PF03554   UL73 viral envelope glycoprotein )
	B> PF05702 ( PF05702   Herpesvirus UL49.5 envelope/tegument protein )

	they come from the same clan: CL0146.7 : PF05702	PF03554
	the two keywords do not coincide on UniRef90 proteins

	Neither A nor B has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 74 ) 6607735_PF03569_PF07255            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF03569  is 5663698 with Jaccard = 1.0000	|PF03569|=1 [ 1	0	1100210	0 ]
 parent [ 5663698 ] : 6607735 0.5 (=4/(4*2)) 65.625
  given [ 5663698 ] : 5663698 1 (=3/(1*3)) 4.66667e-71
	best keyword for cluster 5663698 is PF03569 with Jaccard = 1.0000	[ 1	0	1100210	0 ]	1.0000 1.0000
sibling [ 5663698 ] : 6235623 1 (=1/(1*1)) 2e-14
	best keyword for cluster 6235623 is PF07255 with Jaccard = 1.0000	[ 2	0	1100209	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF03569 ( PF03569   Peptidase family C8 )
	B> PF07255 ( PF07255   Benyvirus 14KDa protein )

	Only A has a clan ( CL0125.9 ).
	the two keywords do not coincide on UniRef90 proteins

	Neither A nor B has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 75 ) 6705040_PF03584_PF04664            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF03584  is 6596709 with Jaccard = 1.0000	|PF03584|=20 [ 20	0	1100191	0 ]
 parent [ 6596709 ] : 6705040 0.0785414 (=56/(23*31)) 93.11
  given [ 6596709 ] : 6596709 0.4 (=36/(18*5)) 60
	best keyword for cluster 6596709 is PF03584 with Jaccard = 1.0000	[ 20	0	1100191	0 ]	1.0000 1.0000
sibling [ 6596709 ] : 6641416 0.238095 (=20/(3*28)) 77.4404
	best keyword for cluster 6641416 is PF04664 with Jaccard = 1.0000	[ 26	0	1100185	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF03584 ( PF03584   Herpesvirus ICP4-like protein N-terminal region )
	B> PF04664 ( PF04664   Opioid growth factor receptor (OGFr) conserved region )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither A nor B has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 76 ) 6733155_PF03616_PF05684            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF03616  is 6556795 with Jaccard = 1.0000	|PF03616|=60 [ 60	0	1100151	0 ]
 parent [ 6556795 ] : 6733155 0.0441067 (=119/(71*38)) 97.0673
  given [ 6556795 ] : 6556795 0.597727 (=526/(16*55)) 44.0769
	best keyword for cluster 6556795 is PF03616 with Jaccard = 1.0000	[ 60	0	1100151	0 ]	1.0000 1.0000
sibling [ 6556795 ] : 6651156 0.234375 (=45/(32*6)) 80.3941
	best keyword for cluster 6651156 is PF05684 with Jaccard = 1.0000	[ 30	0	1100181	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF03616 ( PF03616   Sodium/glutamate symporter )
	B> PF05684 ( PF05684   Protein of unknown function (DUF819) )

	they come from the same clan: CL0064.7 : PF06826	PF03547	PF03601	PF05684	PF05982	PF03616	PF06965	PF00999	PF03977	PF01758
	the two keywords do not coincide on UniRef90 proteins

	Neither A nor B has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 77 ) 6717710_PF03644_PF05903            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF03644  is 6706567 with Jaccard = 1.0000	|PF03644|=39 [ 39	0	1100172	0 ]
 parent [ 6706567 ] : 6717710 0.0498732 (=236/(91*52)) 95.1135
  given [ 6706567 ] : 6706567 0.078125 (=15/(4*48)) 93.3974
	best keyword for cluster 6706567 is PF03644 with Jaccard = 1.0000	[ 39	0	1100172	0 ]	1.0000 1.0000
sibling [ 6706567 ] : 6534232 0.75129 (=1456/(34*57)) 28.38
	best keyword for cluster 6534232 is PF05903 with Jaccard = 0.8462	[ 88	0	1100107	16 ]	1.0000 0.8462

	SUGGESTING RELATEDNESS OF:   
	A> PF03644 ( PF03644   Glycosyl hydrolase family 85 )
	B> PF05903 ( PF05903   PPPDE putative peptidase domain )

	Neither A nor B are assigned a clan.

	the two keywords coincide on UniRef90 proteins: |PF03644| = 39, |PF05903| = 104, |PF03644^PF05903| = 3 (7.7% and 2.9%)

	Neither A nor B has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






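The two percentages on a "keywords coincide" line appear to be the shared protein count divided by each keyword's own count. That reading is an assumption inferred from the printed numbers, and the function name coincidence is hypothetical. A minimal sketch:

```python
def coincidence(size_a, size_b, shared):
    """Share of each keyword's proteins that also carry the other keyword, in %."""
    return 100 * shared / size_a, 100 * shared / size_b


# Values from record 77: |PF03644| = 39, |PF05903| = 104, 3 shared proteins.
pct_a, pct_b = coincidence(39, 104, 3)
print(f"{pct_a:.1f}% and {pct_b:.1f}%")  # 7.7% and 2.9%
```
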
-------------------====== ( 78 ) 6722841_PF03666_PF06218            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF03666  is 6575662 with Jaccard = 1.0000	|PF03666|=32 [ 32	0	1100179	0 ]
 parent [ 6575662 ] : 6722841 0.0614187 (=71/(34*34)) 95.8374
  given [ 6575662 ] : 6575662 0.5 (=32/(2*32)) 52.4362
	best keyword for cluster 6575662 is PF03666 with Jaccard = 1.0000	[ 32	0	1100179	0 ]	1.0000 1.0000
sibling [ 6575662 ] : 6664821 0.2 (=24/(30*4)) 84.248
	best keyword for cluster 6664821 is PF06218 with Jaccard = 1.0000	[ 30	0	1100181	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF03666 ( PF03666   Uncharacterised protein family (UPF0171) )
	B> PF06218 ( PF06218   Nitrogen permease regulator 2 )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither A nor B has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 79 ) 6558522_PF00429_PF03708            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF03708  is 6497025 with Jaccard = 1.0000	|PF03708|=14 [ 14	0	1100197	0 ]
 parent [ 6497025 ] : 6558522 0.569444 (=984/(16*108)) 45.5788
  given [ 6497025 ] : 6497025 0.904762 (=57/(7*9)) 10.381
	best keyword for cluster 6497025 is PF03708 with Jaccard = 1.0000	[ 14	0	1100197	0 ]	1.0000 1.0000
sibling [ 6497025 ] : 6556826 0.570874 (=294/(5*103)) 44.105
	best keyword for cluster 6556826 is PF00429 with Jaccard = 0.7500	[ 96	0	1100083	32 ]	1.0000 0.7500

	SUGGESTING RELATEDNESS OF:   
	A> PF03708 ( PF03708   Avian retrovirus envelope protein, gp85 )
	B> PF00429 ( PF00429   ENV polyprotein (coat polyprotein) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF03708 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 80 ) 6752068_PF02130_PF03740            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF03740  is 6479539 with Jaccard = 1.0000	|PF03740|=146 [ 146	0	1100065	0 ]
 parent [ 6479539 ] : 6752068 0.0123457 (=582/(162*291)) 98.7739
  given [ 6479539 ] : 6479539 0.944099 (=152/(1*161)) 5.5978
	best keyword for cluster 6479539 is PF03740 with Jaccard = 1.0000	[ 146	0	1100065	0 ]	1.0000 1.0000
sibling [ 6479539 ] : 6751948 0.0206897 (=6/(1*290)) 98.7655
	best keyword for cluster 6751948 is PF02130 with Jaccard = 0.9850	[ 263	0	1099944	4 ]	1.0000 0.9850

	SUGGESTING RELATEDNESS OF:   
	A> PF03740 ( PF03740   Pyridoxal phosphate biosynthesis protein PdxJ )
	B> PF02130 ( PF02130   Uncharacterized protein family UPF0054 )

	Only A has a clan ( CL0036.17 ).

	the two keywords coincide on UniRef90 proteins: |PF02130| = 267, |PF03740| = 146, |PF02130^PF03740| = 1 (0.4% and 0.7%)

	both PF03740 and PF02130 have PDB structures
		
	SUPERFAM mapping significantly overlapping:
     1	PF03740 SSF63892 0.984 (average over 495 mutual instances, PF03740 495 appearances, SSF63892 498 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 81 ) 6682244_PF03775_PF03961            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF03775  is 6481608 with Jaccard = 1.0000	|PF03775|=122 [ 122	0	1100089	0 ]
 parent [ 6481608 ] : 6682244 0.151884 (=1572/(138*75)) 88.7541
  given [ 6481608 ] : 6481608 0.945726 (=4426/(60*78)) 6.05531
	best keyword for cluster 6481608 is PF03775 with Jaccard = 1.0000	[ 122	0	1100089	0 ]	1.0000 1.0000
sibling [ 6481608 ] : 6549351 0.630137 (=92/(2*73)) 38.3585
	best keyword for cluster 6549351 is PF03961 with Jaccard = 1.0000	[ 65	0	1100146	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF03775 ( PF03775   Septum formation inhibitor MinC, C-terminal domain )
	B> PF03961 ( PF03961   Protein of unknown function (DUF342) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF03775 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:
     1	PF03775 SSF63848 0.956 (average over 426 mutual instances, PF03775 426 appearances, SSF63848 651 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 82 ) 6719091_PF03601_PF03812            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF03812  is 6066077 with Jaccard = 1.0000	|PF03812|=24 [ 24	0	1100187	0 ]
 parent [ 6066077 ] : 6719091 0.0627907 (=270/(25*172)) 95.2908
  given [ 6066077 ] : 6066077 1 (=156/(12*13)) 2.11809e-28
	best keyword for cluster 6066077 is PF03812 with Jaccard = 1.0000	[ 24	0	1100187	0 ]	1.0000 1.0000
sibling [ 6066077 ] : 6626729 0.368421 (=63/(1*171)) 73.2716
	best keyword for cluster 6626729 is PF03601 with Jaccard = 0.9935	[ 152	0	1100058	1 ]	1.0000 0.9935

	SUGGESTING RELATEDNESS OF:   
	A> PF03812 ( PF03812   2-keto-3-deoxygluconate permease )
	B> PF03601 ( PF03601   Conserved hypothetical protein 698 )

	Only B has a clan ( CL0064.7 ).
	the two keywords do not coincide on UniRef90 proteins

	Neither A nor B has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 83 ) 6769312_PF03816_PF07349            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF03816  is 6748358 with Jaccard = 1.0000	|PF03816|=268 [ 268	0	1099943	0 ]
 parent [ 6748358 ] : 6769312 0.00348211 (=77/(351*63)) 99.6875
  given [ 6748358 ] : 6748358 0.0176587 (=89/(336*15)) 98.4974
	best keyword for cluster 6748358 is PF03816 with Jaccard = 1.0000	[ 268	0	1099943	0 ]	1.0000 1.0000
sibling [ 6748358 ] : 6765690 0.00598086 (=5/(44*19)) 99.5427
	best keyword for cluster 6765690 is PF07349 with Jaccard = 1.0000	[ 7	0	1100204	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF03816 ( PF03816   Cell envelope-related transcriptional attenuator domain )
	B> PF07349 ( PF07349   Protein of unknown function (DUF1478) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither A nor B has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 84 ) 6740868_PF03870_PF05404            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF03870  is 6522437 with Jaccard = 1.0000	|PF03870|=38 [ 38	0	1100173	0 ]
 parent [ 6522437 ] : 6740868 0.0277778 (=19/(18*38)) 97.864
  given [ 6522437 ] : 6522437 0.805556 (=58/(2*36)) 21.3546
	best keyword for cluster 6522437 is PF03870 with Jaccard = 1.0000	[ 38	0	1100173	0 ]	1.0000 1.0000
sibling [ 6522437 ] : 6410460 1 (=17/(1*17)) 0.0336493
	best keyword for cluster 6410460 is PF05404 with Jaccard = 1.0000	[ 18	0	1100193	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF03870 ( PF03870   RNA polymerase Rpb8 )
	B> PF05404 ( PF05404   Translocon-associated protein, delta subunit precursor (TRAP-delta) )

	Only A has a clan ( CL0021.12 ).
	the two keywords do not coincide on UniRef90 proteins

	only  PF03870 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:
     1	PF03870 SSF50249 0.950 (average over 88 mutual instances, PF03870 88 appearances, SSF50249 52669 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 85 ) 6714365_PF01598_PF03897            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF03897  is 6428067 with Jaccard = 1.0000	|PF03897|=41 [ 41	0	1100170	0 ]
 parent [ 6428067 ] : 6714365 0.0755003 (=1494/(51*388)) 94.6228
  given [ 6428067 ] : 6428067 1 (=650/(26*25)) 0.236971
	best keyword for cluster 6428067 is PF03897 with Jaccard = 1.0000	[ 41	0	1100170	0 ]	1.0000 1.0000
sibling [ 6428067 ] : 6689106 0.10733 (=246/(6*382)) 90.0997
	best keyword for cluster 6689106 is PF01598 with Jaccard = 0.9712	[ 202	4	1100003	2 ]	0.9806 0.9902

	SUGGESTING RELATEDNESS OF:   
	A> PF03897 (  )
	B> PF01598 (  )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither A nor B has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 86 ) 6699580_PF02430_PF03993            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF03993  is 6685662 with Jaccard = 1.0000	|PF03993|=23 [ 23	0	1100188	0 ]
 parent [ 6685662 ] : 6699580 0.0833333 (=91/(28*39)) 92.1105
  given [ 6685662 ] : 6685662 0.111111 (=12/(36*3)) 89.3977
	best keyword for cluster 6685662 is PF03993 with Jaccard = 1.0000	[ 23	0	1100188	0 ]	1.0000 1.0000
sibling [ 6685662 ] : 6681421 0.130435 (=15/(5*23)) 88.5474
	best keyword for cluster 6681421 is PF02430 with Jaccard = 0.9333	[ 14	1	1100196	0 ]	0.9333 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF03993 ( PF03993   Domain of Unknown Function (DUF349) )
	B> PF02430 ( PF02430   Apical membrane antigen 1 )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF03993 has a PDB structure (may not be up to date)
		PF02430 	   g.61.1.1

	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 87 ) 6706703_PF04000_PF07493            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04000  is 6638412 with Jaccard = 1.0000	|PF04000|=46 [ 46	0	1100165	0 ]
 parent [ 6638412 ] : 6706703 0.0772201 (=220/(77*37)) 93.4166
  given [ 6638412 ] : 6638412 0.24487 (=358/(43*34)) 76.5882
	best keyword for cluster 6638412 is PF04000 with Jaccard = 1.0000	[ 46	0	1100165	0 ]	1.0000 1.0000
sibling [ 6638412 ] : 6680814 0.128788 (=17/(33*4)) 88.3827
	best keyword for cluster 6680814 is PF07493 with Jaccard = 0.9286	[ 26	0	1100183	2 ]	1.0000 0.9286

	SUGGESTING RELATEDNESS OF:   
	A> PF04000 ( PF04000   Sas10/Utp3/C1D family )
	B> PF07493 (  )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither A nor B has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 88 ) 6766946_PF03621_PF04019            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04019  is 6592319 with Jaccard = 1.0000	|PF04019|=25 [ 25	0	1100186	0 ]
 parent [ 6592319 ] : 6766946 0.00694444 (=15/(27*80)) 99.596
  given [ 6592319 ] : 6592319 0.42 (=21/(25*2)) 58.1465
	best keyword for cluster 6592319 is PF04019 with Jaccard = 1.0000	[ 25	0	1100186	0 ]	1.0000 1.0000
sibling [ 6592319 ] : 6682588 0.125 (=38/(76*4)) 88.8624
	best keyword for cluster 6682588 is PF03621 with Jaccard = 0.9620	[ 76	0	1100132	3 ]	1.0000 0.9620

	SUGGESTING RELATEDNESS OF:   
	A> PF04019 ( PF04019   Protein of unknown function (DUF359) )
	B> PF03621 ( PF03621   MbtH-like protein )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither A nor B has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 89 ) 6776165_PF01924_PF04029            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04029  is 6600910 with Jaccard = 1.0000	|PF04029|=58 [ 58	0	1100153	0 ]
 parent [ 6600910 ] : 6776165 0.0018528 (=18/(67*145)) 99.883
  given [ 6600910 ] : 6600910 0.407143 (=171/(7*60)) 62.0183
	best keyword for cluster 6600910 is PF04029 with Jaccard = 1.0000	[ 58	0	1100153	0 ]	1.0000 1.0000
sibling [ 6600910 ] : 6775481 0.00694444 (=1/(1*144)) 99.8681
	best keyword for cluster 6775481 is PF01924 with Jaccard = 1.0000	[ 121	0	1100090	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF04029 ( PF04029   2-phosphosulpholactate phosphatase )
	B> PF01924 ( PF01924   Hydrogenase formation hypA family )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF04029 has a PDB structure (may not be up to date)
	PF04029 	   c.148.1.1
	
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 90 ) 6718005_PF04035_PF06093            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04035  is 6466949 with Jaccard = 1.0000	|PF04035|=25 [ 25	0	1100186	0 ]
 parent [ 6466949 ] : 6718005 0.0618182 (=51/(25*33)) 95.1466
  given [ 6466949 ] : 6466949 0.973333 (=146/(10*15)) 3.27405
	best keyword for cluster 6466949 is PF04035 with Jaccard = 1.0000	[ 25	0	1100186	0 ]	1.0000 1.0000
sibling [ 6466949 ] : 6510434 0.866667 (=78/(3*30)) 15.744
	best keyword for cluster 6510434 is PF06093 with Jaccard = 1.0000	[ 30	0	1100181	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF04035 ( PF04035   Archaeal DNA-directed RNA polymerase subunit E'' (RpoE'' or RpoE2) )
	B> PF06093 ( PF06093   Transcription elongation protein Spt4 )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF04035 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 91 ) 6651698_PF04045_PF05452            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04045  is 6553244 with Jaccard = 1.0000	|PF04045|=36 [ 36	0	1100175	0 ]
 parent [ 6553244 ] : 6651698 0.203947 (=31/(38*4)) 80.5116
  given [ 6553244 ] : 6553244 0.609524 (=64/(3*35)) 41.3344
	best keyword for cluster 6553244 is PF04045 with Jaccard = 1.0000	[ 36	0	1100175	0 ]	1.0000 1.0000
sibling [ 6553244 ] : 6592303 0.5 (=2/(2*2)) 58.125
	best keyword for cluster 6592303 is PF05452 with Jaccard = 1.0000	[ 2	0	1100209	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF04045 ( PF04045   Arp2/3 complex, 34 kD subunit p34-Arc )
	B> PF05452 ( PF05452   Clavanin )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither A nor B has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 92 ) 6681927_PF04099_PF04628            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04099  is 6582101 with Jaccard = 1.0000	|PF04099|=75 [ 75	0	1100136	0 ]
 parent [ 6582101 ] : 6681927 0.134615 (=630/(78*60)) 88.6667
  given [ 6582101 ] : 6582101 0.480263 (=73/(2*76)) 54.4222
	best keyword for cluster 6582101 is PF04099 with Jaccard = 1.0000	[ 75	0	1100136	0 ]	1.0000 1.0000
sibling [ 6582101 ] : 6659161 0.209821 (=47/(4*56)) 83.1383
	best keyword for cluster 6659161 is PF04628 with Jaccard = 0.9565	[ 22	1	1100188	0 ]	0.9565 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF04099 ( PF04099   Sybindin-like family )
	B> PF04628 ( PF04628   Sedlin, N-terminal conserved region )

	they come from the same clan: CL0212.4 : PF01217	PF04628	PF04099
	the two keywords do not coincide on UniRef90 proteins

	only  PF04099 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:
     1	PF04628 SSF64356 0.915 (average over 153 mutual instances, PF04628 154 appearances, SSF64356 1711 appearances)
     2	PF04099 SSF64356 0.954 (average over 167 mutual instances, PF04099 170 appearances, SSF64356 1711 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:
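
The keyword statistics reported above for cluster 6659161 can be re-derived from its four counts. What follows is a minimal Python sketch, assuming the bracketed counts are ordered [ TP FP TN FN ] as stated in the legend. The function name keyword_stats is hypothetical and is not part of the pipeline that produced this report. In this entry the first trailing value coincides with TP/(TP+FP) and the second with TP/(TP+FN).

```python
def keyword_stats(tp, fp, tn, fn):
    """Return (jaccard, tp/(tp+fp), tp/(tp+fn)) for one keyword/cluster pair.

    tn is accepted only so the call mirrors the [ TP FP TN FN ] layout;
    none of the three ratios uses it.
    """
    jaccard = tp / (tp + fp + fn)
    ratio_fp = tp / (tp + fp)   # share of cluster members carrying the keyword
    ratio_fn = tp / (tp + fn)   # share of keyword proteins inside the cluster
    return jaccard, ratio_fp, ratio_fn


# Counts copied from the PF04628 line of this entry.
j, p, s = keyword_stats(22, 1, 1100188, 0)
print(f"{j:.4f} {p:.4f} {s:.4f}")
```

Run on the PF04628 counts, this prints 0.9565 0.9565 1.0000, the same three values shown on the "best keyword" line.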






-------------------====== ( 93 ) 6503008_PF04120_PF07300            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04120  is 5840337 with Jaccard = 1.0000	|PF04120|=7 [ 7	0	1100204	0 ]
 parent [ 5840337 ] : 6503008 0.920168 (=219/(7*34)) 12.5888
  given [ 5840337 ] : 5840337 1 (=12/(3*4)) 8.33752e-50
	best keyword for cluster 5840337 is PF04120 with Jaccard = 1.0000	[ 7	0	1100204	0 ]	1.0000 1.0000
sibling [ 5840337 ] : 6462708 0.984375 (=63/(2*32)) 2.66949
	best keyword for cluster 6462708 is PF07300 with Jaccard = 0.8571	[ 24	0	1100183	4 ]	1.0000 0.8571

	SUGGESTING RELATEDNESS OF:   
	A> PF04120 ( PF04120   Low affinity iron permease )
	B> PF07300 ( PF07300   Protein of unknown function (DUF1452) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF04120 nor PF07300 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:
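
The linkage values on the parent/given/sibling lines above can be checked against the edge counts printed next to them. This is a minimal Python sketch, assuming each relation line follows the legend layout "relation [ best ] : cluster linkage (=edges/(n1*n2)) ProtoLevel". The names LINE and parse_relation are hypothetical helpers, not part of the original tooling.

```python
import re

# One relation line: relation [ best ] : cluster linkage (=edges/(n1*n2)) level
LINE = re.compile(
    r"^\s*(parent|given|sibling)\s*\[\s*(\d+)\s*\]\s*:\s*(\d+)\s+(\S+)\s+"
    r"\(=(\d+)/\((\d+)\*(\d+)\)\)\s+(\S+)\s*$"
)


def parse_relation(line):
    """Split one relation line into typed fields; raise on unexpected layout."""
    m = LINE.match(line)
    if m is None:
        raise ValueError(f"not a relation line: {line!r}")
    relation, best, cluster, linkage, edges, n1, n2, level = m.groups()
    return {
        "relation": relation,
        "best": int(best),
        "cluster": int(cluster),
        "linkage": float(linkage),
        "edges": int(edges),
        "sizes": (int(n1), int(n2)),
        "proto_level": float(level),
    }


# Parent line copied from this entry.
rec = parse_relation(" parent [ 5840337 ] : 6503008 0.920168 (=219/(7*34)) 12.5888")
recomputed = rec["edges"] / (rec["sizes"][0] * rec["sizes"][1])
print(abs(recomputed - rec["linkage"]) < 1e-5)
```

For the parent line of this entry the recomputed ratio 219/(7*34) agrees with the printed 0.920168, so the check prints True. The same parser accepts ProtoLevel values written in scientific notation, such as 8.33752e-50.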






-------------------====== ( 94 ) 6729510_PF04132_PF04157            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04132  is 6607654 with Jaccard = 1.0000	|PF04132|=35 [ 35	0	1100176	0 ]
 parent [ 6607654 ] : 6729510 0.0415293 (=63/(37*41)) 96.6686
  given [ 6607654 ] : 6607654 0.35625 (=57/(32*5)) 65.5231
	best keyword for cluster 6607654 is PF04132 with Jaccard = 1.0000	[ 35	0	1100176	0 ]	1.0000 1.0000
sibling [ 6607654 ] : 6709014 0.0722222 (=13/(36*5)) 93.7974
	best keyword for cluster 6709014 is PF04157 with Jaccard = 0.9375	[ 30	1	1100179	1 ]	0.9677 0.9677

	SUGGESTING RELATEDNESS OF:   
	A> PF04132 (  )
	B> PF04157 ( PF04157   EAP30/Vps36 family )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF04132 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 95 ) 6771880_PF00909_PF04143            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04143  is 6641466 with Jaccard = 1.0000	|PF04143|=255 [ 255	0	1099956	0 ]
 parent [ 6641466 ] : 6771880 0.00381098 (=905/(328*724)) 99.7714
  given [ 6641466 ] : 6641466 0.270415 (=5752/(89*239)) 77.4524
	best keyword for cluster 6641466 is PF04143 with Jaccard = 1.0000	[ 255	0	1099956	0 ]	1.0000 1.0000
sibling [ 6641466 ] : 6770675 0.0055325 (=4/(1*723)) 99.7331
	best keyword for cluster 6770675 is PF00909 with Jaccard = 0.9448	[ 531	28	1099649	3 ]	0.9499 0.9944

	SUGGESTING RELATEDNESS OF:   
	A> PF04143 ( PF04143   YeeE/YedE family (DUF395) )
	B> PF00909 ( PF00909   Ammonium Transporter Family )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF04143 has a PDB structure (may not be up to date)
		PF00909 	   f.44.1.1

	SUPERFAM mapping significantly overlapping:
     1	PF00909 SSF111352 0.924 (average over 1549 mutual instances, PF00909 1587 appearances, SSF111352 1628 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 96 ) 6741418_PF04062_PF04189            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04189  is 6489650 with Jaccard = 1.0000	|PF04189|=45 [ 45	0	1100166	0 ]
 parent [ 6489650 ] : 6741418 0.0208333 (=37/(37*48)) 97.9167
  given [ 6489650 ] : 6489650 0.935652 (=538/(23*25)) 8.15847
	best keyword for cluster 6489650 is PF04189 with Jaccard = 1.0000	[ 45	0	1100166	0 ]	1.0000 1.0000
sibling [ 6489650 ] : 6442028 0.992424 (=131/(33*4)) 0.769978
	best keyword for cluster 6442028 is PF04062 with Jaccard = 0.9714	[ 34	0	1100176	1 ]	1.0000 0.9714

	SUGGESTING RELATEDNESS OF:   
	A> PF04189 ( PF04189   Gcd10p family )
	B> PF04062 ( PF04062   P21-ARC (ARP2/3 complex 21 kDa subunit) )

	Neither A nor B are assigned a clan.

	the two keywords coincide on UniRef90 proteins: |PF04062| = 35 ,	 |PF04189| = 45 , 	|PF04062^PF04189| = 1	( 2.9% and 2.2% )

	Neither PF04189 nor PF04062 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:
     1	PF04062 SSF69060 0.984 (average over 81 mutual instances, PF04062 81 appearances, SSF69060 81 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 97 ) 6736052_PF04206_PF06587            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04206  is 5874250 with Jaccard = 1.0000	|PF04206|=11 [ 11	0	1100200	0 ]
 parent [ 5874250 ] : 6736052 0.0272727 (=3/(11*10)) 97.3776
  given [ 5874250 ] : 5874250 1 (=10/(1*10)) 3.00006e-46
	best keyword for cluster 5874250 is PF04206 with Jaccard = 1.0000	[ 11	0	1100200	0 ]	1.0000 1.0000
sibling [ 5874250 ] : 6708293 0.08 (=2/(5*5)) 93.68
	best keyword for cluster 6708293 is PF06587 with Jaccard = 1.0000	[ 5	0	1100206	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF04206 ( PF04206   Tetrahydromethanopterin S-methyltransferase, subunit E )
	B> PF06587 ( PF06587   Protein of unknown function (DUF1137) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF04206 nor PF06587 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 98 ) 6726703_PF02302_PF04215            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04215  is 6296028 with Jaccard = 1.0000	|PF04215|=63 [ 63	0	1100148	0 ]
 parent [ 6296028 ] : 6726703 0.0394571 (=500/(64*198)) 96.3236
  given [ 6296028 ] : 6296028 1 (=828/(46*18)) 7.25879e-10
	best keyword for cluster 6296028 is PF04215 with Jaccard = 1.0000	[ 63	0	1100148	0 ]	1.0000 1.0000
sibling [ 6296028 ] : 6701543 0.101166 (=989/(94*104)) 92.4494
	best keyword for cluster 6701543 is PF02302 with Jaccard = 0.6496	[ 178	0	1099937	96 ]	1.0000 0.6496

	SUGGESTING RELATEDNESS OF:   
	A> PF04215 ( PF04215   Putative sugar-specific permease, SgaT/UlaA )
	B> PF02302 ( PF02302   PTS system, Lactose/Cellobiose specific IIB subunit )

	Neither A nor B are assigned a clan.

	the two keywords coincide on UniRef90 proteins: |PF02302| = 274 ,	 |PF04215| = 63 , 	|PF02302^PF04215| = 7	( 2.6% and 11.1% )

	only  PF04215 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:
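
The two percentages on the "coincide" line above are the shared protein count divided by each keyword's own count. This is a minimal Python sketch of that arithmetic, assuming the first percentage refers to the first keyword listed and the second to the second. The function name overlap_percentages is hypothetical.

```python
def overlap_percentages(size_first, size_second, shared):
    """Percent of each keyword's proteins that also carry the other keyword."""
    return 100.0 * shared / size_first, 100.0 * shared / size_second


# Counts copied from this entry: |PF02302| = 274, |PF04215| = 63, shared = 7.
pct_first, pct_second = overlap_percentages(274, 63, 7)
print(f"{pct_first:.1f}% and {pct_second:.1f}%")
```

With the counts of this entry the sketch prints 2.6% and 11.1%, matching the report. A small shared fraction on both sides, as here, means the two keywords rarely occur on the same protein.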






-------------------====== ( 99 ) 6623162_PF02550_PF04223            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04223  is 6480810 with Jaccard = 1.0000	|PF04223|=32 [ 32	0	1100179	0 ]
 parent [ 6480810 ] : 6623162 0.319368 (=2325/(35*208)) 71.7707
  given [ 6480810 ] : 6480810 0.941176 (=32/(1*34)) 5.88235
	best keyword for cluster 6480810 is PF04223 with Jaccard = 1.0000	[ 32	0	1100179	0 ]	1.0000 1.0000
sibling [ 6480810 ] : 6579868 0.502415 (=104/(1*207)) 53.6536
	best keyword for cluster 6579868 is PF02550 with Jaccard = 1.0000	[ 182	0	1100029	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF04223 ( PF04223   Citrate lyase, alpha subunit (CitF) )
	B> PF02550 ( PF02550   Acetyl-CoA hydrolase/transferase N-terminal domain )

	Only B has a clan ( CL0246.3 ).
	the two keywords do not coincide on UniRef90 proteins

	only  PF04223 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 100 ) 6699268_PF03441_PF04244            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04244  is 6537683 with Jaccard = 1.0000	|PF04244|=62 [ 62	0	1100149	0 ]
 parent [ 6537683 ] : 6699268 0.0910146 (=2822/(74*419)) 92.0514
  given [ 6537683 ] : 6537683 0.69863 (=51/(1*73)) 30.7167
	best keyword for cluster 6537683 is PF04244 with Jaccard = 1.0000	[ 62	0	1100149	0 ]	1.0000 1.0000
sibling [ 6537683 ] : 6685874 0.124699 (=207/(415*4)) 89.4406
	best keyword for cluster 6685874 is PF03441 with Jaccard = 0.9658	[ 367	7	1099831	6 ]	0.9813 0.9839

	SUGGESTING RELATEDNESS OF:   
	A> PF04244 ( PF04244   Deoxyribodipyrimidine photolyase-related protein )
	B> PF03441 ( PF03441   FAD binding domain of DNA photolyase )

	Neither A nor B are assigned a clan.

	the two keywords coincide on UniRef90 proteins: |PF03441| = 373 ,	 |PF04244| = 62 , 	|PF03441^PF04244| = 2	( 0.5% and 3.2% )

	only  PF04244 has a PDB structure (may not be up to date)
		PF03441 	   a.99.1.1

	SUPERFAM mapping significantly overlapping:
     1	PF03441 SSF48173 0.874 (average over 1104 mutual instances, PF03441 2070 appearances, SSF48173 2238 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 101 ) 6626515_PF04272_PF05366            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04272  is 6357188 with Jaccard = 1.0000	|PF04272|=3 [ 3	0	1100208	0 ]
 parent [ 6357188 ] : 6626515 0.333333 (=3/(3*3)) 73.09
  given [ 6357188 ] : 6357188 1 (=2/(1*2)) 1.51e-05
	best keyword for cluster 6357188 is PF04272 with Jaccard = 1.0000	[ 3	0	1100208	0 ]	1.0000 1.0000
sibling [ 6357188 ] : 6344397 1 (=2/(1*2)) 2e-06
	best keyword for cluster 6344397 is PF05366 with Jaccard = 1.0000	[ 3	0	1100208	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF04272 ( PF04272   Phospholamban )
	B> PF05366 ( PF05366   Sarcolipin )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	both PF04272 and PF05366 have PDB structures
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 102 ) 6698473_PF04315_PF08401            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04315  is 6686003 with Jaccard = 1.0000	|PF04315|=30 [ 30	0	1100181	0 ]
 parent [ 6686003 ] : 6698473 0.112251 (=591/(135*39)) 91.9559
  given [ 6686003 ] : 6686003 0.105263 (=4/(1*38)) 89.4737
	best keyword for cluster 6686003 is PF04315 with Jaccard = 1.0000	[ 30	0	1100181	0 ]	1.0000 1.0000
sibling [ 6686003 ] : 6683025 0.131016 (=441/(33*102)) 88.944
	best keyword for cluster 6683025 is PF08401 with Jaccard = 0.7978	[ 71	13	1100122	5 ]	0.8452 0.9342

	SUGGESTING RELATEDNESS OF:   
	A> PF04315 ( PF04315   Protein of unknown function, DUF462 )
	B> PF08401 ( PF08401   Domain of unknown function (DUF1738) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF04315 nor PF08401 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 103 ) 6738557_PF04335_PF04585            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04335  is 6671095 with Jaccard = 1.0000	|PF04335|=70 [ 70	0	1100141	0 ]
 parent [ 6671095 ] : 6738557 0.0286987 (=116/(86*47)) 97.6416
  given [ 6671095 ] : 6671095 0.15261 (=38/(3*83)) 85.8243
	best keyword for cluster 6671095 is PF04335 with Jaccard = 1.0000	[ 70	0	1100141	0 ]	1.0000 1.0000
sibling [ 6671095 ] : 6695417 0.116279 (=20/(43*4)) 91.4119
	best keyword for cluster 6695417 is PF04585 with Jaccard = 1.0000	[ 36	0	1100175	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF04335 ( PF04335   VirB8 protein )
	B> PF04585 ( PF04585   Conjugal transfer protein )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF04335 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 104 ) 6756909_PF04362_PF05683            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04362  is 6339235 with Jaccard = 1.0000	|PF04362|=60 [ 60	0	1100151	0 ]
 parent [ 6339235 ] : 6756909 0.0138408 (=208/(68*221)) 99.0862
  given [ 6339235 ] : 6339235 1 (=1107/(27*41)) 9.30839e-07
	best keyword for cluster 6339235 is PF04362 with Jaccard = 1.0000	[ 60	0	1100151	0 ]	1.0000 1.0000
sibling [ 6339235 ] : 6636984 0.254545 (=56/(1*220)) 76.2277
	best keyword for cluster 6636984 is PF05683 with Jaccard = 0.7000	[ 140	60	1100011	0 ]	0.7000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF04362 ( PF04362   Bacterial Fe(2+) trafficking )
	B> PF05683 ( PF05683   Fumarase C-terminus )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	both PF04362 and PF05683 have PDB structures
	PF04362 	   d.279.1.1
	
	SUPERFAM mapping significantly overlapping:
     1	PF04362 SSF111148 0.897 (average over 269 mutual instances, PF04362 269 appearances, SSF111148 269 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 105 ) 6614466_PF01989_PF04412            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04412  is 6266441 with Jaccard = 1.0000	|PF04412|=34 [ 34	0	1100177	0 ]
 parent [ 6266441 ] : 6614466 0.319444 (=299/(36*26)) 68.2913
  given [ 6266441 ] : 6266441 1 (=320/(16*20)) 4.52681e-12
	best keyword for cluster 6266441 is PF04412 with Jaccard = 1.0000	[ 34	0	1100177	0 ]	1.0000 1.0000
sibling [ 6266441 ] : 6511975 0.84 (=21/(1*25)) 16.2514
	best keyword for cluster 6511975 is PF01989 with Jaccard = 0.6857	[ 24	0	1100176	11 ]	1.0000 0.6857

	SUGGESTING RELATEDNESS OF:   
	A> PF04412 ( PF04412   Protein of unknown function (DUF521) )
	B> PF01989 ( PF01989   Protein of unknown function DUF126 )

	Neither A nor B are assigned a clan.

	the two keywords coincide on UniRef90 proteins: |PF01989| = 35 ,	 |PF04412| = 34 , 	|PF01989^PF04412| = 11	( 31.4% and 32.4% )

	Neither PF04412 nor PF01989 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 106 ) 6753963_PF03231_PF04461            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04461  is 6614442 with Jaccard = 1.0000	|PF04461|=104 [ 104	0	1100107	0 ]
 parent [ 6614442 ] : 6753963 0.0159774 (=34/(112*19)) 98.9024
  given [ 6614442 ] : 6614442 0.405405 (=45/(1*111)) 68.2712
	best keyword for cluster 6614442 is PF04461 with Jaccard = 1.0000	[ 104	0	1100107	0 ]	1.0000 1.0000
sibling [ 6614442 ] : 6724826 0.047619 (=4/(12*7)) 96.0875
	best keyword for cluster 6724826 is PF03231 with Jaccard = 1.0000	[ 12	0	1100199	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF04461 ( PF04461   Protein of unknown function (DUF520) )
	B> PF03231 ( PF03231   Bunyavirus non-structural protein NS-S )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF04461 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 107 ) 6701721_PF04472_PF07783            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04472  is 6500238 with Jaccard = 1.0000	|PF04472|=94 [ 94	0	1100117	0 ]
 parent [ 6500238 ] : 6701721 0.103061 (=202/(98*20)) 92.4861
  given [ 6500238 ] : 6500238 0.88996 (=1108/(83*15)) 11.6781
	best keyword for cluster 6500238 is PF04472 with Jaccard = 1.0000	[ 94	0	1100117	0 ]	1.0000 1.0000
sibling [ 6500238 ] : 6659324 0.176471 (=9/(17*3)) 83.2219
	best keyword for cluster 6659324 is PF07783 with Jaccard = 1.0000	[ 17	0	1100194	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF04472 ( PF04472   Protein of unknown function (DUF552) )
	B> PF07783 ( PF07783   Protein of unknown function (DUF1621) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF04472 nor PF07783 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 108 ) 6757417_PF04284_PF04474            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04474  is 6593323 with Jaccard = 1.0000	|PF04474|=55 [ 55	0	1100156	0 ]
 parent [ 6593323 ] : 6757417 0.0111111 (=36/(60*54)) 99.1166
  given [ 6593323 ] : 6593323 0.422414 (=49/(2*58)) 58.5054
	best keyword for cluster 6593323 is PF04474 with Jaccard = 1.0000	[ 55	0	1100156	0 ]	1.0000 1.0000
sibling [ 6593323 ] : 6749918 0.0206677 (=13/(37*17)) 98.6137
	best keyword for cluster 6749918 is PF04284 with Jaccard = 1.0000	[ 34	0	1100177	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF04474 ( PF04474   Protein of unknown function (DUF554) )
	B> PF04284 ( PF04284   Protein of unknown function (DUF441) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF04474 nor PF04284 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 109 ) 6727414_PF02250_PF04491            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04491  is 6382237 with Jaccard = 1.0000	|PF04491|=6 [ 6	0	1100205	0 ]
 parent [ 6382237 ] : 6727414 0.0438596 (=5/(6*19)) 96.4088
  given [ 6382237 ] : 6382237 1 (=9/(3*3)) 0.000666756
	best keyword for cluster 6382237 is PF04491 with Jaccard = 1.0000	[ 6	0	1100205	0 ]	1.0000 1.0000
sibling [ 6382237 ] : 6647837 0.25 (=22/(8*11)) 79.2808
	best keyword for cluster 6647837 is PF02250 with Jaccard = 0.9444	[ 17	0	1100193	1 ]	1.0000 0.9444

	SUGGESTING RELATEDNESS OF:   
	A> PF04491 ( PF04491   Poxvirus T4 protein, N terminus )
	B> PF02250 ( PF02250   35kD major secreted virus protein )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF04491 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:
     1	PF02250 SSF49889 0.911 (average over 94 mutual instances, PF02250 94 appearances, SSF49889 94 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 110 ) 6624840_PF04497_PF04638            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04497  is 6177740 with Jaccard = 1.0000	|PF04497|=12 [ 12	0	1100199	0 ]
 parent [ 6177740 ] : 6624840 0.293706 (=42/(13*11)) 72.4799
  given [ 6177740 ] : 6177740 1 (=22/(2*11)) 4.55e-19
	best keyword for cluster 6177740 is PF04497 with Jaccard = 1.0000	[ 12	0	1100199	0 ]	1.0000 1.0000
sibling [ 6177740 ] : 6479393 0.944444 (=17/(2*9)) 5.55618
	best keyword for cluster 6479393 is PF04638 with Jaccard = 1.0000	[ 11	0	1100200	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF04497 ( PF04497   Poxvirus E2 protein )
	B> PF04638 ( PF04638   Pox virus protein O1 )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF04497 nor PF04638 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 111 ) 6657100_PF04523_PF05765            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04523  is 5785944 with Jaccard = 1.0000	|PF04523|=7 [ 7	0	1100204	0 ]
 parent [ 5785944 ] : 6657100 0.202381 (=17/(7*12)) 82.3439
  given [ 5785944 ] : 5785944 1 (=10/(2*5)) 8.09003e-56
	best keyword for cluster 5785944 is PF04523 with Jaccard = 1.0000	[ 7	0	1100204	0 ]	1.0000 1.0000
sibling [ 5785944 ] : 6286516 1 (=27/(3*9)) 1.48519e-10
	best keyword for cluster 6286516 is PF05765 with Jaccard = 1.0000	[ 12	0	1100199	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF04523 ( PF04523   Herpes virus tegument protein U30 )
	B> PF05765 ( PF05765   Tegument protein )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF04523 nor PF05765 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 112 ) 6714693_PF04529_PF05830            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04529  is 6307902 with Jaccard = 1.0000	|PF04529|=7 [ 7	0	1100204	0 ]
 parent [ 6307902 ] : 6714693 0.0714286 (=8/(16*7)) 94.6688
  given [ 6307902 ] : 6307902 1 (=10/(2*5)) 5.08e-09
	best keyword for cluster 6307902 is PF04529 with Jaccard = 1.0000	[ 7	0	1100204	0 ]	1.0000 1.0000
sibling [ 6307902 ] : 6084671 1 (=60/(6*10)) 8.5e-27
	best keyword for cluster 6084671 is PF05830 with Jaccard = 1.0000	[ 16	0	1100195	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF04529 ( PF04529   Herpesvirus U59 protein )
	B> PF05830 ( PF05830   Nodulation protein Z (NodZ) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF04529 nor PF05830 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 113 ) 5990058_PF03043_PF04532            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04532  is 5482787 with Jaccard = 1.0000	|PF04532|=7 [ 7	0	1100204	0 ]
 parent [ 5482787 ] : 5990058 1 (=98/(7*14)) 4.12245e-35
  given [ 5482787 ] : 5482787 1 (=10/(2*5)) 4.50211e-99
	best keyword for cluster 5482787 is PF04532 with Jaccard = 1.0000	[ 7	0	1100204	0 ]	1.0000 1.0000
sibling [ 5482787 ] : 5652310 1 (=33/(3*11)) 1.21333e-72
	best keyword for cluster 5652310 is PF03043 with Jaccard = 0.6087	[ 14	0	1100188	9 ]	1.0000 0.6087

	SUGGESTING RELATEDNESS OF:   
	A> PF04532 ( PF04532   Protein of unknown function (DUF587) )
	B> PF03043 ( PF03043   Herpesvirus UL87 family )

	Neither A nor B are assigned a clan.

	the two keywords coincide on UniRef90 proteins: |PF03043| = 23 ,	 |PF04532| = 7 , 	|PF03043^PF04532| = 7	( 30.4% and 100.0% )

	Neither PF04532 nor PF03043 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 114 ) 6741166_PF04528_PF04537            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04537  is 6334332 with Jaccard = 1.0000	|PF04537|=12 [ 12	0	1100199	0 ]
 parent [ 6334332 ] : 6741166 0.0484496 (=25/(12*43)) 97.8936
  given [ 6334332 ] : 6334332 1 (=27/(3*9)) 4.02894e-07
	best keyword for cluster 6334332 is PF04537 with Jaccard = 1.0000	[ 12	0	1100199	0 ]	1.0000 1.0000
sibling [ 6334332 ] : 6738690 0.0238095 (=1/(1*42)) 97.6548
	best keyword for cluster 6738690 is PF04528 with Jaccard = 1.0000	[ 19	0	1100192	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF04537 ( PF04537   Herpesvirus UL55 protein )
	B> PF04528 ( PF04528   Adenovirus early E4 34 kDa protein conserved region )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF04537 nor PF04528 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 115 ) 6468227_PF04541_PF05900            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04541  is 6025537 with Jaccard = 1.0000	|PF04541|=7 [ 7	0	1100204	0 ]
 parent [ 6025537 ] : 6468227 1 (=91/(13*7)) 3.47301
  given [ 6025537 ] : 6025537 1 (=10/(2*5)) 6.00022e-32
	best keyword for cluster 6025537 is PF04541 with Jaccard = 1.0000	[ 7	0	1100204	0 ]	1.0000 1.0000
sibling [ 6025537 ] : 5976267 1 (=30/(3*10)) 2.22451e-36
	best keyword for cluster 5976267 is PF05900 with Jaccard = 1.0000	[ 13	0	1100198	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF04541 ( PF04541   Herpesvirus virion protein U34 )
	B> PF05900 ( PF05900   Gammaherpesvirus BFRF1 protein )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF04541 nor PF05900 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 116 ) 6539036_PF01664_PF04582            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04582  is 6460898 with Jaccard = 1.0000	|PF04582|=19 [ 19	0	1100192	0 ]
 parent [ 6460898 ] : 6539036 0.719298 (=123/(9*19)) 31.7089
  given [ 6460898 ] : 6460898 0.983333 (=59/(15*4)) 2.41249
	best keyword for cluster 6460898 is PF04582 with Jaccard = 1.0000	[ 19	0	1100192	0 ]	1.0000 1.0000
sibling [ 6460898 ] : 6089875 1 (=20/(4*5)) 2.27011e-26
	best keyword for cluster 6089875 is PF01664 with Jaccard = 0.7500	[ 9	0	1100199	3 ]	1.0000 0.7500

	SUGGESTING RELATEDNESS OF:   
	A> PF04582 ( PF04582   Reovirus sigma C capsid protein )
	B> PF01664 ( PF01664   Reovirus viral attachment protein sigma 1 )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	both PF04582 and PF01664 have PDB structures
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 117 ) 6732872_PF04637_PF06106            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04637  is 6455903 with Jaccard = 1.0000	|PF04637|=15 [ 15	0	1100196	0 ]
 parent [ 6455903 ] : 6732872 0.0333333 (=6/(15*12)) 97.0384
  given [ 6455903 ] : 6455903 0.981481 (=53/(6*9)) 1.87769
	best keyword for cluster 6455903 is PF04637 with Jaccard = 1.0000	[ 15	0	1100196	0 ]	1.0000 1.0000
sibling [ 6455903 ] : 6703747 0.0857143 (=3/(5*7)) 92.8572
	best keyword for cluster 6703747 is PF06106 with Jaccard = 0.8571	[ 6	1	1100204	0 ]	0.8571 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF04637 ( PF04637   Herpesvirus phosphoprotein 85 (HHV6-7 U14/HCMV UL25) )
	B> PF06106 ( PF06106   Staphylococcus protein of unknown function (DUF950) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF04637 nor PF06106 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 118 ) 6687298_PF00071_PF04670            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04670  is 6489581 with Jaccard = 1.0000	|PF04670|=66 [ 66	0	1100145	0 ]
 parent [ 6489581 ] : 6687298 0.124608 (=27125/(67*3249)) 89.7127
  given [ 6489581 ] : 6489581 0.923214 (=1034/(32*35)) 8.12218
	best keyword for cluster 6489581 is PF04670 with Jaccard = 1.0000	[ 66	0	1100145	0 ]	1.0000 1.0000
sibling [ 6489581 ] : 6686756 0.122059 (=2770/(7*3242)) 89.6045
	best keyword for cluster 6686756 is PF00071 with Jaccard = 0.6831	[ 2108	930	1097125	48 ]	0.6939 0.9777

	SUGGESTING RELATEDNESS OF:   
	A> PF04670 ( PF04670   Gtr1/RagA G protein conserved region )
	B> PF00071 ( PF00071   Ras family )

	Only B has a clan ( CL0017.14 ).
	the two keywords do not coincide on UniRef90 proteins

	only  PF04670 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 119 ) 6666441_PF04677_PF05011            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04677  is 6562020 with Jaccard = 1.0000	|PF04677|=61 [ 61	0	1100150	0 ]
 parent [ 6562020 ] : 6666441 0.168783 (=505/(44*68)) 84.5916
  given [ 6562020 ] : 6562020 0.533566 (=612/(31*37)) 48.5922
	best keyword for cluster 6562020 is PF04677 with Jaccard = 1.0000	[ 61	0	1100150	0 ]	1.0000 1.0000
sibling [ 6562020 ] : 6559297 0.604651 (=26/(1*43)) 46.1052
	best keyword for cluster 6559297 is PF05011 with Jaccard = 0.6818	[ 30	13	1100167	1 ]	0.6977 0.9677

	SUGGESTING RELATEDNESS OF:   
	A> PF04677 ( PF04677   Protein similar to CwfJ C-terminus 1 )
	B> PF05011 ( PF05011   Lariat debranching enzyme, C-terminal domain )

	Only A has a clan ( CL0265.2 ).
	the two keywords do not coincide on UniRef90 proteins

	Neither PF04677 nor PF05011 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:
     1	PF04677 SSF54197 0.709 (average over 38 mutual instances, PF04677 38 appearances, SSF54197 2604 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 120 ) 6744017_PF00314_PF04681            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04681  is 6593425 with Jaccard = 1.0000	|PF04681|=6 [ 6	0	1100205	0 ]
 parent [ 6593425 ] : 6744017 0.0242131 (=120/(21*236)) 98.1473
  given [ 6593425 ] : 6593425 0.447368 (=17/(2*19)) 58.6105
	best keyword for cluster 6593425 is PF04681 with Jaccard = 1.0000	[ 6	0	1100205	0 ]	1.0000 1.0000
sibling [ 6593425 ] : 6720066 0.0662393 (=31/(2*234)) 95.4238
	best keyword for cluster 6720066 is PF00314 with Jaccard = 0.9447	[ 188	2	1100012	9 ]	0.9895 0.9543

	SUGGESTING RELATEDNESS OF:   
	A> PF04681 ( PF04681   Blastomyces yeast-phase-specific protein )
	B> PF00314 ( PF00314   Thaumatin family )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF04681 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:
     1	PF00314 SSF49870 0.924 (average over 562 mutual instances, PF00314 579 appearances, SSF49870 583 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 121 ) 6672813_PF04691_PF05778            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04691  is 6602369 with Jaccard = 1.0000	|PF04691|=8 [ 8	0	1100203	0 ]
 parent [ 6602369 ] : 6672813 0.144444 (=13/(9*10)) 86.2901
  given [ 6602369 ] : 6602369 0.380952 (=8/(3*7)) 62.9733
	best keyword for cluster 6602369 is PF04691 with Jaccard = 1.0000	[ 8	0	1100203	0 ]	1.0000 1.0000
sibling [ 6602369 ] : 6420996 1 (=8/(1*8)) 0.1142
	best keyword for cluster 6420996 is PF05778 with Jaccard = 1.0000	[ 9	0	1100202	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF04691 ( PF04691   Apolipoprotein C-I (ApoC-1) )
	B> PF05778 ( PF05778   Apolipoprotein CIII (Apo-CIII) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF04691 has a PDB structure (may not be up to date)
	PF04691 	   j.39.1.1
	
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 122 ) 6661979_PF04533_PF04743            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04743  is 5977189 with Jaccard = 1.0000	|PF04743|=12 [ 12	0	1100199	0 ]
 parent [ 5977189 ] : 6661979 0.197917 (=19/(12*8)) 83.6965
  given [ 5977189 ] : 5977189 1 (=35/(5*7)) 2.94783e-36
	best keyword for cluster 5977189 is PF04743 with Jaccard = 1.0000	[ 12	0	1100199	0 ]	1.0000 1.0000
sibling [ 5977189 ] : 6403455 1 (=7/(1*7)) 0.0132857
	best keyword for cluster 6403455 is PF04533 with Jaccard = 1.0000	[ 7	0	1100204	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF04743 ( PF04743   BSRF1-like protein )
	B> PF04533 ( PF04533   Herpes virus U44 protein )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF04743 nor PF04533 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 123 ) 6758924_PF00046_PF04770            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04770  is 6455778 with Jaccard = 1.0000	|PF04770|=38 [ 38	0	1100173	0 ]
 parent [ 6455778 ] : 6758924 0.0116433 (=2265/(43*4524)) 99.2069
  given [ 6455778 ] : 6455778 0.981481 (=424/(27*16)) 1.85337
	best keyword for cluster 6455778 is PF04770 with Jaccard = 1.0000	[ 38	0	1100173	0 ]	1.0000 1.0000
sibling [ 6455778 ] : 6758416 0.00928587 (=42/(1*4523)) 99.1778
	best keyword for cluster 6758416 is PF00046 with Jaccard = 0.7971	[ 3284	750	1096091	86 ]	0.8141 0.9745

	SUGGESTING RELATEDNESS OF:   
	A> PF04770 ( PF04770   ZF-HD protein dimerisation region )
	B> PF00046 ( PF00046   Homeobox domain )

	Only B has a clan ( CL0123.12 ).

	the two keywords coincide on UniRef90 proteins: |PF00046| = 3370 ,	 |PF04770| = 38 , 	|PF00046^PF04770| = 1	( 0.0% and 2.6% )

	only  PF04770 has a PDB structure (may not be up to date)
		PF00046 	   a.4.1.1  j.92.1.1

	SUPERFAM mapping significantly overlapping:
     1	PF00046 SSF46689 0.773 (average over 9143 mutual instances, PF00046 9568 appearances, SSF46689 68153 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 124 ) 6655288_PF02957_PF04861            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04861  is 5994512 with Jaccard = 1.0000	|PF04861|=2 [ 2	0	1100209	0 ]
 parent [ 5994512 ] : 6655288 0.240854 (=79/(2*164)) 81.7596
  given [ 5994512 ] : 5994512 1 (=1/(1*1)) 1e-34
	best keyword for cluster 5994512 is PF04861 with Jaccard = 1.0000	[ 2	0	1100209	0 ]	1.0000 1.0000
sibling [ 5994512 ] : 6622080 0.314815 (=102/(2*162)) 71.3512
	best keyword for cluster 6622080 is PF02957 with Jaccard = 0.9627	[ 155	0	1100050	6 ]	1.0000 0.9627

	SUGGESTING RELATEDNESS OF:   
	A> PF04861 ( PF04861   Circovirus VP2 protein )
	B> PF02957 ( PF02957   TT viral ORF2 )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF04861 nor PF02957 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 125 ) 6720061_PF01778_PF04874            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04874  is 6434848 with Jaccard = 1.0000	|PF04874|=43 [ 43	0	1100168	0 ]
 parent [ 6434848 ] : 6720061 0.0570248 (=138/(44*55)) 95.4213
  given [ 6434848 ] : 6434848 0.995614 (=227/(6*38)) 0.438605
	best keyword for cluster 6434848 is PF04874 with Jaccard = 1.0000	[ 43	0	1100168	0 ]	1.0000 1.0000
sibling [ 6434848 ] : 6703803 0.0925926 (=5/(1*54)) 92.8704
	best keyword for cluster 6703803 is PF01778 with Jaccard = 1.0000	[ 51	0	1100160	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF04874 ( PF04874   Mak16 protein )
	B> PF01778 ( PF01778   Ribosomal L28e protein family )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF04874 nor PF01778 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 126 ) 6518288_PF04903_PF07140            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04903  is 6502709 with Jaccard = 1.0000	|PF04903|=6 [ 6	0	1100205	0 ]
 parent [ 6502709 ] : 6518288 0.8125 (=39/(6*8)) 19.3681
  given [ 6502709 ] : 6502709 0.875 (=14/(4*4)) 12.5
	best keyword for cluster 6502709 is PF04903 with Jaccard = 1.0000	[ 6	0	1100205	0 ]	1.0000 1.0000
sibling [ 6502709 ] : 5934272 1 (=8/(4*2)) 2.88001e-40
	best keyword for cluster 5934272 is PF07140 with Jaccard = 1.0000	[ 6	0	1100205	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF04903 ( PF04903   Poxvirus interferon gamma receptor )
	B> PF07140 ( PF07140   Interferon gamma receptor alpha chain (IFNGR1) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF04903 has a PDB structure (may not be up to date)
		PF07140 	   b.1.2.1

	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 127 ) 6708831_PF04953_PF06857            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04953  is 6361150 with Jaccard = 1.0000	|PF04953|=29 [ 29	0	1100182	0 ]
 parent [ 6361150 ] : 6708831 0.0807692 (=63/(30*26)) 93.7691
  given [ 6361150 ] : 6361150 1 (=161/(23*7)) 2.89541e-05
	best keyword for cluster 6361150 is PF04953 with Jaccard = 1.0000	[ 29	0	1100182	0 ]	1.0000 1.0000
sibling [ 6361150 ] : 6416824 1 (=165/(11*15)) 0.0728859
	best keyword for cluster 6416824 is PF06857 with Jaccard = 0.9286	[ 26	0	1100183	2 ]	1.0000 0.9286

	SUGGESTING RELATEDNESS OF:   
	A> PF04953 ( PF04953   Citrate lyase, gamma subunit )
	B> PF06857 ( PF06857   Malonate decarboxylase delta subunit (MdcD) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF04953 nor PF06857 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 128 ) 6679715_PF04956_PF06921            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04956  is 6644429 with Jaccard = 1.0000	|PF04956|=39 [ 39	0	1100172	0 ]
 parent [ 6644429 ] : 6679715 0.143476 (=232/(49*33)) 88.0998
  given [ 6644429 ] : 6644429 0.268116 (=37/(3*46)) 78.3097
	best keyword for cluster 6644429 is PF04956 with Jaccard = 1.0000	[ 39	0	1100172	0 ]	1.0000 1.0000
sibling [ 6644429 ] : 6645473 0.255556 (=23/(3*30)) 78.6423
	best keyword for cluster 6645473 is PF06921 with Jaccard = 1.0000	[ 14	0	1100197	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF04956 ( PF04956   TrbC/VIRB2 family )
	B> PF06921 (  )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF04956 nor PF06921 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 129 ) 6652360_PF04962_PF06845            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04962  is 6292252 with Jaccard = 1.0000	|PF04962|=33 [ 33	0	1100178	0 ]
 parent [ 6292252 ] : 6652360 0.229475 (=450/(37*53)) 80.755
  given [ 6292252 ] : 6292252 1 (=36/(1*36)) 3.94403e-10
	best keyword for cluster 6292252 is PF04962 with Jaccard = 1.0000	[ 33	0	1100178	0 ]	1.0000 1.0000
sibling [ 6292252 ] : 6427211 1 (=240/(5*48)) 0.218321
	best keyword for cluster 6427211 is PF06845 with Jaccard = 0.9804	[ 50	0	1100160	1 ]	1.0000 0.9804

	SUGGESTING RELATEDNESS OF:   
	A> PF04962 ( PF04962   5-keto 4-deoxyuronate isomerase )
	B> PF06845 ( PF06845   Myo-inositol catabolism protein IolB )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF04962 has a PDB structure (may not be up to date)
	PF04962 	   b.82.1.13
	
	SUPERFAM mapping significantly overlapping:
     1	PF06845 SSF51182 0.902 (average over 165 mutual instances, PF06845 165 appearances, SSF51182 14255 appearances)
     2	PF04962 SSF51182 0.952 (average over 115 mutual instances, PF04962 115 appearances, SSF51182 14255 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 130 ) 6675133_PF04965_PF07115            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04965  is 6664362 with Jaccard = 1.0000	|PF04965|=77 [ 77	0	1100134	0 ]
 parent [ 6664362 ] : 6675133 0.167224 (=200/(13*92)) 86.9705
  given [ 6664362 ] : 6664362 0.172285 (=46/(3*89)) 84.1489
	best keyword for cluster 6664362 is PF04965 with Jaccard = 1.0000	[ 77	0	1100134	0 ]	1.0000 1.0000
sibling [ 6664362 ] : 6612793 0.404762 (=17/(6*7)) 67.6918
	best keyword for cluster 6612793 is PF07115 with Jaccard = 1.0000	[ 6	0	1100205	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF04965 ( PF04965   Gene 25-like lysozyme )
	B> PF07115 ( PF07115   Protein of unknown function (DUF1371) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF04965 nor PF07115 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 131 ) 6737909_PF01917_PF04974            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04974  is 6715771 with Jaccard = 1.0000	|PF04974|=15 [ 15	0	1100196	0 ]
 parent [ 6715771 ] : 6737909 0.0282258 (=70/(80*31)) 97.578
  given [ 6715771 ] : 6715771 0.0672269 (=16/(14*17)) 94.844
	best keyword for cluster 6715771 is PF04974 with Jaccard = 1.0000	[ 15	0	1100196	0 ]	1.0000 1.0000
sibling [ 6715771 ] : 6676966 0.135742 (=139/(16*64)) 87.4912
	best keyword for cluster 6676966 is PF01917 with Jaccard = 0.8219	[ 60	13	1100138	0 ]	0.8219 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF04974 ( PF04974   Archaeal flagellar protein F )
	B> PF01917 ( PF01917   Archaebacterial flagellin )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF04974 nor PF01917 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 132 ) 6693292_PF00113_PF05034            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF05034  is 6615589 with Jaccard = 1.0000	|PF05034|=17 [ 17	0	1100194	0 ]
 parent [ 6615589 ] : 6693292 0.113277 (=1046/(18*513)) 90.9547
  given [ 6615589 ] : 6615589 0.34375 (=11/(16*2)) 68.6997
	best keyword for cluster 6615589 is PF05034 with Jaccard = 1.0000	[ 17	0	1100194	0 ]	1.0000 1.0000
sibling [ 6615589 ] : 6629605 0.285039 (=724/(508*5)) 74.3969
	best keyword for cluster 6629605 is PF00113 with Jaccard = 0.9665	[ 462	15	1099733	1 ]	0.9686 0.9978

	SUGGESTING RELATEDNESS OF:   
	A> PF05034 ( PF05034   Methylaspartate ammonia-lyase N-terminus )
	B> PF00113 ( PF00113   Enolase, C-terminal TIM barrel domain )

	A and B come from different clans ( CL0227.3 , CL0256.2 ).
	the two keywords do not coincide on UniRef90 proteins

	both PF05034 and PF00113 have PDB structures
		PF00113 	   c.1.11.1

	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 133 ) 6672094_PF02948_PF05111            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF05111  is 6597213 with Jaccard = 1.0000	|PF05111|=8 [ 8	0	1100203	0 ]
 parent [ 6597213 ] : 6672094 0.15625 (=45/(8*36)) 86.0401
  given [ 6597213 ] : 6597213 0.428571 (=3/(1*7)) 60.4286
	best keyword for cluster 6597213 is PF05111 with Jaccard = 1.0000	[ 8	0	1100203	0 ]	1.0000 1.0000
sibling [ 6597213 ] : 6626788 0.294118 (=20/(2*34)) 73.3307
	best keyword for cluster 6626788 is PF02948 with Jaccard = 0.9677	[ 30	1	1100180	0 ]	0.9677 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF05111 ( PF05111   Ameloblastin precursor (Amelin) )
	B> PF02948 ( PF02948   Amelogenin )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF05111 nor PF02948 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 134 ) 6748081_PF01970_PF05145            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF05145  is 6594580 with Jaccard = 1.0000	|PF05145|=65 [ 65	0	1100146	0 ]
 parent [ 6594580 ] : 6748081 0.0188554 (=255/(84*161)) 98.4737
  given [ 6594580 ] : 6594580 0.414634 (=68/(2*82)) 59.2019
	best keyword for cluster 6594580 is PF05145 with Jaccard = 1.0000	[ 65	0	1100146	0 ]	1.0000 1.0000
sibling [ 6594580 ] : 6654642 0.224522 (=141/(157*4)) 81.5858
	best keyword for cluster 6654642 is PF01970 with Jaccard = 0.9923	[ 129	0	1100081	1 ]	1.0000 0.9923

	SUGGESTING RELATEDNESS OF:   
	A> PF05145 ( PF05145   Putative ammonia monooxygenase )
	B> PF01970 ( PF01970   Integral membrane protein DUF112 )

	Only A has a clan ( CL0142.6 ).

	the two keywords coincide on UniRef90 proteins: |PF01970| = 130 ,	 |PF05145| = 65 , 	|PF01970^PF05145| = 1	( 0.8% and 1.5% )

	Neither PF05145 nor PF01970 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 135 ) 6667157_PF05206_PF05253            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF05206  is 6560464 with Jaccard = 1.0000	|PF05206|=34 [ 34	0	1100177	0 ]
 parent [ 6560464 ] : 6667157 0.236035 (=300/(41*31)) 84.757
  given [ 6560464 ] : 6560464 0.574324 (=85/(4*37)) 47.0756
	best keyword for cluster 6560464 is PF05206 with Jaccard = 1.0000	[ 34	0	1100177	0 ]	1.0000 1.0000
sibling [ 6560464 ] : 6588721 0.547619 (=92/(7*24)) 56.9234
	best keyword for cluster 6588721 is PF05253 with Jaccard = 0.9600	[ 24	0	1100186	1 ]	1.0000 0.9600

	SUGGESTING RELATEDNESS OF:   
	A> PF05206 ( PF05206   Methyltransferase TRM13 )
	B> PF05253 ( PF05253   Uncharacterised protein family (UPF0224) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF05206 nor PF05253 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 136 ) 6753721_PF02320_PF05254            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF05254  is 6522746 with Jaccard = 1.0000	|PF05254|=24 [ 24	0	1100187	0 ]
 parent [ 6522746 ] : 6753721 0.0168 (=21/(25*50)) 98.8861
  given [ 6522746 ] : 6522746 0.80303 (=53/(22*3)) 21.6183
	best keyword for cluster 6522746 is PF05254 with Jaccard = 1.0000	[ 24	0	1100187	0 ]	1.0000 1.0000
sibling [ 6522746 ] : 6743172 0.0204082 (=1/(1*49)) 98.0694
	best keyword for cluster 6743172 is PF02320 with Jaccard = 1.0000	[ 34	0	1100177	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF05254 ( PF05254   Uncharacterised protein family (UPF0203) )
	B> PF02320 ( PF02320   Ubiquinol-cytochrome C reductase hinge protein )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF05254 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 137 ) 6651126_PF05271_PF08112            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF05271  is 6015658 with Jaccard = 1.0000	|PF05271|=5 [ 5	0	1100206	0 ]
 parent [ 6015658 ] : 6651126 0.266667 (=8/(5*6)) 80.3787
  given [ 6015658 ] : 6015658 1 (=6/(3*2)) 8.3937e-33
	best keyword for cluster 6015658 is PF05271 with Jaccard = 1.0000	[ 5	0	1100206	0 ]	1.0000 1.0000
sibling [ 6015658 ] : 6593440 0.444444 (=4/(3*3)) 58.6222
	best keyword for cluster 6593440 is PF08112 with Jaccard = 0.7500	[ 3	1	1100207	0 ]	0.7500 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF05271 ( PF05271   Tobravirus 2B protein )
	B> PF08112 ( PF08112   ATP synthase epsilon subunit )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF05271 nor PF08112 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 138 ) 6749058_PF02713_PF05274            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF05274  is 6132517 with Jaccard = 1.0000	|PF05274|=16 [ 16	0	1100195	0 ]
 parent [ 6132517 ] : 6749058 0.0208333 (=10/(20*24)) 98.5477
  given [ 6132517 ] : 6132517 1 (=84/(6*14)) 9.29527e-23
	best keyword for cluster 6132517 is PF05274 with Jaccard = 1.0000	[ 16	0	1100195	0 ]	1.0000 1.0000
sibling [ 6132517 ] : 6706568 0.0875 (=7/(20*4)) 93.3975
	best keyword for cluster 6706568 is PF02713 with Jaccard = 1.0000	[ 16	0	1100195	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF05274 ( PF05274   Occlusion-derived virus envelope protein E25 )
	B> PF02713 ( PF02713   Domain of unknown function DUF220 )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF05274 nor PF02713 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 139 ) 6722425_PF00096_PF05281            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF05281  is 6438894 with Jaccard = 1.0000	|PF05281|=14 [ 14	0	1100197	0 ]
 parent [ 6438894 ] : 6722425 0.0447213 (=3349/(14*5349)) 95.7748
  given [ 6438894 ] : 6438894 1 (=49/(7*7)) 0.602526
	best keyword for cluster 6438894 is PF05281 with Jaccard = 1.0000	[ 14	0	1100197	0 ]	1.0000 1.0000
sibling [ 6438894 ] : 6722123 0.0569141 (=6368/(21*5328)) 95.7328
	best keyword for cluster 6722123 is PF00096 with Jaccard = 0.8219	[ 4237	269	1095056	649 ]	0.9403 0.8672

	SUGGESTING RELATEDNESS OF:   
	A> PF05281 ( PF05281   Neuroendocrine protein 7B2 precursor (Secretogranin V) )
	B> PF00096 ( PF00096   Zinc finger, C2H2 type )

	Neither A nor B are assigned a clan.

	the two keywords coincide on UniRef90 proteins: |PF00096| = 4886 ,	 |PF05281| = 14 , 	|PF00096^PF05281| = 1	( 0.0% and 7.1% )

	only  PF05281 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 140 ) 6650171_PF05289_PF06238            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF05289  is 6317166 with Jaccard = 1.0000	|PF05289|=5 [ 5	0	1100206	0 ]
 parent [ 6317166 ] : 6650171 0.2 (=6/(5*6)) 80.0339
  given [ 6317166 ] : 6317166 1 (=4/(1*4)) 2.50018e-08
	best keyword for cluster 6317166 is PF05289 with Jaccard = 1.0000	[ 5	0	1100206	0 ]	1.0000 1.0000
sibling [ 6317166 ] : 6548644 0.625 (=5/(2*4)) 37.991
	best keyword for cluster 6548644 is PF06238 with Jaccard = 1.0000	[ 4	0	1100207	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF05289 ( PF05289   Borrelia hemolysin accessory protein )
	B> PF06238 ( PF06238   Borrelia burgdorferi BBR25 lipoprotein )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF05289 nor PF06238 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 141 ) 6704853_PF00184_PF05294            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF05294  is 6516382 with Jaccard = 1.0000	|PF05294|=13 [ 13	0	1100198	0 ]
 parent [ 6516382 ] : 6704853 0.119048 (=130/(13*84)) 93.0722
  given [ 6516382 ] : 6516382 0.833333 (=10/(1*12)) 18.2556
	best keyword for cluster 6516382 is PF05294 with Jaccard = 1.0000	[ 13	0	1100198	0 ]	1.0000 1.0000
sibling [ 6516382 ] : 6691264 0.127458 (=188/(59*25)) 90.5119
	best keyword for cluster 6691264 is PF00184 with Jaccard = 0.7910	[ 53	14	1100144	0 ]	0.7910 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF05294 ( PF05294   Scorpion short toxin )
	B> PF00184 ( PF00184   Neurohypophysial hormones, C-terminal Domain )

	Only A has a clan ( CL0054.8 ).
	the two keywords do not coincide on UniRef90 proteins

	both PF05294 and PF00184 have PDB structures
		PF00184 	   b.9.1.1

	SUPERFAM mapping significantly overlapping:
     1	PF00184 SSF49606 0.854 (average over 98 mutual instances, PF00184 98 appearances, SSF49606 180 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 142 ) 6540203_PF05307_PF05946            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF05307  is 6035886 with Jaccard = 1.0000	|PF05307|=3 [ 3	0	1100208	0 ]
 parent [ 6035886 ] : 6540203 0.711111 (=32/(3*15)) 32.7208
  given [ 6035886 ] : 6035886 1 (=2/(1*2)) 5.01e-31
	best keyword for cluster 6035886 is PF05307 with Jaccard = 1.0000	[ 3	0	1100208	0 ]	1.0000 1.0000
sibling [ 6035886 ] : 6186738 1 (=36/(12*3)) 2.42136e-18
	best keyword for cluster 6186738 is PF05946 with Jaccard = 1.0000	[ 15	0	1100196	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF05307 ( PF05307   Bundlin )
	B> PF05946 ( PF05946   Toxin-coregulated pilus subunit TcpA )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	both PF05307 and PF05946 have PDB structures
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 143 ) 6635708_PF05310_PF07375            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF05310  is 6340435 with Jaccard = 1.0000	|PF05310|=6 [ 6	0	1100205	0 ]
 parent [ 6340435 ] : 6635708 0.3 (=9/(6*5)) 75.9464
  given [ 6340435 ] : 6340435 1 (=5/(1*5)) 1.012e-06
	best keyword for cluster 6340435 is PF05310 with Jaccard = 1.0000	[ 6	0	1100205	0 ]	1.0000 1.0000
sibling [ 6340435 ] : 6345958 1 (=4/(1*4)) 2.62735e-06
	best keyword for cluster 6345958 is PF07375 with Jaccard = 1.0000	[ 5	0	1100206	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF05310 ( PF05310   Tenuivirus NS-3 Protein )
	B> PF07375 ( PF07375   Tenuivirus PV2 protein )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF05310 nor PF07375 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 144 ) 6689042_PF05332_PF05752            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF05332  is 6450143 with Jaccard = 1.0000	|PF05332|=7 [ 7	0	1100204	0 ]
 parent [ 6450143 ] : 6689042 0.131579 (=20/(8*19)) 90.0725
  given [ 6450143 ] : 6450143 1 (=12/(2*6)) 1.32342
	best keyword for cluster 6450143 is PF05332 with Jaccard = 1.0000	[ 7	0	1100204	0 ]	1.0000 1.0000
sibling [ 6450143 ] : 6467496 0.966667 (=58/(15*4)) 3.34094
	best keyword for cluster 6467496 is PF05752 with Jaccard = 1.0000	[ 16	0	1100195	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF05332 ( PF05332   Protein of unknown function (DUF743) )
	B> PF05752 ( PF05752   Calicivirus minor structural protein )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF05332 nor PF05752 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 145 ) 6759213_PF05336_PF06271            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF05336  is 6667257 with Jaccard = 1.0000	|PF05336|=53 [ 53	0	1100158	0 ]
 parent [ 6667257 ] : 6759213 0.00814672 (=195/(68*352)) 99.2219
  given [ 6667257 ] : 6667257 0.159091 (=21/(2*66)) 84.7903
	best keyword for cluster 6667257 is PF05336 with Jaccard = 1.0000	[ 53	0	1100158	0 ]	1.0000 1.0000
sibling [ 6667257 ] : 6752146 0.016197 (=50/(343*9)) 98.7779
	best keyword for cluster 6752146 is PF06271 with Jaccard = 0.9350	[ 302	0	1099888	21 ]	1.0000 0.9350

	SUGGESTING RELATEDNESS OF:   
	A> PF05336 ( PF05336   Protein of unknown function (DUF718) )
	B> PF06271 ( PF06271   RDD family )

	Neither A nor B are assigned a clan.

	the two keywords coincide on UniRef90 proteins: |PF05336| = 53 ,	 |PF06271| = 323 , 	|PF05336^PF06271| = 1	( 1.9% and 0.3% )

	only  PF05336 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 146 ) 6743231_PF05394_PF07420            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF05394  is 6650416 with Jaccard = 1.0000	|PF05394|=6 [ 6	0	1100205	0 ]
 parent [ 6650416 ] : 6743231 0.0246914 (=2/(9*9)) 98.0741
  given [ 6650416 ] : 6650416 0.2 (=4/(5*4)) 80.1801
	best keyword for cluster 6650416 is PF05394 with Jaccard = 1.0000	[ 6	0	1100205	0 ]	1.0000 1.0000
sibling [ 6650416 ] : 6724602 0.0555556 (=1/(3*6)) 96.0556
	best keyword for cluster 6724602 is PF07420 with Jaccard = 1.0000	[ 3	0	1100208	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF05394 ( PF05394   Avirulence protein )
	B> PF07420 ( PF07420   Protein of unknown function (DUF1509) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF05394 has a PDB structure (may not be up to date)
	PF05394 	   e.45.1.1
	
	SUPERFAM mapping significantly overlapping:
     1	PF05394 SSF103383 0.859 (average over 16 mutual instances, PF05394 16 appearances, SSF103383 16 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 147 ) 6726011_PF05395_PF05781            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF05395  is 6518685 with Jaccard = 1.0000	|PF05395|=15 [ 15	0	1100196	0 ]
 parent [ 6518685 ] : 6726011 0.0431373 (=11/(17*15)) 96.2356
  given [ 6518685 ] : 6518685 0.826923 (=43/(4*13)) 19.7202
	best keyword for cluster 6518685 is PF05395 with Jaccard = 1.0000	[ 15	0	1100196	0 ]	1.0000 1.0000
sibling [ 6518685 ] : 6693097 0.111111 (=4/(3*12)) 90.8972
	best keyword for cluster 6693097 is PF05781 with Jaccard = 0.9000	[ 9	1	1100201	0 ]	0.9000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF05395 ( PF05395   Protein phosphatase inhibitor 1/DARPP-32 )
	B> PF05781 ( PF05781   MRVI1 protein )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF05395 nor PF05781 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 148 ) 6617884_PF05412_PF06460            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF05412  is 6392080 with Jaccard = 1.0000	|PF05412|=12 [ 12	0	1100199	0 ]
 parent [ 6392080 ] : 6617884 0.320513 (=100/(12*26)) 69.5986
  given [ 6392080 ] : 6392080 1 (=27/(3*9)) 0.0028837
	best keyword for cluster 6392080 is PF05412 with Jaccard = 1.0000	[ 12	0	1100199	0 ]	1.0000 1.0000
sibling [ 6392080 ] : 6575475 0.48 (=12/(1*25)) 52.36
	best keyword for cluster 6575475 is PF06460 with Jaccard = 0.8824	[ 15	2	1100194	0 ]	0.8824 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF05412 ( PF05412   Equine arterivirus Nsp2-type cysteine proteinase )
	B> PF06460 ( PF06460   Coronavirus NSP13 )

	A and B come from different clans ( CL0125.9 , CL0102.14 ).
	the two keywords do not coincide on UniRef90 proteins

	Neither PF05412 nor PF06460 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 149 ) 6744887_PF04736_PF05434            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF05434  is 6703363 with Jaccard = 1.0000	|PF05434|=20 [ 20	0	1100191	0 ]
 parent [ 6703363 ] : 6744887 0.0237154 (=6/(23*11)) 98.2253
  given [ 6703363 ] : 6703363 0.0789474 (=6/(19*4)) 92.7895
	best keyword for cluster 6703363 is PF05434 with Jaccard = 1.0000	[ 20	0	1100191	0 ]	1.0000 1.0000
sibling [ 6703363 ] : 6722383 0.0666667 (=2/(6*5)) 95.7667
	best keyword for cluster 6722383 is PF04736 with Jaccard = 1.0000	[ 6	0	1100205	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF05434 ( PF05434   TMEM9 )
	B> PF04736 ( PF04736   Eclosion hormone )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF05434 nor PF04736 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 150 ) 6681380_PF03045_PF05463            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF05463  is 6049508 with Jaccard = 1.0000	|PF05463|=6 [ 6	0	1100205	0 ]
 parent [ 6049508 ] : 6681380 0.149254 (=70/(7*67)) 88.5322
  given [ 6049508 ] : 6049508 1 (=10/(5*2)) 8.10209e-30
	best keyword for cluster 6049508 is PF05463 with Jaccard = 1.0000	[ 6	0	1100205	0 ]	1.0000 1.0000
sibling [ 6049508 ] : 6644402 0.241667 (=261/(27*40)) 78.2854
	best keyword for cluster 6644402 is PF03045 with Jaccard = 0.8293	[ 34	6	1100170	1 ]	0.8500 0.9714

	SUGGESTING RELATEDNESS OF:   
	A> PF05463 ( PF05463   Sclerostin (SOST) )
	B> PF03045 ( PF03045   DAN domain )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF05463 nor PF03045 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 151 ) 6776705_PF05305_PF05480            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF05480  is 6556842 with Jaccard = 1.0000	|PF05480|=14 [ 14	0	1100197	0 ]
 parent [ 6556842 ] : 6776705 0.00113636 (=1/(16*55)) 99.8948
  given [ 6556842 ] : 6556842 0.563636 (=31/(5*11)) 44.1308
	best keyword for cluster 6556842 is PF05480 with Jaccard = 1.0000	[ 14	0	1100197	0 ]	1.0000 1.0000
sibling [ 6556842 ] : 6709645 0.084 (=21/(5*50)) 93.8837
	best keyword for cluster 6709645 is PF05305 with Jaccard = 0.9556	[ 43	0	1100166	2 ]	1.0000 0.9556

	SUGGESTING RELATEDNESS OF:   
	A> PF05480 ( PF05480   Staphylococcus haemolytic protein )
	B> PF05305 ( PF05305   Protein of unknown function (DUF732) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF05480 nor PF05305 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 152 ) 6719944_PF03607_PF05517            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF05517  is 6684704 with Jaccard = 1.0000	|PF05517|=32 [ 32	0	1100179	0 ]
 parent [ 6684704 ] : 6719944 0.0510204 (=90/(36*49)) 95.4035
  given [ 6684704 ] : 6684704 0.121212 (=12/(3*33)) 89.2238
	best keyword for cluster 6684704 is PF05517 with Jaccard = 1.0000	[ 32	0	1100179	0 ]	1.0000 1.0000
sibling [ 6684704 ] : 6703704 0.0897436 (=35/(39*10)) 92.8475
	best keyword for cluster 6703704 is PF03607 with Jaccard = 0.7193	[ 41	0	1100154	16 ]	1.0000 0.7193

	SUGGESTING RELATEDNESS OF:   
	A> PF05517 ( PF05517   p25-alpha )
	B> PF03607 ( PF03607   Doublecortin )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	both PF05517 and PF03607 have PDB structures
		PF03607 	   d.15.11.1

	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 153 ) 6682824_PF05535_PF08138            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF05535  is 6413746 with Jaccard = 1.0000	|PF05535|=10 [ 10	0	1100201	0 ]
 parent [ 6413746 ] : 6682824 0.194805 (=15/(11*7)) 88.8987
  given [ 6413746 ] : 6413746 1 (=10/(1*10)) 0.0505
	best keyword for cluster 6413746 is PF05535 with Jaccard = 1.0000	[ 10	0	1100201	0 ]	1.0000 1.0000
sibling [ 6413746 ] : 6433919 1 (=6/(1*6)) 0.402167
	best keyword for cluster 6433919 is PF08138 with Jaccard = 0.7000	[ 7	0	1100201	3 ]	1.0000 0.7000

	SUGGESTING RELATEDNESS OF:   
	A> PF05535 ( PF05535   Chromadorea ALT protein )
	B> PF08138 ( PF08138   Sex peptide (SP) family )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF05535 nor PF08138 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 154 ) 6710840_PF02567_PF05544            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF05544  is 6538449 with Jaccard = 1.0000	|PF05544|=68 [ 68	0	1100143	0 ]
 parent [ 6538449 ] : 6710840 0.0746374 (=1477/(77*257)) 94.0709
  given [ 6538449 ] : 6538449 0.753333 (=113/(75*2)) 31.0755
	best keyword for cluster 6538449 is PF05544 with Jaccard = 1.0000	[ 68	0	1100143	0 ]	1.0000 1.0000
sibling [ 6538449 ] : 6641224 0.249012 (=252/(4*253)) 77.3737
	best keyword for cluster 6641224 is PF02567 with Jaccard = 0.9913	[ 229	0	1099980	2 ]	1.0000 0.9913

	SUGGESTING RELATEDNESS OF:   
	A> PF05544 ( PF05544   Proline racemase )
	B> PF02567 ( PF02567   Phenazine biosynthesis-like protein )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	both PF05544 and PF02567 have PDB structures
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 155 ) 6708421_PF05571_PF06775            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF05571  is 6150558 with Jaccard = 1.0000	|PF05571|=9 [ 9	0	1100202	0 ]
 parent [ 6150558 ] : 6708421 0.0634921 (=20/(9*35)) 93.7051
  given [ 6150558 ] : 6150558 1 (=14/(2*7)) 2.85714e-21
	best keyword for cluster 6150558 is PF05571 with Jaccard = 1.0000	[ 9	0	1100202	0 ]	1.0000 1.0000
sibling [ 6150558 ] : 6641201 0.227273 (=15/(2*33)) 77.3516
	best keyword for cluster 6641201 is PF06775 with Jaccard = 1.0000	[ 17	0	1100194	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF05571 ( PF05571   Protein of unknown function (DUF766) )
	B> PF06775 ( PF06775   Putative adipose-regulatory protein (Seipin) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF05571 nor PF06775 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 156 ) 6566290_PF05576_PF05577            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF05576  is 5974417 with Jaccard = 1.0000	|PF05576|=8 [ 8	0	1100203	0 ]
 parent [ 5974417 ] : 6566290 0.552174 (=508/(8*115)) 50.2355
  given [ 5974417 ] : 5974417 1 (=15/(3*5)) 1.536e-36
	best keyword for cluster 5974417 is PF05576 with Jaccard = 1.0000	[ 8	0	1100203	0 ]	1.0000 1.0000
sibling [ 5974417 ] : 6554294 0.59292 (=134/(2*113)) 42.1368
	best keyword for cluster 6554294 is PF05577 with Jaccard = 0.9823	[ 111	0	1100098	2 ]	1.0000 0.9823

	SUGGESTING RELATEDNESS OF:   
	A> PF05576 ( PF05576   PS-10 peptidase S37 )
	B> PF05577 ( PF05577   Serine carboxypeptidase S28 )

	they come from the same clan: CL0028.14 : PF05728	PF00975	PF07519	PF06850	PF07819	PF00326	PF05576	PF05577	PF02129	PF00450	PF02089	PF03403	PF03096	PF01764	PF01674	PF00151	PF03583	PF02450	PF03959	PF00756	PF06028	PF05990	PF05677	PF05057	PF04301	PF08538	PF07176	PF06821	PF06500	PF06342	PF06259	PF01738	PF01083	PF00135	PF07224	PF08840	PF05448	PF02273	PF08386	PF07859	PF02230	PF00561	PF06057
	the two keywords do not coincide on UniRef90 proteins

	Neither PF05576 nor PF05577 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 157 ) 6731085_PF05630_PF06101            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF05630  is 6526015 with Jaccard = 1.0000	|PF05630|=45 [ 45	0	1100166	0 ]
 parent [ 6526015 ] : 6731085 0.0451945 (=79/(46*38)) 96.842
  given [ 6526015 ] : 6526015 0.8 (=36/(1*45)) 23.6626
	best keyword for cluster 6526015 is PF05630 with Jaccard = 1.0000	[ 45	0	1100166	0 ]	1.0000 1.0000
sibling [ 6526015 ] : 6715143 0.0540541 (=2/(1*37)) 94.7459
	best keyword for cluster 6715143 is PF06101 with Jaccard = 0.8182	[ 9	0	1100200	2 ]	1.0000 0.8182

	SUGGESTING RELATEDNESS OF:   
	A> PF05630 ( PF05630   Necrosis inducing protein (NPP1) )
	B> PF06101 ( PF06101   Plant protein of unknown function (DUF946) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF05630 nor PF06101 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 158 ) 6706019_PF01253_PF05634            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF05634  is 5960817 with Jaccard = 1.0000	|PF05634|=8 [ 8	0	1100203	0 ]
 parent [ 5960817 ] : 6706019 0.0712366 (=106/(8*186)) 93.3152
  given [ 5960817 ] : 5960817 1 (=12/(2*6)) 8.75e-38
	best keyword for cluster 5960817 is PF05634 with Jaccard = 1.0000	[ 8	0	1100203	0 ]	1.0000 1.0000
sibling [ 5960817 ] : 6669782 0.151351 (=28/(1*185)) 85.4912
	best keyword for cluster 6669782 is PF01253 with Jaccard = 0.8255	[ 175	0	1099999	37 ]	1.0000 0.8255

	SUGGESTING RELATEDNESS OF:   
	A> PF05634 ( PF05634   Arabidopsis thaliana protein of unknown function (DUF794) )
	B> PF01253 ( PF01253   Translation initiation factor SUI1 )

	Neither A nor B are assigned a clan.

	the two keywords coincide on UniRef90 proteins: |PF01253| = 212 ,	 |PF05634| = 8 , 	|PF01253^PF05634| = 1	( 0.5% and 12.5% )

	only  PF05634 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:
     1	PF01253 SSF55159 0.759 (average over 596 mutual instances, PF01253 693 appearances, SSF55159 617 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 159 ) 6723742_PF00004_PF05673            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF05673  is 6372123 with Jaccard = 1.0000	|PF05673|=91 [ 91	0	1100120	0 ]
 parent [ 6372123 ] : 6723742 0.0576556 (=44628/(103*7515)) 95.9603
  given [ 6372123 ] : 6372123 1 (=102/(1*102)) 0.000150049
	best keyword for cluster 6372123 is PF05673 with Jaccard = 1.0000	[ 91	0	1100120	0 ]	1.0000 1.0000
sibling [ 6372123 ] : 6721874 0.0629668 (=73193/(158*7357)) 95.6861
	best keyword for cluster 6721874 is PF00004 with Jaccard = 0.6307	[ 4005	2206	1093861	139 ]	0.6448 0.9665

	SUGGESTING RELATEDNESS OF:   
	A> PF05673 ( PF05673   Protein of unknown function (DUF815) )
	B> PF00004 ( PF00004   ATPase family associated with various cellular activities (AAA) )

	Only B has a clan ( CL0023.26 ).

	the two keywords coincide on UniRef90 proteins: |PF00004| = 4144 ,	 |PF05673| = 91 , 	|PF00004^PF05673| = 9	( 0.2% and 9.9% )
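
The two percentages in parentheses above appear to be the shared protein count expressed as a share of each keyword's own set size, in the order the sizes are listed. This is a small sketch under that assumption; the function name is hypothetical.

```python
def overlap_percentages(size_first, size_second, shared):
    """Shared proteins as a percentage of each keyword's own set size."""
    return 100.0 * shared / size_first, 100.0 * shared / size_second


# |PF00004| = 4144, |PF05673| = 91, |PF00004^PF05673| = 9
pct_first, pct_second = overlap_percentages(4144, 91, 9)
print(f"( {pct_first:.1f}% and {pct_second:.1f}% )")  # ( 0.2% and 9.9% )
```

A low percentage on both sides, as here, means the two signatures rarely occur on the same protein, so the suggested relatedness rests on the cluster tree and not on co-occurrence.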

	only  PF05673 has a PDB structure (may not be up to date)
		PF00004 	   c.37.1.1  c.37.1.20

	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 160 ) 6605432_PF05733_PF06606            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF05733  is 6285061 with Jaccard = 1.0000	|PF05733|=7 [ 7	0	1100204	0 ]
 parent [ 6285061 ] : 6605432 0.380952 (=24/(7*9)) 64.286
  given [ 6285061 ] : 6285061 1 (=6/(1*6)) 1.02505e-10
	best keyword for cluster 6285061 is PF05733 with Jaccard = 1.0000	[ 7	0	1100204	0 ]	1.0000 1.0000
sibling [ 6285061 ] : 6518751 0.875 (=7/(1*8)) 19.7705
	best keyword for cluster 6518751 is PF06606 with Jaccard = 1.0000	[ 8	0	1100203	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF05733 ( PF05733   Tenuivirus nucleocapsid protein )
	B> PF06606 ( PF06606   Phlebovirus nucleocapsid (N) protein )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF05733 nor PF06606 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 161 ) 6736942_PF01345_PF05753            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF05753  is 6632333 with Jaccard = 1.0000	|PF05753|=21 [ 21	0	1100190	0 ]
 parent [ 6632333 ] : 6736942 0.0327277 (=263/(28*287)) 97.4789
  given [ 6632333 ] : 6632333 0.29932 (=44/(21*7)) 75.2152
	best keyword for cluster 6632333 is PF05753 with Jaccard = 1.0000	[ 21	0	1100190	0 ]	1.0000 1.0000
sibling [ 6632333 ] : 6712902 0.0714431 (=1409/(114*173)) 94.3948
	best keyword for cluster 6712902 is PF01345 with Jaccard = 0.6698	[ 71	11	1100105	24 ]	0.8659 0.7474

	SUGGESTING RELATEDNESS OF:   
	A> PF05753 ( PF05753   Translocon-associated protein beta (TRAPB) )
	B> PF01345 ( PF01345   Domain of unknown function DUF11 )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF05753 nor PF01345 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 162 ) 6745007_PF01016_PF05775            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF05775  is 6485662 with Jaccard = 1.0000	|PF05775|=9 [ 9	0	1100202	0 ]
 parent [ 6485662 ] : 6745007 0.0332551 (=85/(9*284)) 98.2352
  given [ 6485662 ] : 6485662 0.944444 (=17/(6*3)) 7.0526
	best keyword for cluster 6485662 is PF05775 with Jaccard = 1.0000	[ 9	0	1100202	0 ]	1.0000 1.0000
sibling [ 6485662 ] : 6734529 0.0363636 (=90/(275*9)) 97.2276
	best keyword for cluster 6734529 is PF01016 with Jaccard = 1.0000	[ 250	0	1099961	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF05775 ( PF05775   Enterobacteria AfaD invasin protein )
	B> PF01016 ( PF01016   Ribosomal L27 protein )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	both PF05775 and PF01016 have PDB structures
		
	SUPERFAM mapping significantly overlapping:
     1	PF01016 SSF110324 0.753 (average over 898 mutual instances, PF01016 898 appearances, SSF110324 904 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 163 ) 6552098_PF02723_PF05780            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF05780  is 6502172 with Jaccard = 1.0000	|PF05780|=6 [ 6	0	1100205	0 ]
 parent [ 6502172 ] : 6552098 0.75 (=18/(4*6)) 40.4417
  given [ 6502172 ] : 6502172 0.888889 (=8/(3*3)) 12.1179
	best keyword for cluster 6502172 is PF05780 with Jaccard = 1.0000	[ 6	0	1100205	0 ]	1.0000 1.0000
sibling [ 6502172 ] : 6429978 1 (=3/(1*3)) 0.281333
	best keyword for cluster 6429978 is PF02723 with Jaccard = 1.0000	[ 3	0	1100208	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF05780 ( PF05780   Coronavirus nonstructural protein 4 )
	B> PF02723 ( PF02723   Non-structural protein NS3/Small envelope protein E )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF05780 nor PF02723 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 164 ) 6680950_PF05796_PF07341            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF05796  is 6352508 with Jaccard = 1.0000	|PF05796|=12 [ 12	0	1100199	0 ]
 parent [ 6352508 ] : 6680950 0.145833 (=7/(12*4)) 88.4254
  given [ 6352508 ] : 6352508 1 (=27/(3*9)) 7.40753e-06
	best keyword for cluster 6352508 is PF05796 with Jaccard = 1.0000	[ 12	0	1100199	0 ]	1.0000 1.0000
sibling [ 6352508 ] : 6557024 0.666667 (=2/(1*3)) 44.3333
	best keyword for cluster 6557024 is PF07341 with Jaccard = 1.0000	[ 3	0	1100208	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF05796 ( PF05796   Chordopoxvirus protein G2 )
	B> PF07341 ( PF07341   Protein of unknown function (DUF1473) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF05796 nor PF07341 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 165 ) 6729369_PF05799_PF06212            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF05799  is 6512394 with Jaccard = 1.0000	|PF05799|=8 [ 8	0	1100203	0 ]
 parent [ 6512394 ] : 6729369 0.0603448 (=14/(8*29)) 96.6599
  given [ 6512394 ] : 6512394 0.857143 (=6/(1*7)) 16.6013
	best keyword for cluster 6512394 is PF05799 with Jaccard = 1.0000	[ 8	0	1100203	0 ]	1.0000 1.0000
sibling [ 6512394 ] : 6636798 0.269231 (=21/(3*26)) 76.1737
	best keyword for cluster 6636798 is PF06212 with Jaccard = 1.0000	[ 25	0	1100186	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF05799 ( PF05799   Cytochrome c oxidase subunit Vc (COX5C) )
	B> PF06212 ( PF06212   GRIM-19 protein )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF05799 nor PF06212 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 166 ) 6647322_PF05851_PF07401            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF05851  is 5997824 with Jaccard = 1.0000	|PF05851|=6 [ 6	0	1100205	0 ]
 parent [ 5997824 ] : 6647322 0.266667 (=24/(6*15)) 79.1272
  given [ 5997824 ] : 5997824 1 (=8/(4*2)) 2.0005e-34
	best keyword for cluster 5997824 is PF05851 with Jaccard = 1.0000	[ 6	0	1100205	0 ]	1.0000 1.0000
sibling [ 5997824 ] : 6545520 0.692308 (=18/(2*13)) 35.6278
	best keyword for cluster 6545520 is PF07401 with Jaccard = 1.0000	[ 2	0	1100209	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF05851 ( PF05851   Lentivirus virion infectivity factor (VIF) )
	B> PF07401 ( PF07401   Bovine Lentivirus VIF protein )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF05851 nor PF07401 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 167 ) 6749957_PF05873_PF08181            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF05873  is 6713494 with Jaccard = 1.0000	|PF05873|=20 [ 20	0	1100191	0 ]
 parent [ 6713494 ] : 6749957 0.0165094 (=14/(53*16)) 98.6159
  given [ 6713494 ] : 6713494 0.0602837 (=17/(6*47)) 94.4918
	best keyword for cluster 6713494 is PF05873 with Jaccard = 1.0000	[ 20	0	1100191	0 ]	1.0000 1.0000
sibling [ 6713494 ] : 6729805 0.0333333 (=2/(6*10)) 96.7035
	best keyword for cluster 6729805 is PF08181 with Jaccard = 1.0000	[ 2	0	1100209	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF05873 ( PF05873   ATP synthase D chain, mitochondrial (ATP5H) )
	B> PF08181 ( PF08181   DegQ (SacQ) family )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF05873 nor PF08181 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 168 ) 6509819_PF05722_PF05920            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF05920  is 6349015 with Jaccard = 1.0000	|PF05920|=13 [ 13	0	1100198	0 ]
 parent [ 6349015 ] : 6509819 0.912593 (=616/(25*27)) 15.2979
  given [ 6349015 ] : 6349015 1 (=84/(4*21)) 4.13007e-06
	best keyword for cluster 6349015 is PF05920 with Jaccard = 1.0000	[ 13	0	1100198	0 ]	1.0000 1.0000
sibling [ 6349015 ] : 6426982 1 (=92/(4*23)) 0.21135
	best keyword for cluster 6426982 is PF05722 with Jaccard = 1.0000	[ 23	0	1100188	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF05920 ( PF05920   Coprinus cinereus mating-type protein )
	B> PF05722 ( PF05722   Ustilago B locus mating-type protein )

	Only A has a clan ( CL0123.12 ).
	the two keywords do not coincide on UniRef90 proteins

	Neither PF05920 nor PF05722 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 169 ) 6757902_PF05947_PF06996            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF05947  is 6479870 with Jaccard = 1.0000	|PF05947|=94 [ 94	0	1100117	0 ]
 parent [ 6479870 ] : 6757902 0.00860086 (=86/(101*99)) 99.149
  given [ 6479870 ] : 6479870 0.945578 (=278/(3*98)) 5.65956
	best keyword for cluster 6479870 is PF05947 with Jaccard = 1.0000	[ 94	0	1100117	0 ]	1.0000 1.0000
sibling [ 6479870 ] : 6756624 0.0102041 (=1/(1*98)) 99.0684
	best keyword for cluster 6756624 is PF06996 with Jaccard = 0.9878	[ 81	0	1100129	1 ]	1.0000 0.9878
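
The three figures on a "best keyword" line follow from the bracketed counts [ TP FP TN FN ]. The sketch below recomputes them under one assumption that is not stated in the legend: the column labelled Specificity appears to be TP/(TP+FP) rather than TN/(TN+FP), since counts such as [ 7 1 1100203 0 ] are reported with 0.8750 in that column. `keyword_stats` is a hypothetical name used only for this check.

```python
def keyword_stats(tp, fp, tn, fn):
    """Recompute the per-keyword figures from the [ TP FP TN FN ] counts.

    Assumption: the report's "Specificity" column is TP / (TP + FP);
    the textbook TN / (TN + FP) is returned separately for comparison.
    """
    return {
        "jaccard": tp / (tp + fp + fn),
        "reported_specificity": tp / (tp + fp),   # assumed meaning of the column
        "sensitivity": tp / (tp + fn),
        "tn_based_specificity": tn / (tn + fp),   # not what the report prints
    }


# Counts from the PF06996 line: [ 81 0 1100129 1 ] -> 0.9878, 1.0000, 0.9878
stats = keyword_stats(81, 0, 1100129, 1)
assert round(stats["jaccard"], 4) == 0.9878
assert round(stats["reported_specificity"], 4) == 1.0
assert round(stats["sensitivity"], 4) == 0.9878
```

With FP = 0 the two readings of Specificity agree, so only records with FP > 0 can tell them apart.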

	SUGGESTING RELATEDNESS OF:   
	A> PF05947 ( PF05947   Bacterial protein of unknown function (DUF879) )
	B> PF06996 ( PF06996   Protein of unknown function (DUF1305) )

	Neither A nor B are assigned a clan.

	the two keywords coincide on UniRef90 proteins: |PF05947| = 94 ,	 |PF06996| = 82 , 	|PF05947^PF06996| = 1	( 1.1% and 1.2% )

	Neither PF05947 nor PF06996 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 170 ) 6672236_PF00600_PF05993            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF05993  is 6182456 with Jaccard = 1.0000	|PF05993|=7 [ 7	0	1100204	0 ]
 parent [ 6182456 ] : 6672236 0.236607 (=53/(7*32)) 86.1161
  given [ 6182456 ] : 6182456 1 (=12/(4*3)) 1.00034e-18
	best keyword for cluster 6182456 is PF05993 with Jaccard = 1.0000	[ 7	0	1100204	0 ]	1.0000 1.0000
sibling [ 6182456 ] : 6603505 0.677419 (=21/(1*31)) 63.4839
	best keyword for cluster 6603505 is PF00600 with Jaccard = 0.6829	[ 28	0	1100170	13 ]	1.0000 0.6829

	SUGGESTING RELATEDNESS OF:   
	A> PF05993 ( PF05993   Reovirus major virion structural protein Mu-1/Mu-1C (M2) )
	B> PF00600 ( PF00600   Influenza non-structural protein (NS1) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	both PF05993 and PF00600 have PDB structures
		PF00600 	   a.16.1.1

	SUPERFAM mapping significantly overlapping:
     1	PF05993 SSF69908 0.988 (average over 30 mutual instances, PF05993 30 appearances, SSF69908 30 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 171 ) 6597138_PF05994_PF07159            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF05994  is 6577545 with Jaccard = 1.0000	|PF05994|=18 [ 18	0	1100193	0 ]
 parent [ 6577545 ] : 6597138 0.45614 (=182/(19*21)) 60.3451
  given [ 6577545 ] : 6577545 0.5 (=10/(1*20)) 52.9755
	best keyword for cluster 6577545 is PF05994 with Jaccard = 1.0000	[ 18	0	1100193	0 ]	1.0000 1.0000
sibling [ 6577545 ] : 6340607 1 (=48/(3*16)) 1.04167e-06
	best keyword for cluster 6340607 is PF07159 with Jaccard = 0.9500	[ 19	0	1100191	1 ]	1.0000 0.9500

	SUGGESTING RELATEDNESS OF:   
	A> PF05994 ( PF05994   Cytoplasmic Fragile-X interacting family )
	B> PF07159 ( PF07159   Protein of unknown function (DUF1394) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF05994 nor PF07159 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 172 ) 6749622_PF05996_PF06405            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF05996  is 6486543 with Jaccard = 1.0000	|PF05996|=52 [ 52	0	1100159	0 ]
 parent [ 6486543 ] : 6749622 0.0196078 (=18/(54*17)) 98.5927
  given [ 6486543 ] : 6486543 0.935185 (=606/(18*36)) 7.28387
	best keyword for cluster 6486543 is PF05996 with Jaccard = 1.0000	[ 52	0	1100159	0 ]	1.0000 1.0000
sibling [ 6486543 ] : 6728289 0.0416667 (=3/(9*8)) 96.5167
	best keyword for cluster 6728289 is PF06405 with Jaccard = 0.8750	[ 7	1	1100203	0 ]	0.8750 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF05996 ( PF05996   Ferredoxin-dependent bilin reductase )
	B> PF06405 ( PF06405   Red chlorophyll catabolite reductase (RCC reductase) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF05996 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 173 ) 6747907_PF06075_PF08528            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF06075  is 6639802 with Jaccard = 1.0000	|PF06075|=14 [ 14	0	1100197	0 ]
 parent [ 6639802 ] : 6747907 0.0208333 (=4/(16*12)) 98.4599
  given [ 6639802 ] : 6639802 0.230769 (=9/(13*3)) 76.9887
	best keyword for cluster 6639802 is PF06075 with Jaccard = 1.0000	[ 14	0	1100197	0 ]	1.0000 1.0000
sibling [ 6639802 ] : 6703520 0.0909091 (=1/(1*11)) 92.8182
	best keyword for cluster 6703520 is PF08528 with Jaccard = 0.8182	[ 9	0	1100200	2 ]	1.0000 0.8182

	SUGGESTING RELATEDNESS OF:   
	A> PF06075 ( PF06075   Plant protein of unknown function (DUF936) )
	B> PF08528 ( PF08528   Whi5 like )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF06075 nor PF08528 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 174 ) 6774560_PF03257_PF06099            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF06099  is 6604123 with Jaccard = 1.0000	|PF06099|=19 [ 19	0	1100192	0 ]
 parent [ 6604123 ] : 6774560 0.00241109 (=4/(21*79)) 99.8451
  given [ 6604123 ] : 6604123 0.5 (=10/(1*20)) 63.5127
	best keyword for cluster 6604123 is PF06099 with Jaccard = 1.0000	[ 19	0	1100192	0 ]	1.0000 1.0000
sibling [ 6604123 ] : 6756664 0.0102041 (=15/(30*49)) 99.0709
	best keyword for cluster 6756664 is PF03257 with Jaccard = 0.6087	[ 14	9	1100188	0 ]	0.6087 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF06099 ( PF06099   Phenol hydroxylase subunit )
	B> PF03257 ( PF03257   Mycoplasma adhesin P1 )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF06099 nor PF03257 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 175 ) 6733367_PF00015_PF06103            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF06103  is 6726004 with Jaccard = 1.0000	|PF06103|=33 [ 33	0	1100178	0 ]
 parent [ 6726004 ] : 6733367 0.042611 (=12364/(72*4030)) 97.0938
  given [ 6726004 ] : 6726004 0.0471154 (=49/(20*52)) 96.2351
	best keyword for cluster 6726004 is PF06103 with Jaccard = 1.0000	[ 33	0	1100178	0 ]	1.0000 1.0000
sibling [ 6726004 ] : 6732672 0.0379158 (=1980/(13*4017)) 97.0101
	best keyword for cluster 6732672 is PF00015 with Jaccard = 0.8611	[ 2735	412	1097035	29 ]	0.8691 0.9895

	SUGGESTING RELATEDNESS OF:   
	A> PF06103 ( PF06103   Bacterial protein of unknown function (DUF948) )
	B> PF00015 ( PF00015   Methyl-accepting chemotaxis protein (MCP) signaling domain )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF06103 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:
     1	PF06103 SSF47954 0.819 (average over 1 mutual instances, PF06103 1 appearances, SSF47954 3885 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 176 ) 6612532_PF06147_PF06914            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF06147  is 6557065 with Jaccard = 1.0000	|PF06147|=14 [ 14	0	1100197	0 ]
 parent [ 6557065 ] : 6612532 0.344444 (=62/(12*15)) 67.5446
  given [ 6557065 ] : 6557065 0.571429 (=8/(1*14)) 44.3738
	best keyword for cluster 6557065 is PF06147 with Jaccard = 1.0000	[ 14	0	1100197	0 ]	1.0000 1.0000
sibling [ 6557065 ] : 6499254 0.888889 (=24/(9*3)) 11.1112
	best keyword for cluster 6499254 is PF06914 with Jaccard = 1.0000	[ 10	0	1100201	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF06147 ( PF06147   Protein of unknown function (DUF968) )
	B> PF06914 ( PF06914   Protein of unknown function (DUF1277) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF06147 nor PF06914 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 177 ) 6732946_PF00543_PF06153            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF06153  is 6462482 with Jaccard = 1.0000	|PF06153|=31 [ 31	0	1100180	0 ]
 parent [ 6462482 ] : 6732946 0.0434851 (=550/(34*372)) 97.0461
  given [ 6462482 ] : 6462482 0.975 (=117/(4*30)) 2.61774
	best keyword for cluster 6462482 is PF06153 with Jaccard = 1.0000	[ 31	0	1100180	0 ]	1.0000 1.0000
sibling [ 6462482 ] : 6679748 0.14462 (=1301/(26*346)) 88.1122
	best keyword for cluster 6679748 is PF00543 with Jaccard = 0.9724	[ 317	0	1099885	9 ]	1.0000 0.9724

	SUGGESTING RELATEDNESS OF:   
	A> PF06153 ( PF06153   Protein of unknown function (DUF970) )
	B> PF00543 ( PF00543   Nitrogen regulatory protein P-II )

	they come from the same clan: CL0089.8 : PF08029	PF06153	PF02641	PF03091	PF00543

	the two keywords coincide on UniRef90 proteins: |PF00543| = 326 ,	 |PF06153| = 31 , 	|PF00543^PF06153| = 1	( 0.3% and 3.2% )

	only  PF06153 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:
     1	PF00543 SSF54913 0.911 (average over 1190 mutual instances, PF00543 1203 appearances, SSF54913 2763 appearances)
     2	PF06153 SSF54913 0.994 (average over 106 mutual instances, PF06153 106 appearances, SSF54913 2763 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 178 ) 6740956_PF06157_PF06195            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF06157  is 6704726 with Jaccard = 1.0000	|PF06157|=23 [ 23	0	1100188	0 ]
 parent [ 6704726 ] : 6740956 0.0324074 (=28/(36*24)) 97.8744
  given [ 6704726 ] : 6704726 0.0774194 (=12/(31*5)) 93.0321
	best keyword for cluster 6704726 is PF06157 with Jaccard = 1.0000	[ 23	0	1100188	0 ]	1.0000 1.0000
sibling [ 6704726 ] : 6718524 0.0555556 (=6/(6*18)) 95.2176
	best keyword for cluster 6718524 is PF06195 with Jaccard = 1.0000	[ 11	0	1100200	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF06157 ( PF06157   Protein of unknown function (DUF973) )
	B> PF06195 ( PF06195   Protein of unknown function (DUF996) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF06157 nor PF06195 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 179 ) 6662790_PF01470_PF06162            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF06162  is 6347970 with Jaccard = 1.0000	|PF06162|=11 [ 11	0	1100200	0 ]
 parent [ 6347970 ] : 6662790 0.176033 (=213/(11*110)) 83.8457
  given [ 6347970 ] : 6347970 1 (=28/(4*7)) 3.59592e-06
	best keyword for cluster 6347970 is PF06162 with Jaccard = 1.0000	[ 11	0	1100200	0 ]	1.0000 1.0000
sibling [ 6347970 ] : 6571430 0.522885 (=377/(7*103)) 51.3244
	best keyword for cluster 6571430 is PF01470 with Jaccard = 0.9888	[ 88	0	1100122	1 ]	1.0000 0.9888

	SUGGESTING RELATEDNESS OF:   
	A> PF06162 ( PF06162   Caenorhabditis elegans protein of unknown function (DUF976) )
	B> PF01470 ( PF01470   Pyroglutamyl peptidase )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF06162 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:
     1	PF06162 SSF53182 0.639 (average over 8 mutual instances, PF06162 8 appearances, SSF53182 285 appearances)
     2	PF01470 SSF53182 0.943 (average over 276 mutual instances, PF01470 277 appearances, SSF53182 285 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 180 ) 6673923_PF01903_PF06180            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF06180  is 6343254 with Jaccard = 1.0000	|PF06180|=28 [ 28	0	1100183	0 ]
 parent [ 6343254 ] : 6673923 0.170699 (=889/(31*168)) 86.6456
  given [ 6343254 ] : 6343254 1 (=150/(6*25)) 1.79778e-06
	best keyword for cluster 6343254 is PF06180 with Jaccard = 1.0000	[ 28	0	1100183	0 ]	1.0000 1.0000
sibling [ 6343254 ] : 6671378 0.186503 (=152/(5*163)) 85.8541
	best keyword for cluster 6671378 is PF01903 with Jaccard = 0.9313	[ 149	0	1100051	11 ]	1.0000 0.9313

	SUGGESTING RELATEDNESS OF:   
	A> PF06180 ( PF06180   Cobalt chelatase (CbiK) )
	B> PF01903 ( PF01903   CbiX )

	they come from the same clan: CL0043.7 : PF06180	PF01903	PF00762
	the two keywords do not coincide on UniRef90 proteins

	both PF06180 and PF01903 have PDB structures
	PF06180 	   c.92.1.2
	PF01903 	   c.92.1.3

	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 181 ) 6607810_PF06193_PF06909            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF06193  is 5955101 with Jaccard = 1.0000	|PF06193|=3 [ 3	0	1100208	0 ]
 parent [ 5955101 ] : 6607810 0.416667 (=10/(3*8)) 65.703
  given [ 5955101 ] : 5955101 1 (=2/(1*2)) 2.5005e-38
	best keyword for cluster 5955101 is PF06193 with Jaccard = 1.0000	[ 3	0	1100208	0 ]	1.0000 1.0000
sibling [ 5955101 ] : 6545869 0.75 (=9/(2*6)) 35.9787
	best keyword for cluster 6545869 is PF06909 with Jaccard = 1.0000	[ 6	0	1100205	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF06193 ( PF06193   Orthopoxvirus A5L protein )
	B> PF06909 ( PF06909   Protein of unknown function (DUF1274) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF06193 nor PF06909 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 182 ) 6762134_PF06210_PF07300            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF06210  is 6512231 with Jaccard = 1.0000	|PF06210|=53 [ 53	0	1100158	0 ]
 parent [ 6512231 ] : 6762134 0.00907945 (=36/(61*65)) 99.3776
  given [ 6512231 ] : 6512231 0.846698 (=359/(53*8)) 16.4585
	best keyword for cluster 6512231 is PF06210 with Jaccard = 1.0000	[ 53	0	1100158	0 ]	1.0000 1.0000
sibling [ 6512231 ] : 6759162 0.015625 (=1/(1*64)) 99.2188
	best keyword for cluster 6759162 is PF07300 with Jaccard = 0.8000	[ 28	7	1100176	0 ]	0.8000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF06210 ( PF06210   Protein of unknown function (DUF1003) )
	B> PF07300 ( PF07300   Protein of unknown function (DUF1452) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF06210 nor PF07300 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 183 ) 6707568_PF04195_PF06217            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF06217  is 6192842 with Jaccard = 1.0000	|PF06217|=12 [ 12	0	1100199	0 ]
 parent [ 6192842 ] : 6707568 0.0710744 (=559/(13*605)) 93.5585
  given [ 6192842 ] : 6192842 1 (=40/(5*8)) 7.63296e-18
	best keyword for cluster 6192842 is PF06217 with Jaccard = 1.0000	[ 12	0	1100199	0 ]	1.0000 1.0000
sibling [ 6192842 ] : 6690466 0.0978441 (=118/(2*603)) 90.3542
	best keyword for cluster 6690466 is PF04195 with Jaccard = 0.8333	[ 305	58	1099845	3 ]	0.8402 0.9903

	SUGGESTING RELATEDNESS OF:   
	A> PF06217 ( PF06217   GAGA binding protein-like family )
	B> PF04195 ( PF04195   Putative gypsy type transposon )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF06217 nor PF04195 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 184 ) 6669712_PF06229_PF06268            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF06229  is 6538480 with Jaccard = 1.0000	|PF06229|=22 [ 22	0	1100189	0 ]
 parent [ 6538480 ] : 6669712 0.19496 (=147/(26*29)) 85.4409
  given [ 6538480 ] : 6538480 0.715152 (=118/(15*11)) 31.1103
	best keyword for cluster 6538480 is PF06229 with Jaccard = 1.0000	[ 22	0	1100189	0 ]	1.0000 1.0000
sibling [ 6538480 ] : 6645488 0.275362 (=38/(6*23)) 78.6531
	best keyword for cluster 6645488 is PF06268 with Jaccard = 0.9524	[ 20	0	1100190	1 ]	1.0000 0.9524

	SUGGESTING RELATEDNESS OF:   
	A> PF06229 ( PF06229   FRG1-like family )
	B> PF06268 ( PF06268   Fascin domain )

	they come from the same clan: CL0066.9 : PF00652	PF02815	PF00197	PF00340	PF06229	PF00167	PF06268	PF04601	PF03498	PF05588	PF07468	PF05270	PF07951
	the two keywords do not coincide on UniRef90 proteins

	only  PF06229 has a PDB structure (may not be up to date)
		PF06268 	   b.42.5.1

	SUPERFAM mapping significantly overlapping:
     1	PF06229 SSF50405 0.549 (average over 42 mutual instances, PF06229 42 appearances, SSF50405 195 appearances)
     2	PF06268 SSF50405 0.748 (average over 53 mutual instances, PF06268 53 appearances, SSF50405 195 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 185 ) 6764362_PF00223_PF06234            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF06234  is 6551601 with Jaccard = 1.0000	|PF06234|=10 [ 10	0	1100201	0 ]
 parent [ 6551601 ] : 6764362 0.0135895 (=29/(11*194)) 99.4822
  given [ 6551601 ] : 6551601 0.6 (=6/(1*10)) 40.0001
	best keyword for cluster 6551601 is PF06234 with Jaccard = 1.0000	[ 10	0	1100201	0 ]	1.0000 1.0000
sibling [ 6551601 ] : 6760531 0.00903614 (=42/(28*166)) 99.2922
	best keyword for cluster 6760531 is PF00223 with Jaccard = 1.0000	[ 116	0	1100095	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF06234 ( PF06234   Toluene-4-monooxygenase system protein B (TmoB) )
	B> PF00223 ( PF00223   Photosystem I psaA/psaB protein )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	both PF06234 and PF00223 have PDB structures
		
	SUPERFAM mapping significantly overlapping:
     1	PF06234 SSF110814 0.975 (average over 21 mutual instances, PF06234 21 appearances, SSF110814 21 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 186 ) 6750698_PF05682_PF06243            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF06243  is 6527124 with Jaccard = 1.0000	|PF06243|=37 [ 37	0	1100174	0 ]
 parent [ 6527124 ] : 6750698 0.0200846 (=38/(43*44)) 98.6696
  given [ 6527124 ] : 6527124 0.841667 (=101/(3*40)) 24.2872
	best keyword for cluster 6527124 is PF06243 with Jaccard = 1.0000	[ 37	0	1100174	0 ]	1.0000 1.0000
sibling [ 6527124 ] : 6748255 0.0232558 (=1/(1*43)) 98.4884
	best keyword for cluster 6748255 is PF05682 with Jaccard = 1.0000	[ 37	0	1100174	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF06243 ( PF06243   Phenylacetic acid degradation B )
	B> PF05682 ( PF05682   Phosphorylase kinase alpha/beta )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF06243 nor PF05682 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 187 ) 6611985_PF06285_PF07190            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF06285  is 5897495 with Jaccard = 1.0000	|PF06285|=2 [ 2	0	1100209	0 ]
 parent [ 5897495 ] : 6611985 0.333333 (=2/(2*3)) 67.5
  given [ 5897495 ] : 5897495 1 (=1/(1*1)) 7e-44
	best keyword for cluster 5897495 is PF06285 with Jaccard = 1.0000	[ 2	0	1100209	0 ]	1.0000 1.0000
sibling [ 5897495 ] : 6561939 1 (=2/(1*2)) 48.5
	best keyword for cluster 6561939 is PF07190 with Jaccard = 1.0000	[ 2	0	1100209	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF06285 ( PF06285   Protein of unknown function (DUF1038) )
	B> PF07190 ( PF07190   Protein of unknown function (DUF1406) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither  PF06285 have structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 188 ) 6687592_PF06304_PF06570            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF06304  is 6373256 with Jaccard = 1.0000	|PF06304|=8 [ 8	0	1100203	0 ]
 parent [ 6373256 ] : 6687592 0.12549 (=64/(10*51)) 89.7545
  given [ 6373256 ] : 6373256 1 (=21/(3*7)) 0.000186907
	best keyword for cluster 6373256 is PF06304 with Jaccard = 1.0000	[ 8	0	1100203	0 ]	1.0000 1.0000
sibling [ 6373256 ] : 6665704 0.194444 (=105/(15*36)) 84.4283
	best keyword for cluster 6665704 is PF06570 with Jaccard = 1.0000	[ 11	0	1100200	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF06304 ( PF06304   Protein of unknown function (DUF1048) )
	B> PF06570 ( PF06570   Protein of unknown function (DUF1129) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither  PF06304 have structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 189 ) 6682268_PF03313_PF06354            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF06354  is 6637613 with Jaccard = 1.0000	|PF06354|=37 [ 37	0	1100174	0 ]
 parent [ 6637613 ] : 6682268 0.137195 (=1260/(224*41)) 88.764
  given [ 6637613 ] : 6637613 0.236842 (=27/(38*3)) 76.3604
	best keyword for cluster 6637613 is PF06354 with Jaccard = 1.0000	[ 37	0	1100174	0 ]	1.0000 1.0000
sibling [ 6637613 ] : 6537747 0.697079 (=5799/(47*177)) 30.7838
	best keyword for cluster 6537747 is PF03313 with Jaccard = 0.7871	[ 159	43	1100009	0 ]	0.7871 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF06354 ( PF06354   Protein of unknown function (DUF1063) )
	B> PF03313 ( PF03313   Serine dehydratase alpha chain )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither  PF06354 have structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 190 ) 6605478_PF06357_PF08087            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF06357  is 6407369 with Jaccard = 1.0000	|PF06357|=4 [ 4	0	1100207	0 ]
 parent [ 6407369 ] : 6605478 0.464286 (=26/(4*14)) 64.3339
  given [ 6407369 ] : 6407369 1 (=3/(1*3)) 0.0223333
	best keyword for cluster 6407369 is PF06357 with Jaccard = 1.0000	[ 4	0	1100207	0 ]	1.0000 1.0000
sibling [ 6407369 ] : 6473895 1 (=13/(1*13)) 4.40817
	best keyword for cluster 6473895 is PF08087 with Jaccard = 0.7368	[ 14	0	1100192	5 ]	1.0000 0.7368

	SUGGESTING RELATEDNESS OF:   
	A> PF06357 ( PF06357   Omega-atracotoxin )
	B> PF08087 ( PF08087   Conotoxin O-superfamily )

	Only A has a clan ( CL0083.9 ).
	the two keywords do not coincide on UniRef90 proteins

	only  PF06357 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 191 ) 6567383_PF06358_PF06716            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF06358  is 5919364 with Jaccard = 1.0000	|PF06358|=2 [ 2	0	1100209	0 ]
 parent [ 5919364 ] : 6567383 0.5 (=2/(2*2)) 50.45
  given [ 5919364 ] : 5919364 1 (=1/(1*1)) 1e-41
	best keyword for cluster 5919364 is PF06358 with Jaccard = 1.0000	[ 2	0	1100209	0 ]	1.0000 1.0000
sibling [ 5919364 ] : 6160596 1 (=1/(1*1)) 2e-20
	best keyword for cluster 6160596 is PF06716 with Jaccard = 1.0000	[ 2	0	1100209	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF06358 ( PF06358   Protein of unknown function (DUF1065) )
	B> PF06716 ( PF06716   Protein of unknown function (DUF1201) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither  PF06358 have structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 192 ) 6714912_PF01027_PF06539            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF06539  is 6626501 with Jaccard = 1.0000	|PF06539|=32 [ 32	0	1100179	0 ]
 parent [ 6626501 ] : 6714912 0.0648471 (=929/(38*377)) 94.7092
  given [ 6626501 ] : 6626501 0.277778 (=20/(2*36)) 73.0771
	best keyword for cluster 6626501 is PF06539 with Jaccard = 1.0000	[ 32	0	1100179	0 ]	1.0000 1.0000
sibling [ 6626501 ] : 6694175 0.0953654 (=107/(3*374)) 91.1356
	best keyword for cluster 6694175 is PF01027 with Jaccard = 0.8990	[ 276	0	1099904	31 ]	1.0000 0.8990

	SUGGESTING RELATEDNESS OF:   
	A> PF06539 ( PF06539   Protein of unknown function (DUF1112) )
	B> PF01027 ( PF01027   Uncharacterised protein family UPF0005 )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither  PF06539 have structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 193 ) 6593602_PF04258_PF06550            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF06550  is 6242234 with Jaccard = 1.0000	|PF06550|=14 [ 14	0	1100197	0 ]
 parent [ 6242234 ] : 6593602 0.486555 (=579/(14*85)) 58.7706
  given [ 6242234 ] : 6242234 1 (=33/(3*11)) 6.20457e-14
	best keyword for cluster 6242234 is PF06550 with Jaccard = 1.0000	[ 14	0	1100197	0 ]	1.0000 1.0000
sibling [ 6242234 ] : 6562687 0.542169 (=90/(2*83)) 49.0988
	best keyword for cluster 6562687 is PF04258 with Jaccard = 0.9651	[ 83	0	1100125	3 ]	1.0000 0.9651

	SUGGESTING RELATEDNESS OF:   
	A> PF06550 ( PF06550   Protein of unknown function (DUF1119) )
	B> PF04258 ( PF04258   Signal peptide peptidase )

	they come from the same clan: CL0130.6 : PF06550	PF04258	PF01478	PF01080
	the two keywords do not coincide on UniRef90 proteins

	only  PF06550 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 194 ) 6562682_PF02502_PF06562            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF06562  is 6137323 with Jaccard = 1.0000	|PF06562|=17 [ 17	0	1100194	0 ]
 parent [ 6137323 ] : 6562682 0.579258 (=2653/(20*229)) 49.0913
  given [ 6137323 ] : 6137323 1 (=51/(17*3)) 2.18261e-22
	best keyword for cluster 6137323 is PF06562 with Jaccard = 1.0000	[ 17	0	1100194	0 ]	1.0000 1.0000
sibling [ 6137323 ] : 6494884 0.923333 (=831/(4*225)) 9.71686
	best keyword for cluster 6494884 is PF02502 with Jaccard = 0.9810	[ 206	0	1100001	4 ]	1.0000 0.9810

	SUGGESTING RELATEDNESS OF:   
	A> PF06562 (  )
	B> PF02502 ( PF02502   Ribose/Galactose Isomerase )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF06562 has a PDB structure (may not be up to date)
		PF02502 	   c.121.1.1

	SUPERFAM mapping significantly overlapping:
     1	PF02502 SSF89623 0.954 (average over 779 mutual instances, PF02502 788 appearances, SSF89623 793 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 195 ) 6643833_PF03027_PF06585            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF06585  is 6584814 with Jaccard = 1.0000	|PF06585|=4 [ 4	0	1100207	0 ]
 parent [ 6584814 ] : 6643833 0.254795 (=93/(5*73)) 78.111
  given [ 6584814 ] : 6584814 0.5 (=3/(3*2)) 55.451
	best keyword for cluster 6584814 is PF06585 with Jaccard = 1.0000	[ 4	0	1100207	0 ]	1.0000 1.0000
sibling [ 6584814 ] : 6640910 0.233333 (=49/(70*3)) 77.3018
	best keyword for cluster 6640910 is PF03027 with Jaccard = 0.8875	[ 71	0	1100131	9 ]	1.0000 0.8875

	SUGGESTING RELATEDNESS OF:   
	A> PF06585 ( PF06585   Haemolymph juvenile hormone binding protein (JHBP) )
	B> PF03027 ( PF03027   Odorant binding protein )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither  PF06585 have structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 196 ) 6536308_PF04706_PF06607            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF06607  is 6518161 with Jaccard = 1.0000	|PF06607|=15 [ 15	0	1100196	0 ]
 parent [ 6518161 ] : 6536308 0.713555 (=279/(23*17)) 29.827
  given [ 6518161 ] : 6518161 0.9375 (=15/(1*16)) 19.2516
	best keyword for cluster 6518161 is PF06607 with Jaccard = 1.0000	[ 15	0	1100196	0 ]	1.0000 1.0000
sibling [ 6518161 ] : 6505222 0.904762 (=38/(2*21)) 13.3688
	best keyword for cluster 6505222 is PF04706 with Jaccard = 0.9500	[ 19	0	1100191	1 ]	1.0000 0.9500

	SUGGESTING RELATEDNESS OF:   
	A> PF06607 ( PF06607   Prokineticin )
	B> PF04706 ( PF04706   Dickkopf N-terminal cysteine-rich region )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF06607 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:
     1	PF04706 SSF57027 0.529 (average over 1 mutual instances, PF04706 1 appearances, SSF57027 43 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 197 ) 6660579_PF05258_PF06647            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF06647  is 6537173 with Jaccard = 1.0000	|PF06647|=28 [ 28	0	1100183	0 ]
 parent [ 6537173 ] : 6660579 0.199001 (=677/(42*81)) 83.4517
  given [ 6537173 ] : 6537173 0.730556 (=263/(30*12)) 30.2164
	best keyword for cluster 6537173 is PF06647 with Jaccard = 1.0000	[ 28	0	1100183	0 ]	1.0000 1.0000
sibling [ 6537173 ] : 6650307 0.225 (=18/(1*80)) 80.1135
	best keyword for cluster 6650307 is PF05258 with Jaccard = 0.7353	[ 25	0	1100177	9 ]	1.0000 0.7353

	SUGGESTING RELATEDNESS OF:   
	A> PF06647 ( PF06647   Protein of unknown function (DUF1159) )
	B> PF05258 ( PF05258   Protein of unknown function (DUF721) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither  PF06647 have structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 198 ) 6768259_PF01310_PF06649            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF06649  is 6467462 with Jaccard = 1.0000	|PF06649|=18 [ 18	0	1100193	0 ]
 parent [ 6467462 ] : 6768259 0.00414079 (=4/(21*46)) 99.6492
  given [ 6467462 ] : 6467462 0.977778 (=88/(6*15)) 3.33503
	best keyword for cluster 6467462 is PF06649 with Jaccard = 1.0000	[ 18	0	1100193	0 ]	1.0000 1.0000
sibling [ 6467462 ] : 6753521 0.0133929 (=6/(14*32)) 98.8723
	best keyword for cluster 6753521 is PF01310 with Jaccard = 1.0000	[ 30	0	1100181	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF06649 ( PF06649   Protein of unknown function (DUF1161) )
	B> PF01310 ( PF01310   Adenovirus hexon associated protein, protein VIII )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither  PF06649 have structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 199 ) 6702845_PF02810_PF06685            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF06685  is 6348829 with Jaccard = 1.0000	|PF06685|=7 [ 7	0	1100204	0 ]
 parent [ 6348829 ] : 6702845 0.0765521 (=381/(7*711)) 92.6708
  given [ 6348829 ] : 6348829 1 (=6/(1*6)) 4.00032e-06
	best keyword for cluster 6348829 is PF06685 with Jaccard = 1.0000	[ 7	0	1100204	0 ]	1.0000 1.0000
sibling [ 6348829 ] : 6700055 0.0834515 (=353/(6*705)) 92.1962
	best keyword for cluster 6700055 is PF02810 with Jaccard = 0.6573	[ 374	160	1099642	35 ]	0.7004 0.9144

	SUGGESTING RELATEDNESS OF:   
	A> PF06685 ( PF06685   Protein of unknown function (DUF1186) )
	B> PF02810 ( PF02810   SEC-C motif )

	Neither A nor B are assigned a clan.

	the two keywords coincide on UniRef90 proteins: |PF02810| = 409 ,	 |PF06685| = 7 , 	|PF02810^PF06685| = 1	( 0.2% and 14.3% )

	only  PF06685 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 200 ) 6725303_PF00999_PF06826            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF06826  is 6428778 with Jaccard = 1.0000	|PF06826|=63 [ 63	0	1100148	0 ]
 parent [ 6428778 ] : 6725303 0.0520848 (=6683/(70*1833)) 96.1477
  given [ 6428778 ] : 6428778 0.997533 (=1213/(32*38)) 0.251136
	best keyword for cluster 6428778 is PF06826 with Jaccard = 1.0000	[ 63	0	1100148	0 ]	1.0000 1.0000
sibling [ 6428778 ] : 6709395 0.0707204 (=53186/(1213*620)) 93.8441
	best keyword for cluster 6709395 is PF00999 with Jaccard = 0.7175	[ 1194	456	1098547	14 ]	0.7236 0.9884

	SUGGESTING RELATEDNESS OF:   
	A> PF06826 ( PF06826   Predicted Permease Membrane Region )
	B> PF00999 ( PF00999   Sodium/hydrogen exchanger family )

	they come from the same clan: CL0064.7 : PF06826	PF03547	PF03601	PF05684	PF05982	PF03616	PF06965	PF00999	PF03977	PF01758
	the two keywords do not coincide on UniRef90 proteins

	only  PF06826 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 201 ) 6652057_PF05602_PF06836            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF06836  is 6453855 with Jaccard = 1.0000	|PF06836|=17 [ 17	0	1100194	0 ]
 parent [ 6453855 ] : 6652057 0.27342 (=251/(17*54)) 80.6986
  given [ 6453855 ] : 6453855 0.983333 (=59/(12*5)) 1.67437
	best keyword for cluster 6453855 is PF06836 with Jaccard = 1.0000	[ 17	0	1100194	0 ]	1.0000 1.0000
sibling [ 6453855 ] : 6615832 0.313725 (=48/(3*51)) 68.8235
	best keyword for cluster 6615832 is PF05602 with Jaccard = 1.0000	[ 51	0	1100160	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF06836 ( PF06836   Protein of unknown function (DUF1240) )
	B> PF05602 ( PF05602   Cleft lip and palate transmembrane protein 1 (CLPTM1) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither  PF06836 have structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 202 ) 6604293_PF06819_PF06847            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF06847  is 6554514 with Jaccard = 1.0000	|PF06847|=17 [ 17	0	1100194	0 ]
 parent [ 6554514 ] : 6604293 0.418301 (=64/(9*17)) 63.7092
  given [ 6554514 ] : 6554514 0.619048 (=26/(3*14)) 42.3601
	best keyword for cluster 6554514 is PF06847 with Jaccard = 1.0000	[ 17	0	1100194	0 ]	1.0000 1.0000
sibling [ 6554514 ] : 6432270 1 (=20/(4*5)) 0.350884
	best keyword for cluster 6432270 is PF06819 with Jaccard = 0.8889	[ 8	1	1100202	0 ]	0.8889 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF06847 ( PF06847   Archaeal Peptidase A24 C-terminus Type II )
	B> PF06819 ( PF06819   Archaeal Peptidase A24 C-terminal Domain )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither  PF06847 have structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 203 ) 6766845_PF00338_PF06856            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF06856  is 6524521 with Jaccard = 1.0000	|PF06856|=18 [ 18	0	1100193	0 ]
 parent [ 6524521 ] : 6766845 0.00543901 (=28/(18*286)) 99.5923
  given [ 6524521 ] : 6524521 0.775 (=62/(8*10)) 22.7035
	best keyword for cluster 6524521 is PF06856 with Jaccard = 1.0000	[ 18	0	1100193	0 ]	1.0000 1.0000
sibling [ 6524521 ] : 6726916 0.0381304 (=155/(15*271)) 96.3481
	best keyword for cluster 6726916 is PF00338 with Jaccard = 1.0000	[ 205	0	1100006	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF06856 ( PF06856   Protein of unknown function (DUF1251) )
	B> PF00338 ( PF00338   Ribosomal protein S10p/S20e )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF06856 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:
     1	PF00338 SSF54999 0.971 (average over 1055 mutual instances, PF00338 1056 appearances, SSF54999 1061 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 204 ) 6767361_PF01357_PF06865            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF06865  is 6440739 with Jaccard = 1.0000	|PF06865|=46 [ 46	0	1100165	0 ]
 parent [ 6440739 ] : 6767361 0.00649838 (=173/(54*493)) 99.6118
  given [ 6440739 ] : 6440739 0.993103 (=720/(25*29)) 0.690185
	best keyword for cluster 6440739 is PF06865 with Jaccard = 1.0000	[ 46	0	1100165	0 ]	1.0000 1.0000
sibling [ 6440739 ] : 6765849 0.0066309 (=131/(44*449)) 99.5501
	best keyword for cluster 6765849 is PF01357 with Jaccard = 0.7397	[ 270	95	1099846	0 ]	0.7397 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF06865 ( PF06865   Protein of unknown function (DUF1255) )
	B> PF01357 ( PF01357   Pollen allergen )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF06865 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 205 ) 6738371_PF04306_PF06897            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF06897  is 6713513 with Jaccard = 1.0000	|PF06897|=32 [ 32	0	1100179	0 ]
 parent [ 6713513 ] : 6738371 0.0336458 (=140/(57*73)) 97.6253
  given [ 6713513 ] : 6713513 0.0555556 (=24/(9*48)) 94.4991
	best keyword for cluster 6713513 is PF06897 with Jaccard = 1.0000	[ 32	0	1100179	0 ]	1.0000 1.0000
sibling [ 6713513 ] : 6720837 0.0497512 (=20/(6*67)) 95.5347
	best keyword for cluster 6720837 is PF04306 with Jaccard = 1.0000	[ 37	0	1100174	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF06897 ( PF06897   Protein of unknown function (DUF1269) )
	B> PF04306 ( PF04306   Protein of unknown function (DUF456) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither  PF06897 have structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 206 ) 6740286_PF03833_PF06906            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF06906  is 6336414 with Jaccard = 1.0000	|PF06906|=21 [ 21	0	1100190	0 ]
 parent [ 6336414 ] : 6740286 0.0378788 (=50/(24*55)) 97.8034
  given [ 6336414 ] : 6336414 1 (=80/(20*4)) 5.74267e-07
	best keyword for cluster 6336414 is PF06906 with Jaccard = 1.0000	[ 21	0	1100190	0 ]	1.0000 1.0000
sibling [ 6336414 ] : 6718365 0.0701058 (=53/(27*28)) 95.1994
	best keyword for cluster 6718365 is PF03833 with Jaccard = 0.7097	[ 22	9	1100180	0 ]	0.7097 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF06906 ( PF06906   Protein of unknown function (DUF1272) )
	B> PF03833 ( PF03833   DNA polymerase II large subunit DP2 )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither  PF06906 have structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 207 ) 6714146_PF04391_PF06967            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF06967  is 6270216 with Jaccard = 1.0000	|PF06967|=23 [ 23	0	1100188	0 ]
 parent [ 6270216 ] : 6714146 0.0714286 (=115/(23*70)) 94.5837
  given [ 6270216 ] : 6270216 1 (=42/(21*2)) 8.89159e-12
	best keyword for cluster 6270216 is PF06967 with Jaccard = 1.0000	[ 23	0	1100188	0 ]	1.0000 1.0000
sibling [ 6270216 ] : 6695280 0.102941 (=14/(2*68)) 91.388
	best keyword for cluster 6695280 is PF04391 with Jaccard = 0.9750	[ 39	1	1100171	0 ]	0.9750 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF06967 ( PF06967   Mo-dependent nitrogenase C-terminus )
	B> PF04391 ( PF04391   Protein of unknown function (DUF533) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither  PF06967 have structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 208 ) 6735731_PF00831_PF06984            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF06984  is 6537196 with Jaccard = 1.0000	|PF06984|=39 [ 39	0	1100172	0 ]
 parent [ 6537196 ] : 6735731 0.0346955 (=433/(40*312)) 97.3435
  given [ 6537196 ] : 6537196 0.701299 (=162/(7*33)) 30.2449
	best keyword for cluster 6537196 is PF06984 with Jaccard = 1.0000	[ 39	0	1100172	0 ]	1.0000 1.0000
sibling [ 6537196 ] : 6607871 0.38245 (=1155/(10*302)) 65.7607
	best keyword for cluster 6607871 is PF00831 with Jaccard = 0.9965	[ 281	0	1099929	1 ]	1.0000 0.9965

	SUGGESTING RELATEDNESS OF:   
	A> PF06984 ( PF06984   Mitochondrial 39-S ribosomal protein L47 (MRP-L47) )
	B> PF00831 ( PF00831   Ribosomal L29 protein )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF06984 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:
     1	PF06984 SSF46561 0.752 (average over 14 mutual instances, PF06984 14 appearances, SSF46561 1019 appearances)
     2	PF00831 SSF46561 0.926 (average over 999 mutual instances, PF00831 1000 appearances, SSF46561 1019 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 209 ) 6696135_PF07006_PF07252            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF07006  is 6317449 with Jaccard = 1.0000	|PF07006|=19 [ 19	0	1100192	0 ]
 parent [ 6317449 ] : 6696135 0.109114 (=85/(19*41)) 91.5899
  given [ 6317449 ] : 6317449 1 (=78/(13*6)) 2.60803e-08
	best keyword for cluster 6317449 is PF07006 with Jaccard = 1.0000	[ 19	0	1100192	0 ]	1.0000 1.0000
sibling [ 6317449 ] : 6664039 0.195767 (=74/(14*27)) 84.074
	best keyword for cluster 6664039 is PF07252 with Jaccard = 1.0000	[ 23	0	1100188	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF07006 ( PF07006   Protein of unknown function (DUF1310) )
	B> PF07252 ( PF07252   Protein of unknown function (DUF1433) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither  PF07006 have structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 210 ) 6723254_PF03301_PF07014            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF07014  is 6303319 with Jaccard = 1.0000	|PF07014|=9 [ 9	0	1100202	0 ]
 parent [ 6303319 ] : 6723254 0.0610329 (=39/(9*71)) 95.8948
  given [ 6303319 ] : 6303319 1 (=8/(1*8)) 2.5e-09
	best keyword for cluster 6303319 is PF07014 with Jaccard = 1.0000	[ 9	0	1100202	0 ]	1.0000 1.0000
sibling [ 6303319 ] : 6676221 0.147059 (=30/(3*68)) 87.2258
	best keyword for cluster 6676221 is PF03301 with Jaccard = 1.0000	[ 59	0	1100152	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF07014 ( PF07014   Hs1pro-1 protein C-terminus )
	B> PF03301 ( PF03301   Tryptophan 2,3-dioxygenase )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF07014 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 211 ) 6773216_PF07023_PF07999            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF07023  is 6445759 with Jaccard = 1.0000	|PF07023|=32 [ 32	0	1100179	0 ]
 parent [ 6445759 ] : 6773216 0.00322545 (=29/(37*243)) 99.811
  given [ 6445759 ] : 6445759 0.990909 (=327/(15*22)) 0.997043
	best keyword for cluster 6445759 is PF07023 with Jaccard = 1.0000	[ 32	0	1100179	0 ]	1.0000 1.0000
sibling [ 6445759 ] : 6769867 0.00413223 (=1/(1*242)) 99.7066
	best keyword for cluster 6769867 is PF07999 with Jaccard = 0.9944	[ 176	1	1100034	0 ]	0.9944 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF07023 ( PF07023   Protein of unknown function (DUF1315) )
	B> PF07999 ( PF07999   Retrotransposon hot spot protein )

	Only A has a clan ( CL0072.14 ).
	the two keywords do not coincide on UniRef90 proteins

	Neither PF07023 nor PF07999 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 212 ) 6726885_PF04965_PF07025            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF07025  is 6699234 with Jaccard = 1.0000	|PF07025|=81 [ 81	0	1100130	0 ]
 parent [ 6699234 ] : 6726885 0.0480519 (=481/(91*110)) 96.3428
  given [ 6699234 ] : 6699234 0.0948276 (=33/(87*4)) 92.0491
	best keyword for cluster 6699234 is PF07025 with Jaccard = 1.0000	[ 81	0	1100130	0 ]	1.0000 1.0000
sibling [ 6699234 ] : 6720064 0.0533333 (=28/(5*105)) 95.4232
	best keyword for cluster 6720064 is PF04965 with Jaccard = 0.9277	[ 77	6	1100128	0 ]	0.9277 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF07025 (  )
	B> PF04965 ( PF04965   Gene 25-like lysozyme )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF07025 nor PF04965 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 213 ) 6678285_PF06133_PF07050            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF07050  is 6617796 with Jaccard = 1.0000	|PF07050|=20 [ 20	0	1100191	0 ]
 parent [ 6617796 ] : 6678285 0.139706 (=285/(40*51)) 87.7535
  given [ 6617796 ] : 6617796 0.3225 (=129/(20*20)) 69.537
	best keyword for cluster 6617796 is PF07050 with Jaccard = 1.0000	[ 20	0	1100191	0 ]	1.0000 1.0000
sibling [ 6617796 ] : 6654679 0.18617 (=35/(47*4)) 81.6075
	best keyword for cluster 6654679 is PF06133 with Jaccard = 0.9756	[ 40	0	1100170	1 ]	1.0000 0.9756

	SUGGESTING RELATEDNESS OF:   
	A> PF07050 ( PF07050   Protein of unknown function (DUF1333) )
	B> PF06133 ( PF06133   Protein of unknown function (DUF964) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF07050 nor PF06133 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 214 ) 6729287_PF01081_PF07071            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF07071  is 6288448 with Jaccard = 1.0000	|PF07071|=11 [ 11	0	1100200	0 ]
 parent [ 6288448 ] : 6729287 0.0474658 (=118/(11*226)) 96.6454
  given [ 6288448 ] : 6288448 1 (=10/(1*10)) 2e-10
	best keyword for cluster 6288448 is PF07071 with Jaccard = 1.0000	[ 11	0	1100200	0 ]	1.0000 1.0000
sibling [ 6288448 ] : 6678844 0.15786 (=242/(7*219)) 87.8921
	best keyword for cluster 6678844 is PF01081 with Jaccard = 0.9846	[ 192	0	1100016	3 ]	1.0000 0.9846

	SUGGESTING RELATEDNESS OF:   
	A> PF07071 ( PF07071   Protein of unknown function (DUF1341) )
	B> PF01081 ( PF01081   KDPG and KHG aldolase )

	Only B has a clan ( CL0036.17 ).
	the two keywords do not coincide on UniRef90 proteins

	only PF01081 has a PDB structure (may not be up to date)
		PF01081 	   c.1.10.1

	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 215 ) 6680901_PF07088_PF07181            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF07088  is 6297211 with Jaccard = 1.0000	|PF07088|=7 [ 7	0	1100204	0 ]
 parent [ 6297211 ] : 6680901 0.119048 (=5/(6*7)) 88.4074
  given [ 6297211 ] : 6297211 1 (=10/(2*5)) 9.01e-10
	best keyword for cluster 6297211 is PF07088 with Jaccard = 1.0000	[ 7	0	1100204	0 ]	1.0000 1.0000
sibling [ 6297211 ] : 6109666 1 (=5/(1*5)) 1.00042e-24
	best keyword for cluster 6109666 is PF07181 with Jaccard = 1.0000	[ 5	0	1100206	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF07088 ( PF07088   GvpD gas vesicle protein )
	B> PF07181 ( PF07181   VirC2 protein )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF07088 nor PF07181 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 216 ) 6765756_PF00816_PF07146            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF07146  is 6742240 with Jaccard = 1.0000	|PF07146|=17 [ 17	0	1100194	0 ]
 parent [ 6742240 ] : 6765756 0.00577201 (=40/(210*33)) 99.5458
  given [ 6742240 ] : 6742240 0.0225564 (=6/(19*14)) 97.9948
	best keyword for cluster 6742240 is PF07146 with Jaccard = 1.0000	[ 17	0	1100194	0 ]	1.0000 1.0000
sibling [ 6742240 ] : 6732367 0.0386939 (=237/(175*35)) 96.9927
	best keyword for cluster 6732367 is PF00816 with Jaccard = 1.0000	[ 150	0	1100061	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF07146 ( PF07146   Protein of unknown function (DUF1389) )
	B> PF00816 ( PF00816   H-NS histone family )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only PF00816 has a PDB structure (may not be up to date)
		PF00816 	   a.155.1.1

	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 217 ) 6748319_PF00335_PF07150            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF07150  is 6691378 with Jaccard = 1.0000	|PF07150|=8 [ 8	0	1100203	0 ]
 parent [ 6691378 ] : 6748319 0.0176329 (=146/(15*552)) 98.4933
  given [ 6691378 ] : 6691378 0.113636 (=5/(4*11)) 90.5531
	best keyword for cluster 6691378 is PF07150 with Jaccard = 1.0000	[ 8	0	1100203	0 ]	1.0000 1.0000
sibling [ 6691378 ] : 6738017 0.0322623 (=243/(14*538)) 97.5898
	best keyword for cluster 6738017 is PF00335 with Jaccard = 0.9867	[ 445	1	1099760	5 ]	0.9978 0.9889

	SUGGESTING RELATEDNESS OF:   
	A> PF07150 ( PF07150   Protein of unknown function (DUF1390) )
	B> PF00335 ( PF00335   Tetraspanin family )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF07150 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 218 ) 6671500_PF00669_PF07164            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF07164  is 6507099 with Jaccard = 1.0000	|PF07164|=14 [ 14	0	1100197	0 ]
 parent [ 6507099 ] : 6671500 0.162438 (=5263/(40*810)) 85.933
  given [ 6507099 ] : 6507099 0.874459 (=202/(7*33)) 14.1077
	best keyword for cluster 6507099 is PF07164 with Jaccard = 1.0000	[ 14	0	1100197	0 ]	1.0000 1.0000
sibling [ 6507099 ] : 6643793 0.235149 (=380/(2*808)) 78.0815
	best keyword for cluster 6643793 is PF00669 with Jaccard = 0.9661	[ 684	22	1099503	2 ]	0.9688 0.9971

	SUGGESTING RELATEDNESS OF:   
	A> PF07164 ( PF07164   Putative flagellar hook-associated protein 3 (HAP3) )
	B> PF00669 ( PF00669   Bacterial flagellin N-terminus )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only PF00669 has a PDB structure (may not be up to date)
		PF00669 	   e.32.1.1

	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 219 ) 6702869_PF07183_PF07756            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF07183  is 6520379 with Jaccard = 1.0000	|PF07183|=12 [ 12	0	1100199	0 ]
 parent [ 6520379 ] : 6702869 0.0926724 (=43/(16*29)) 92.6756
  given [ 6520379 ] : 6520379 0.8 (=12/(1*15)) 20.2269
	best keyword for cluster 6520379 is PF07183 with Jaccard = 1.0000	[ 12	0	1100199	0 ]	1.0000 1.0000
sibling [ 6520379 ] : 6631579 0.275 (=33/(5*24)) 75.0431
	best keyword for cluster 6631579 is PF07756 with Jaccard = 1.0000	[ 14	0	1100197	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF07183 ( PF07183   Protein of unknown function (DUF1403) )
	B> PF07756 ( PF07756   Protein of unknown function (DUF1612) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF07183 nor PF07756 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 220 ) 6675009_PF07241_PF07873            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF07241  is 6395454 with Jaccard = 1.0000	|PF07241|=24 [ 24	0	1100187	0 ]
 parent [ 6395454 ] : 6675009 0.172727 (=95/(22*25)) 86.9088
  given [ 6395454 ] : 6395454 1 (=66/(3*22)) 0.0044824
	best keyword for cluster 6395454 is PF07241 with Jaccard = 1.0000	[ 24	0	1100187	0 ]	1.0000 1.0000
sibling [ 6395454 ] : 6359649 1 (=40/(2*20)) 2.12643e-05
	best keyword for cluster 6359649 is PF07873 with Jaccard = 1.0000	[ 21	0	1100190	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF07241 ( PF07241   Protein of unknown function (DUF1429) )
	B> PF07873 ( PF07873   YabP family )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF07241 nor PF07873 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 221 ) 6672174_PF06656_PF07245            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF07245  is 6376394 with Jaccard = 1.0000	|PF07245|=16 [ 16	0	1100195	0 ]
 parent [ 6376394 ] : 6672174 0.151786 (=17/(16*7)) 86.0753
  given [ 6376394 ] : 6376394 1 (=55/(5*11)) 0.000295131
	best keyword for cluster 6376394 is PF07245 with Jaccard = 1.0000	[ 16	0	1100195	0 ]	1.0000 1.0000
sibling [ 6376394 ] : 6608857 0.4 (=4/(5*2)) 66.49
	best keyword for cluster 6608857 is PF06656 with Jaccard = 1.0000	[ 5	0	1100206	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF07245 ( PF07245   Phlebovirus glycoprotein G2 )
	B> PF06656 ( PF06656   Tenuivirus PVC2 protein )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF07245 nor PF06656 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 222 ) 6728300_PF04271_PF07261            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF07261  is 6545393 with Jaccard = 1.0000	|PF07261|=33 [ 33	0	1100178	0 ]
 parent [ 6545393 ] : 6728300 0.0441122 (=478/(43*252)) 96.5185
  given [ 6545393 ] : 6545393 0.678947 (=129/(5*38)) 35.5023
	best keyword for cluster 6545393 is PF07261 with Jaccard = 1.0000	[ 33	0	1100178	0 ]	1.0000 1.0000
sibling [ 6545393 ] : 6721711 0.0456731 (=475/(52*200)) 95.6656
	best keyword for cluster 6721711 is PF04271 with Jaccard = 0.7607	[ 89	25	1100094	3 ]	0.7807 0.9674

	SUGGESTING RELATEDNESS OF:   
	A> PF07261 ( PF07261   Replication initiation and membrane attachment protein (DnaB) )
	B> PF04271 ( PF04271   DnaD-like domain )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF07261 nor PF04271 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 223 ) 6707929_PF07211_PF07352            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF07352  is 6529230 with Jaccard = 1.0000	|PF07352|=15 [ 15	0	1100196	0 ]
 parent [ 6529230 ] : 6707929 0.0864198 (=14/(18*9)) 93.6176
  given [ 6529230 ] : 6529230 0.763889 (=55/(12*6)) 25.3618
	best keyword for cluster 6529230 is PF07352 with Jaccard = 1.0000	[ 15	0	1100196	0 ]	1.0000 1.0000
sibling [ 6529230 ] : 6662867 0.166667 (=3/(6*3)) 83.8579
	best keyword for cluster 6662867 is PF07211 with Jaccard = 1.0000	[ 5	0	1100206	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF07352 ( PF07352   Bacteriophage Mu Gam like protein )
	B> PF07211 ( PF07211   Protein of unknown function (DUF1417) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF07352 nor PF07211 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 224 ) 6696871_PF02987_PF07384            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF07384  is 6683589 with Jaccard = 1.0000	|PF07384|=1 [ 1	0	1100210	0 ]
 parent [ 6683589 ] : 6696871 0.103596 (=386/(9*414)) 91.7138
  given [ 6683589 ] : 6683589 0.125 (=1/(1*8)) 89
	best keyword for cluster 6683589 is PF07384 with Jaccard = 1.0000	[ 1	0	1100210	0 ]	1.0000 1.0000
sibling [ 6683589 ] : 6693759 0.106762 (=761/(18*396)) 91.0191
	best keyword for cluster 6693759 is PF02987 with Jaccard = 0.6947	[ 132	26	1100021	32 ]	0.8354 0.8049

	SUGGESTING RELATEDNESS OF:   
	A> PF07384 ( PF07384   Protein of unknown function (DUF1497) )
	B> PF02987 ( PF02987   Late embryogenesis abundant protein )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF07384 nor PF02987 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:
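The per-cluster scores in this record can be recomputed from the bracketed counts [ TP FP TN FN ]. The sketch below does this for the sibling cluster 6693759 ([ 132 26 1100021 32 ]). The function name is hypothetical. Note that the value printed in the Specificity column of this record matches TP/(TP+FP), which is more commonly called precision; the conventional specificity TN/(TN+FP) would be close to 1 here. Whether this holds for every record in the file is an assumption.

```python
def keyword_cluster_scores(tp, fp, tn, fn):
    """Recompute the three scores printed after the [TP FP TN FN] counts.

    tn is accepted so the call mirrors the printed bracket, but none of
    the three printed scores uses it.
    """
    jaccard = tp / (tp + fp + fn)        # intersection / union
    specificity_col = tp / (tp + fp)     # printed as "Specificity" (precision)
    sensitivity_col = tp / (tp + fn)     # printed as "Sensitivity" (recall)
    return (round(jaccard, 4),
            round(specificity_col, 4),
            round(sensitivity_col, 4))


# Sibling cluster 6693759, keyword PF02987
print(keyword_cluster_scores(132, 26, 1100021, 32))
```

The three values agree with the 0.6947, 0.8354 and 0.8049 printed for this cluster.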






-------------------====== ( 225 ) 6736878_PF02691_PF07406            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF07406  is 6181172 with Jaccard = 1.0000	|PF07406|=9 [ 9	0	1100202	0 ]
 parent [ 6181172 ] : 6736878 0.0299145 (=14/(9*52)) 97.4704
  given [ 6181172 ] : 6181172 1 (=18/(3*6)) 8.89382e-19
	best keyword for cluster 6181172 is PF07406 with Jaccard = 1.0000	[ 9	0	1100202	0 ]	1.0000 1.0000
sibling [ 6181172 ] : 6725357 0.0416667 (=8/(4*48)) 96.151
	best keyword for cluster 6725357 is PF02691 with Jaccard = 0.9375	[ 45	1	1100163	2 ]	0.9783 0.9574

	SUGGESTING RELATEDNESS OF:   
	A> PF07406 ( PF07406   NICE-3 protein )
	B> PF02691 ( PF02691   Vacuolating cyotoxin )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF07406 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 226 ) 6756120_PF01963_PF07446            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF07446  is 6689118 with Jaccard = 1.0000	|PF07446|=61 [ 61	0	1100150	0 ]
 parent [ 6689118 ] : 6756120 0.0141666 (=107/(83*91)) 99.0387
  given [ 6689118 ] : 6689118 0.115616 (=77/(74*9)) 90.102
	best keyword for cluster 6689118 is PF07446 with Jaccard = 1.0000	[ 61	0	1100150	0 ]	1.0000 1.0000
sibling [ 6689118 ] : 6740494 0.0278293 (=30/(77*14)) 97.8268
	best keyword for cluster 6740494 is PF01963 with Jaccard = 0.9831	[ 58	0	1100152	1 ]	1.0000 0.9831

	SUGGESTING RELATEDNESS OF:   
	A> PF07446 ( PF07446   GumN protein )
	B> PF01963 ( PF01963   TraB family )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF07446 nor PF01963 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 227 ) 6600048_PF00666_PF07448            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF07448  is 6262049 with Jaccard = 1.0000	|PF07448|=6 [ 6	0	1100205	0 ]
 parent [ 6262049 ] : 6600048 0.460317 (=116/(6*42)) 61.5633
  given [ 6262049 ] : 6262049 1 (=5/(1*5)) 2.01606e-12
	best keyword for cluster 6262049 is PF07448 with Jaccard = 1.0000	[ 6	0	1100205	0 ]	1.0000 1.0000
sibling [ 6262049 ] : 6536028 0.815789 (=124/(4*38)) 29.552
	best keyword for cluster 6536028 is PF00666 with Jaccard = 0.9459	[ 35	0	1100174	2 ]	1.0000 0.9459

	SUGGESTING RELATEDNESS OF:   
	A> PF07448 ( PF07448   Secreted phosphoprotein 24 (Spp-24) )
	B> PF00666 ( PF00666   Cathelicidin )

	they come from the same clan: CL0121.6 : PF00666	PF00031	PF07448
	the two keywords do not coincide on UniRef90 proteins

	only PF00666 has a PDB structure (may not be up to date)
		PF00666 	   d.17.1.3

	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 228 ) 6713040_PF00728_PF07555            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF07555  is 6653972 with Jaccard = 1.0000	|PF07555|=31 [ 31	0	1100180	0 ]
 parent [ 6653972 ] : 6713040 0.0634921 (=576/(36*252)) 94.4161
  given [ 6653972 ] : 6653972 0.191176 (=13/(2*34)) 81.3482
	best keyword for cluster 6653972 is PF07555 with Jaccard = 1.0000	[ 31	0	1100180	0 ]	1.0000 1.0000
sibling [ 6653972 ] : 6702335 0.0916335 (=23/(1*251)) 92.5988
	best keyword for cluster 6702335 is PF00728 with Jaccard = 0.9957	[ 233	0	1099977	1 ]	1.0000 0.9957

	SUGGESTING RELATEDNESS OF:   
	A> PF07555 ( PF07555   Hyaluronidase )
	B> PF00728 ( PF00728   Glycosyl hydrolase family 20, catalytic domain )

	Only B has a clan ( CL0058.10 ).
	the two keywords do not coincide on UniRef90 proteins

	only PF00728 has a PDB structure (may not be up to date)
		PF00728 	   c.1.8.6

	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 229 ) 6645404_PF04642_PF07794            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF07794  is 6512476 with Jaccard = 1.0000	|PF07794|=13 [ 13	0	1100198	0 ]
 parent [ 6512476 ] : 6645404 0.238782 (=149/(13*48)) 78.5899
  given [ 6512476 ] : 6512476 0.833333 (=30/(4*9)) 16.6667
	best keyword for cluster 6512476 is PF07794 with Jaccard = 1.0000	[ 13	0	1100198	0 ]	1.0000 1.0000
sibling [ 6512476 ] : 6625243 0.284375 (=91/(8*40)) 72.5771
	best keyword for cluster 6625243 is PF04642 with Jaccard = 0.7500	[ 6	2	1100203	0 ]	0.7500 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF07794 ( PF07794   Protein of unknown function (DUF1633) )
	B> PF04642 ( PF04642   Protein of unknown function, DUF601 )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF07794 nor PF04642 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 230 ) 6734835_PF02624_PF07812            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF07812  is 6575948 with Jaccard = 1.0000	|PF07812|=23 [ 23	0	1100188	0 ]
 parent [ 6575948 ] : 6734835 0.0286499 (=73/(26*98)) 97.2563
  given [ 6575948 ] : 6575948 0.479167 (=23/(24*2)) 52.5428
	best keyword for cluster 6575948 is PF07812 with Jaccard = 1.0000	[ 23	0	1100188	0 ]	1.0000 1.0000
sibling [ 6575948 ] : 6723850 0.0412371 (=4/(1*97)) 95.9815
	best keyword for cluster 6723850 is PF02624 with Jaccard = 0.9348	[ 86	0	1100119	6 ]	1.0000 0.9348

	SUGGESTING RELATEDNESS OF:   
	A> PF07812 ( PF07812   TfuA-like protein )
	B> PF02624 ( PF02624   YcaO-like family )

	Neither A nor B are assigned a clan.

	the two keywords coincide on UniRef90 proteins: |PF02624| = 92 ,	 |PF07812| = 23 , 	|PF02624^PF07812| = 1	( 1.1% and 4.3% )

	Neither PF07812 nor PF02624 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 231 ) 6640090_PF04961_PF07837            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF07837  is 6552246 with Jaccard = 1.0000	|PF07837|=36 [ 36	0	1100175	0 ]
 parent [ 6552246 ] : 6640090 0.23141 (=389/(41*41)) 77.0151
  given [ 6552246 ] : 6552246 0.7 (=28/(1*40)) 40.5791
	best keyword for cluster 6552246 is PF07837 with Jaccard = 1.0000	[ 36	0	1100175	0 ]	1.0000 1.0000
sibling [ 6552246 ] : 6628199 0.263158 (=30/(3*38)) 73.758
	best keyword for cluster 6628199 is PF04961 with Jaccard = 0.7609	[ 35	0	1100165	11 ]	1.0000 0.7609

	SUGGESTING RELATEDNESS OF:   
	A> PF07837 ( PF07837   Formiminotransferase domain, N-terminal subdomain )
	B> PF04961 ( PF04961   Formiminotransferase-cyclodeaminase )

	Neither A nor B are assigned a clan.

	the two keywords coincide on UniRef90 proteins: |PF04961| = 46 ,	 |PF07837| = 36 , 	|PF04961^PF07837| = 10	( 21.7% and 27.8% )

	both PF07837 and PF04961 have PDB structures
	PF07837 	   d.58.34.1
	PF04961 	   a.191.1.1

	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:
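The two percentages on the "coincide" line of this record appear to be the shared protein count divided by each keyword's own count. A minimal sketch under that assumption, using the numbers printed above (|PF04961| = 46, |PF07837| = 36, shared = 10); the function name is hypothetical:

```python
def overlap_percentages(size_first, size_second, shared):
    """Percentage of each keyword's proteins that also carry the other keyword."""
    return (round(100 * shared / size_first, 1),
            round(100 * shared / size_second, 1))


# |PF04961| = 46, |PF07837| = 36, |PF04961^PF07837| = 10
print(overlap_percentages(46, 36, 10))
```

With these inputs the result agrees with the printed 21.7% and 27.8%.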






-------------------====== ( 232 ) 6658087_PF07399_PF07854            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF07854  is 6022870 with Jaccard = 1.0000	|PF07854|=15 [ 15	0	1100196	0 ]
 parent [ 6022870 ] : 6658087 0.226389 (=163/(15*48)) 82.7444
  given [ 6022870 ] : 6022870 1 (=14/(1*14)) 3.57921e-32
	best keyword for cluster 6022870 is PF07854 with Jaccard = 1.0000	[ 15	0	1100196	0 ]	1.0000 1.0000
sibling [ 6022870 ] : 6633549 0.308594 (=158/(32*16)) 75.4505
	best keyword for cluster 6633549 is PF07399 with Jaccard = 0.7619	[ 16	5	1100190	0 ]	0.7619 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF07854 ( PF07854   Protein of unknown function (DUF1646) )
	B> PF07399 ( PF07399   Protein of unknown function (DUF1504) )

	they come from the same clan: CL0182.8 : PF06450	PF00939	PF03553	PF07158	PF02652	PF02447	PF04165	PF07854	PF07399	PF03606	PF03605	PF06808	PF03600	PF02040	PF00873	PF03806	PF02667
	the two keywords do not coincide on UniRef90 proteins

	Neither PF07854 nor PF07399 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 233 ) 6734858_PF00614_PF07894            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF07894  is 6651714 with Jaccard = 1.0000	|PF07894|=35 [ 35	0	1100176	0 ]
 parent [ 6651714 ] : 6734858 0.0338302 (=1112/(38*865)) 97.2589
  given [ 6651714 ] : 6651714 0.198529 (=27/(34*4)) 80.524
	best keyword for cluster 6651714 is PF07894 with Jaccard = 1.0000	[ 35	0	1100176	0 ]	1.0000 1.0000
sibling [ 6651714 ] : 6715742 0.0644783 (=660/(12*853)) 94.8356
	best keyword for cluster 6715742 is PF00614 with Jaccard = 0.9690	[ 657	13	1099533	8 ]	0.9806 0.9880

	SUGGESTING RELATEDNESS OF:   
	A> PF07894 ( PF07894   Protein of unknown function (DUF1669) )
	B> PF00614 ( PF00614   Phospholipase D Active site motif )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF07894 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 234 ) 6753445_PF01281_PF07942            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF07942  is 6491569 with Jaccard = 1.0000	|PF07942|=58 [ 58	0	1100153	0 ]
 parent [ 6491569 ] : 6753445 0.0125095 (=230/(58*317)) 98.8673
  given [ 6491569 ] : 6491569 0.924528 (=245/(5*53)) 8.73119
	best keyword for cluster 6491569 is PF07942 with Jaccard = 1.0000	[ 58	0	1100153	0 ]	1.0000 1.0000
sibling [ 6491569 ] : 6740541 0.030303 (=102/(306*11)) 97.8315
	best keyword for cluster 6740541 is PF01281 with Jaccard = 0.9821	[ 274	1	1099932	4 ]	0.9964 0.9856

	SUGGESTING RELATEDNESS OF:   
	A> PF07942 ( PF07942   N2227-like protein )
	B> PF01281 ( PF01281   Ribosomal protein L9, N-terminal domain )

	Only A has a clan ( CL0102.14 ).

	the two keywords coincide on UniRef90 proteins: |PF01281| = 278 ,	 |PF07942| = 58 , 	|PF01281^PF07942| = 1	( 0.4% and 1.7% )

	only  PF07942 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 235 ) 6676079_PF05300_PF07956            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF07956  is 6479923 with Jaccard = 1.0000	|PF07956|=13 [ 13	0	1100198	0 ]
 parent [ 6479923 ] : 6676079 0.16036 (=89/(15*37)) 87.1812
  given [ 6479923 ] : 6479923 0.946429 (=53/(8*7)) 5.6792
	best keyword for cluster 6479923 is PF07956 with Jaccard = 1.0000	[ 13	0	1100198	0 ]	1.0000 1.0000
sibling [ 6479923 ] : 6672233 0.138889 (=5/(1*36)) 86.1129
	best keyword for cluster 6672233 is PF05300 with Jaccard = 0.7037	[ 19	8	1100184	0 ]	0.7037 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF07956 ( PF07956   Protein of Unknown function (DUF1690) )
	B> PF05300 ( PF05300   Protein of unknown function (DUF737) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF07956 nor PF05300 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:
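The three figures on each "best keyword" line can be rechecked by hand from the [ TP FP TN FN ] counts. The sketch below is not the script that produced this report; it is a minimal Python illustration, and the function name keyword_metrics is made up for this note. It assumes Jaccard = TP/(TP+FP+FN), that the first trailing figure (labelled Specificity in the legend) is TP/(TP+FP), and that Sensitivity = TP/(TP+FN). These assumptions are consistent with the values printed for cluster 6672233 above.

```python
def keyword_metrics(tp, fp, tn, fn):
    """Recompute the per-cluster keyword statistics from a confusion table.

    Assumed definitions (inferred from the printed values, not from the
    original pipeline):
      jaccard     = TP / (TP + FP + FN)
      first_stat  = TP / (TP + FP)   # printed under "Specificity"
      sensitivity = TP / (TP + FN)
    TN is accepted for completeness but is not used by these three ratios.
    """
    jaccard = tp / (tp + fp + fn)
    first_stat = tp / (tp + fp)
    sensitivity = tp / (tp + fn)
    return jaccard, first_stat, sensitivity


# Counts copied from the PF05300 line of this record: [ 19 8 1100184 0 ]
j, p, s = keyword_metrics(19, 8, 1100184, 0)
print(f"{j:.4f} {p:.4f} {s:.4f}")  # prints: 0.7037 0.7037 1.0000
```

Note that TN/(TN+FP) would round to 1.0000 for these counts, so the first trailing figure behaves like a precision rather than a textbook specificity.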






-------------------====== ( 236 ) 6659593_PF04314_PF07987            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF07987  is 6596885 with Jaccard = 1.0000	|PF07987|=36 [ 36	0	1100175	0 ]
 parent [ 6596885 ] : 6659593 0.168105 (=896/(41*130)) 83.2756
  given [ 6596885 ] : 6596885 0.447368 (=51/(3*38)) 60.1169
	best keyword for cluster 6596885 is PF07987 with Jaccard = 1.0000	[ 36	0	1100175	0 ]	1.0000 1.0000
sibling [ 6596885 ] : 6638422 0.257812 (=66/(2*128)) 76.5979
	best keyword for cluster 6638422 is PF04314 with Jaccard = 0.9500	[ 114	0	1100091	6 ]	1.0000 0.9500

	SUGGESTING RELATEDNESS OF:   
	A> PF07987 ( PF07987   Domain of unknown function (DUF1775) )
	B> PF04314 ( PF04314   Protein of unknown function (DUF461) )

	Neither A nor B are assigned a clan.

	the two keywords coincide on UniRef90 proteins: |PF04314| = 120 ,	 |PF07987| = 36 , 	|PF04314^PF07987| = 6	( 5.0% and 16.7% )

	only  PF07987 has a PDB structure (may not be up to date)
		PF04314 	   b.2.10.1

	SUPERFAM mapping significantly overlapping:
     1	PF04314 SSF110087 0.786 (average over 395 mutual instances, PF04314 395 appearances, SSF110087 417 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 237 ) 6728765_PF05013_PF08014            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF08014  is 6625336 with Jaccard = 1.0000	|PF08014|=30 [ 30	0	1100181	0 ]
 parent [ 6625336 ] : 6728765 0.0395745 (=279/(50*141)) 96.5812
  given [ 6625336 ] : 6625336 0.295635 (=149/(14*36)) 72.6554
	best keyword for cluster 6625336 is PF08014 with Jaccard = 1.0000	[ 30	0	1100181	0 ]	1.0000 1.0000
sibling [ 6625336 ] : 6691550 0.105839 (=58/(137*4)) 90.5826
	best keyword for cluster 6691550 is PF05013 with Jaccard = 1.0000	[ 117	0	1100094	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF08014 ( PF08014   Domain of unknown function (DUF1704) )
	B> PF05013 ( PF05013   N-formylglutamate amidohydrolase )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF08014 nor PF05013 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 238 ) 6759786_PF00272_PF08107            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF08107  is 6691232 with Jaccard = 1.0000	|PF08107|=22 [ 22	0	1100189	0 ]
 parent [ 6691232 ] : 6759786 0.0118881 (=17/(22*65)) 99.2522
  given [ 6691232 ] : 6691232 0.0952381 (=2/(1*21)) 90.5016
	best keyword for cluster 6691232 is PF08107 with Jaccard = 1.0000	[ 22	0	1100189	0 ]	1.0000 1.0000
sibling [ 6691232 ] : 6741762 0.0328947 (=15/(8*57)) 97.9496
	best keyword for cluster 6741762 is PF00272 with Jaccard = 1.0000	[ 51	0	1100160	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF08107 ( PF08107   Pleurocidin family )
	B> PF00272 ( PF00272   Cecropin family )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	both PF08107 and PF00272 have PDB structures
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 239 ) 6625572_PF00400_PF08149            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF08149  is 6299611 with Jaccard = 1.0000	|PF08149|=47 [ 47	0	1100164	0 ]
 parent [ 6299611 ] : 6625572 0.326452 (=64457/(47*4201)) 72.7022
  given [ 6299611 ] : 6299611 1 (=46/(1*46)) 1.30436e-09
	best keyword for cluster 6299611 is PF08149 with Jaccard = 1.0000	[ 47	0	1100164	0 ]	1.0000 1.0000
sibling [ 6299611 ] : 6624157 0.31286 (=33961/(26*4175)) 72.0956
	best keyword for cluster 6624157 is PF00400 with Jaccard = 0.6951	[ 3976	38	1094491	1706 ]	0.9905 0.6998

	SUGGESTING RELATEDNESS OF:   
	A> PF08149 ( PF08149   BING4CT (NUC141) domain )
	B> PF00400 ( PF00400   WD domain, G-beta repeat )

	Only B has a clan ( CL0186.8 ).

	the two keywords coincide on UniRef90 proteins: |PF00400| = 5682 ,	 |PF08149| = 47 , 	|PF00400^PF08149| = 44	( 0.8% and 93.6% )

	only  PF08149 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:
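The two percentages on the "coincide" line of this record appear to be the shared protein count divided by each family's size, in the order the sizes are listed. A minimal Python sketch of that arithmetic follows; the function name overlap_percentages is hypothetical and the rounding to one decimal is assumed from the printed figures.

```python
def overlap_percentages(size_first, size_second, shared):
    """Return the shared count as a percentage of each family's size."""
    return 100.0 * shared / size_first, 100.0 * shared / size_second


# Values copied from this record: |PF00400| = 5682, |PF08149| = 47, shared = 44
pct_first, pct_second = overlap_percentages(5682, 47, 44)
print(f"{pct_first:.1f}% and {pct_second:.1f}%")  # prints: 0.8% and 93.6%
```

Read this way, nearly every PF08149 protein also carries PF00400, while PF08149 accounts for under 1% of PF00400 proteins.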






-------------------====== ( 240 ) 6737546_PF00400_PF08159            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF08159  is 6695144 with Jaccard = 1.0000	|PF08159|=50 [ 50	0	1100161	0 ]
 parent [ 6695144 ] : 6737546 0.0325438 (=20362/(99*6320)) 97.5386
  given [ 6695144 ] : 6695144 0.12018 (=293/(46*53)) 91.3417
	best keyword for cluster 6695144 is PF08159 with Jaccard = 1.0000	[ 50	0	1100161	0 ]	1.0000 1.0000
sibling [ 6695144 ] : 6736650 0.0304183 (=1536/(8*6312)) 97.4416
	best keyword for cluster 6736650 is PF00400 with Jaccard = 0.8806	[ 5141	156	1094373	541 ]	0.9705 0.9048

	SUGGESTING RELATEDNESS OF:   
	A> PF08159 ( PF08159   NUC153 domain )
	B> PF00400 ( PF00400   WD domain, G-beta repeat )

	Only B has a clan ( CL0186.8 ).

	the two keywords coincide on UniRef90 proteins: |PF00400| = 5682 ,	 |PF08159| = 50 , 	|PF00400^PF08159| = 5	( 0.1% and 10.0% )

	only  PF08159 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 241 ) 6774912_PF06006_PF08195            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF08195  is 6770811 with Jaccard = 1.0000	|PF08195|=1 [ 1	0	1100210	0 ]
 parent [ 6770811 ] : 6774912 0.0015361 (=4/(42*62)) 99.8533
  given [ 6770811 ] : 6770811 0.0163934 (=1/(1*61)) 99.7377
	best keyword for cluster 6770811 is PF08195 with Jaccard = 1.0000	[ 1	0	1100210	0 ]	1.0000 1.0000
sibling [ 6770811 ] : 6768338 0.00470588 (=2/(25*17)) 99.6518
	best keyword for cluster 6768338 is PF06006 with Jaccard = 1.0000	[ 12	0	1100199	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF08195 ( PF08195   TRI9 protein )
	B> PF06006 ( PF06006   Bacterial protein of unknown function (DUF905) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF08195 nor PF06006 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 242 ) 6729629_PF01581_PF08257            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF08257  is 6629938 with Jaccard = 1.0000	|PF08257|=7 [ 7	0	1100204	0 ]
 parent [ 6629938 ] : 6729629 0.0435374 (=32/(7*105)) 96.6833
  given [ 6629938 ] : 6629938 0.666667 (=4/(1*6)) 74.5
	best keyword for cluster 6629938 is PF08257 with Jaccard = 1.0000	[ 7	0	1100204	0 ]	1.0000 1.0000
sibling [ 6629938 ] : 6725506 0.0515789 (=49/(10*95)) 96.169
	best keyword for cluster 6725506 is PF01581 with Jaccard = 0.8971	[ 61	0	1100143	7 ]	1.0000 0.8971

	SUGGESTING RELATEDNESS OF:   
	A> PF08257 ( PF08257   Sulfakinin family )
	B> PF01581 ( PF01581   FMRFamide related peptide family )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF08257 nor PF01581 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 243 ) 6722846_PF05827_PF08319            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF08319  is 6619152 with Jaccard = 1.0000	|PF08319|=18 [ 18	0	1100193	0 ]
 parent [ 6619152 ] : 6722846 0.0601173 (=41/(22*31)) 95.838
  given [ 6619152 ] : 6619152 0.321429 (=36/(8*14)) 70.021
	best keyword for cluster 6619152 is PF08319 with Jaccard = 1.0000	[ 18	0	1100193	0 ]	1.0000 1.0000
sibling [ 6619152 ] : 6705544 0.0769231 (=10/(26*5)) 93.2016
	best keyword for cluster 6705544 is PF05827 with Jaccard = 0.9600	[ 24	0	1100186	1 ]	1.0000 0.9600

	SUGGESTING RELATEDNESS OF:   
	A> PF08319 ( PF08319   ER protein BIG1 )
	B> PF05827 ( PF05827   Vacuolar ATP synthase subunit S1 (ATP6S1) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF08319 nor PF05827 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 244 ) 6236339_PF07952_PF08470            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF08470  is 5184278 with Jaccard = 1.0000	|PF08470|=10 [ 10	0	1100201	0 ]
 parent [ 5184278 ] : 6236339 1 (=140/(10*14)) 2.06062e-14
  given [ 5184278 ] : 5184278 1 (=9/(1*9)) 0
	best keyword for cluster 5184278 is PF08470 with Jaccard = 1.0000	[ 10	0	1100201	0 ]	1.0000 1.0000
sibling [ 5184278 ] : 6071757 1 (=13/(1*13)) 6.85435e-28
	best keyword for cluster 6071757 is PF07952 with Jaccard = 0.9286	[ 13	1	1100197	0 ]	0.9286 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF08470 ( PF08470   Nontoxic nonhaemagglutinin C-terminal )
	B> PF07952 ( PF07952   Clostridium neurotoxin, Translocation domain )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF08470 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:
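The parent/given/sibling lines follow the layout given in the legend: relation [ best cluster ] : cluster, %linkage (=edges/(|cluster1|*|cluster2|)), ProtoLevel. The sketch below shows one way to parse such a line and recheck the linkage figure. It is an illustration only: the regular expression, the function name parse_relation_line and the dictionary keys are assumptions made for this note, not part of the tool that wrote the report.

```python
import re

# Matches lines such as:
#   parent [ 5184278 ] : 6236339 1 (=140/(10*14)) 2.06062e-14
RELATION_LINE = re.compile(
    r"^\s*(parent|given|sibling)\s*\[\s*(\d+)\s*\]\s*:\s*(\d+)\s+(\S+)"
    r"\s+\(=(\d+)/\((\d+)\*(\d+)\)\)\s+(\S+)\s*$"
)


def parse_relation_line(line):
    """Parse one relation line; return None if the line has another layout."""
    m = RELATION_LINE.match(line)
    if m is None:
        return None
    relation, best, cluster, linkage, edges, n1, n2, level = m.groups()
    edges, n1, n2 = int(edges), int(n1), int(n2)
    return {
        "relation": relation,
        "best_cluster": int(best),
        "cluster": int(cluster),
        "linkage_reported": float(linkage),
        # Linkage per the legend: existing edges over all possible pairs.
        "linkage_recomputed": edges / (n1 * n2),
        # Cluster size per the legend: |cluster1| + |cluster2|.
        "cluster_size": n1 + n2,
        "proto_level": float(level),
    }


rec = parse_relation_line(" parent [ 5184278 ] : 6236339 1 (=140/(10*14)) 2.06062e-14")
print(rec["linkage_recomputed"], rec["cluster_size"])  # prints: 1.0 24
```

A linkage of 1 means every possible edge between the two merged subclusters exists, which is the case for all three relation lines of this record.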






-------------------====== ( 245 ) 6760921_PF00420_PF06235            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00420  is 6743340 with Jaccard = 0.9989	|PF00420|=929 [ 928	0	1099282	1 ]
 parent [ 6743340 ] : 6760921 0.0100538 (=200/(19*1047)) 99.3142
  given [ 6743340 ] : 6743340 0.0240034 (=4204/(838*209)) 98.0836
	best keyword for cluster 6743340 is PF00420 with Jaccard = 0.9989	[ 928	0	1099282	1 ]	1.0000 0.9989
sibling [ 6743340 ] : 6718092 0.0512821 (=4/(13*6)) 95.1635
	best keyword for cluster 6718092 is PF06235 with Jaccard = 1.0000	[ 11	0	1100200	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF00420 ( PF00420   NADH-ubiquinone/plastoquinone oxidoreductase chain 4L )
	B> PF06235 ( PF06235   NADH dehydrogenase subunit 4L (NAD4L) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF00420 nor PF06235 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 246 ) 6767216_PF01545_PF05181            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01545  is 6690713 with Jaccard = 0.9977	|PF01545|=880 [ 878	0	1099331	2 ]
 parent [ 6690713 ] : 6767216 0.00416271 (=263/(972*65)) 99.6059
  given [ 6690713 ] : 6690713 0.113659 (=3212/(30*942)) 90.4195
	best keyword for cluster 6690713 is PF01545 with Jaccard = 0.9977	[ 878	0	1099331	2 ]	1.0000 0.9977
sibling [ 6690713 ] : 6746882 0.0188889 (=17/(20*45)) 98.3799
	best keyword for cluster 6746882 is PF05181 with Jaccard = 0.9677	[ 30	1	1100180	0 ]	0.9677 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF01545 ( PF01545   Cation efflux family )
	B> PF05181 ( PF05181   XPA protein C-terminus )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF01545 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:
     1	PF05181 SSF46955 0.701 (average over 55 mutual instances, PF05181 57 appearances, SSF46955 11923 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 247 ) 6553025_PF01715_PF01745            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01715  is 6538777 with Jaccard = 0.9971	|PF01715|=339 [ 338	0	1099872	1 ]
 parent [ 6538777 ] : 6553025 0.653226 (=4617/(19*372)) 41.1118
  given [ 6538777 ] : 6538777 0.735849 (=273/(1*371)) 31.431
	best keyword for cluster 6538777 is PF01715 with Jaccard = 0.9971	[ 338	0	1099872	1 ]	1.0000 0.9971
sibling [ 6538777 ] : 6398972 1 (=18/(1*18)) 0.00723507
	best keyword for cluster 6398972 is PF01745 with Jaccard = 0.8571	[ 18	0	1100190	3 ]	1.0000 0.8571

	SUGGESTING RELATEDNESS OF:   
	A> PF01715 ( PF01715   IPP transferase )
	B> PF01745 ( PF01745   Isopentenyl transferase )

	Neither A nor B are assigned a clan.

	the two keywords coincide on UniRef90 proteins: |PF01715| = 339 ,	 |PF01745| = 21 , 	|PF01715^PF01745| = 3	( 0.9% and 14.3% )

	Neither PF01715 nor PF01745 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 248 ) 6743131_PF00871_PF07318            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00871  is 6558767 with Jaccard = 0.9969	|PF00871|=321 [ 320	0	1099890	1 ]
 parent [ 6558767 ] : 6743131 0.0249478 (=191/(348*22)) 98.066
  given [ 6558767 ] : 6558767 0.611272 (=423/(2*346)) 45.8346
	best keyword for cluster 6558767 is PF00871 with Jaccard = 0.9969	[ 320	0	1099890	1 ]	1.0000 0.9969
sibling [ 6558767 ] : 6716438 0.0583333 (=7/(10*12)) 94.9506
	best keyword for cluster 6716438 is PF07318 with Jaccard = 0.9231	[ 12	1	1100198	0 ]	0.9231 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF00871 ( PF00871   Acetokinase family )
	B> PF07318 ( PF07318   Protein of unknown function (DUF1464) )

	Only A has a clan ( CL0108.10 ).
	the two keywords do not coincide on UniRef90 proteins

	only  PF00871 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 249 ) 6685101_PF01268_PF02882            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01268  is 6574435 with Jaccard = 0.9968	|PF01268|=308 [ 307	0	1099903	1 ]
 parent [ 6574435 ] : 6685101 0.10742 (=12844/(318*376)) 89.2919
  given [ 6574435 ] : 6574435 0.47943 (=303/(2*316)) 52.0643
	best keyword for cluster 6574435 is PF01268 with Jaccard = 0.9968	[ 307	0	1099903	1 ]	1.0000 0.9968
sibling [ 6574435 ] : 6677784 0.165333 (=62/(1*375)) 87.6654
	best keyword for cluster 6677784 is PF02882 with Jaccard = 0.8992	[ 339	4	1099834	34 ]	0.9883 0.9088

	SUGGESTING RELATEDNESS OF:   
	A> PF01268 ( PF01268   Formate--tetrahydrofolate ligase )
	B> PF02882 ( PF02882   Tetrahydrofolate dehydrogenase/cyclohydrolase, NAD(P)-binding domain )

	Only B has a clan ( CL0063.17 ).

	the two keywords coincide on UniRef90 proteins: |PF01268| = 308 ,	 |PF02882| = 373 , 	|PF01268^PF02882| = 35	( 11.4% and 9.4% )

	both PF01268 and PF02882 have PDB structures
	PF01268 	   c.37.1.10
	
	SUPERFAM mapping significantly overlapping:
     1	PF02882 SSF51735 0.956 (average over 1086 mutual instances, PF02882 1086 appearances, SSF51735 164772 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 250 ) 6662778_PF01795_PF06962            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01795  is 6617005 with Jaccard = 0.9965	|PF01795|=284 [ 284	1	1099926	0 ]
 parent [ 6617005 ] : 6662778 0.197811 (=3290/(54*308)) 83.8393
  given [ 6617005 ] : 6617005 0.330055 (=302/(305*3)) 69.2658
	best keyword for cluster 6617005 is PF01795 with Jaccard = 0.9965	[ 284	1	1099926	0 ]	0.9965 1.0000
sibling [ 6617005 ] : 6456009 0.981132 (=52/(1*53)) 1.8999
	best keyword for cluster 6456009 is PF06962 with Jaccard = 0.9800	[ 49	0	1100161	1 ]	1.0000 0.9800

	SUGGESTING RELATEDNESS OF:   
	A> PF01795 ( PF01795   MraW methylase family )
	B> PF06962 ( PF06962   Putative rRNA methylase )

	they come from the same clan: CL0102.14 : PF06962	PF00398	PF06325	PF03291	PF01135	PF01358	PF06460	PF01189	PF05401	PF01234	PF01555	PF02384	PF07942	PF05175	PF05063	PF07109	PF02475	PF07021	PF08003	PF05148	PF01795	PF02390	PF01596	PF00891	PF09445	PF08242	PF08241	PF05971	PF02086	PF02527	PF08704	PF01728	PF01269	PF07669	PF06080	PF05891	PF05430	PF04816	PF04672	PF04445	PF04378	PF01861	PF03269	PF03141	PF07757	PF07279	PF05219	PF08123	PF00145	PF03602	PF02353	PF01739	PF06859	PF09243	PF01564	PF03848	PF05724	PF02005	PF05958	PF01209	PF01170
	the two keywords do not coincide on UniRef90 proteins

	only  PF01795 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 251 ) 6746842_PF00015_PF02470            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02470  is 6736817 with Jaccard = 0.9964	|PF02470|=557 [ 555	0	1099654	2 ]
 parent [ 6736817 ] : 6746842 0.0224868 (=66499/(628*4709)) 98.375
  given [ 6736817 ] : 6736817 0.0317717 (=177/(619*9)) 97.4622
	best keyword for cluster 6736817 is PF02470 with Jaccard = 0.9964	[ 555	0	1099654	2 ]	1.0000 0.9964
sibling [ 6736817 ] : 6745796 0.0236725 (=35831/(347*4362)) 98.2964
	best keyword for cluster 6745796 is PF00015 with Jaccard = 0.8016	[ 2735	648	1096799	29 ]	0.8085 0.9895

	SUGGESTING RELATEDNESS OF:   
	A> PF02470 ( PF02470   mce related protein )
	B> PF00015 ( PF00015   Methyl-accepting chemotaxis protein (MCP) signaling domain )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF02470 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 252 ) 6658992_PF00068_PF05826            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00068  is 6645295 with Jaccard = 0.9961	|PF00068|=254 [ 253	0	1099957	1 ]
 parent [ 6645295 ] : 6658992 0.24362 (=2482/(36*283)) 83.0149
  given [ 6645295 ] : 6645295 0.229537 (=129/(2*281)) 78.5045
	best keyword for cluster 6645295 is PF00068 with Jaccard = 0.9961	[ 253	0	1099957	1 ]	1.0000 0.9961
sibling [ 6645295 ] : 6616056 0.343434 (=34/(3*33)) 68.9341
	best keyword for cluster 6616056 is PF05826 with Jaccard = 1.0000	[ 32	0	1100179	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF00068 ( PF00068   Phospholipase A2 )
	B> PF05826 ( PF05826   Phospholipase A2 )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	both PF00068 and PF05826 have PDB structures
		
	SUPERFAM mapping significantly overlapping:
     1	PF05826 SSF48619 0.746 (average over 62 mutual instances, PF05826 62 appearances, SSF48619 849 appearances)
     2	PF00068 SSF48619 0.990 (average over 726 mutual instances, PF00068 726 appearances, SSF48619 849 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 253 ) 6651265_PF00977_PF01884            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00977  is 6632472 with Jaccard = 0.9959	|PF00977|=486 [ 484	0	1099725	2 ]
 parent [ 6632472 ] : 6651265 0.231575 (=6322/(52*525)) 80.4663
  given [ 6632472 ] : 6632472 0.294455 (=308/(2*523)) 75.2482
	best keyword for cluster 6632472 is PF00977 with Jaccard = 0.9959	[ 484	0	1099725	2 ]	1.0000 0.9959
sibling [ 6632472 ] : 6538508 0.696429 (=468/(28*24)) 31.1492
	best keyword for cluster 6538508 is PF01884 with Jaccard = 0.9804	[ 50	0	1100160	1 ]	1.0000 0.9804

	SUGGESTING RELATEDNESS OF:   
	A> PF00977 ( PF00977   Histidine biosynthesis protein )
	B> PF01884 ( PF01884   PcrB family )

	they come from the same clan: CL0036.17 : PF05690	PF01680	PF00834	PF01729	PF00697	PF03740	PF01884	PF00724	PF00215	PF03060	PF04095	PF04131	PF00478	PF00218	PF00977	PF01645	PF04309	PF01070	PF01207	PF04481	PF04476	PF01180	PF00701	PF01791	PF03932	PF03437	PF01081	PF00121	PF09370	PF02581	PF00290
	the two keywords do not coincide on UniRef90 proteins

	both PF00977 and PF01884 have PDB structures
	PF00977 	   c.1.2.1
	
	SUPERFAM mapping significantly overlapping:
     1	PF00977 SSF51366 0.938 (average over 1629 mutual instances, PF00977 1632 appearances, SSF51366 8168 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 254 ) 6643704_PF00873_PF02355            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00873  is 6640682 with Jaccard = 0.9952	|PF00873|=1263 [ 1257	0	1098948	6 ]
 parent [ 6640682 ] : 6643704 0.250534 (=188158/(523*1436)) 78.0119
  given [ 6640682 ] : 6640682 0.230447 (=1320/(4*1432)) 77.2349
	best keyword for cluster 6640682 is PF00873 with Jaccard = 0.9952	[ 1257	0	1098948	6 ]	1.0000 0.9952
sibling [ 6640682 ] : 6518549 0.814485 (=55284/(239*284)) 19.5749
	best keyword for cluster 6518549 is PF02355 with Jaccard = 0.9553	[ 449	20	1099741	1 ]	0.9574 0.9978

	SUGGESTING RELATEDNESS OF:   
	A> PF00873 ( PF00873   AcrB/AcrD/AcrF family )
	B> PF02355 ( PF02355   Protein export membrane protein )

	Only A has a clan ( CL0182.8 ).
	the two keywords do not coincide on UniRef90 proteins

	only  PF00873 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 255 ) 6748546_PF00316_PF00459            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00316  is 6540891 with Jaccard = 0.9951	|PF00316|=206 [ 205	0	1100005	1 ]
 parent [ 6540891 ] : 6748546 0.0209252 (=4349/(223*932)) 98.5054
  given [ 6540891 ] : 6540891 0.690045 (=305/(2*221)) 33.0142
	best keyword for cluster 6540891 is PF00316 with Jaccard = 0.9951	[ 205	0	1100005	1 ]	1.0000 0.9951
sibling [ 6540891 ] : 6741039 0.0325564 (=3013/(113*819)) 97.88
	best keyword for cluster 6741039 is PF00459 with Jaccard = 0.8765	[ 752	102	1099353	4 ]	0.8806 0.9947

	SUGGESTING RELATEDNESS OF:   
	A> PF00316 ( PF00316   Fructose-1-6-bisphosphatase )
	B> PF00459 ( PF00459   Inositol monophosphatase family )

	they come from the same clan: CL0171.6 : PF00316	PF03320	PF00459
	the two keywords do not coincide on UniRef90 proteins

	both PF00316 and PF00459 have PDB structures
	PF00316 	   e.7.1.1
	PF00459 	   e.7.1.1

	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 256 ) 6752572_PF02146_PF04502            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02146  is 6733556 with Jaccard = 0.9951	|PF02146|=409 [ 407	0	1099802	2 ]
 parent [ 6733556 ] : 6752572 0.0122138 (=463/(81*468)) 98.8082
  given [ 6733556 ] : 6733556 0.0299786 (=14/(1*467)) 97.1176
	best keyword for cluster 6733556 is PF02146 with Jaccard = 0.9951	[ 407	0	1099802	2 ]	1.0000 0.9951
sibling [ 6733556 ] : 6642691 0.240506 (=38/(2*79)) 77.7892
	best keyword for cluster 6642691 is PF04502 with Jaccard = 0.9863	[ 72	0	1100138	1 ]	1.0000 0.9863

	SUGGESTING RELATEDNESS OF:   
	A> PF02146 ( PF02146   Sir2 family )
	B> PF04502 ( PF04502   Family of unknown function (DUF572) )

	Only A has a clan ( CL0085.9 ).

	the two keywords coincide on UniRef90 proteins: |PF02146| = 409 ,	 |PF04502| = 73 , 	|PF02146^PF04502| = 1	( 0.2% and 1.4% )

	only  PF02146 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 257 ) 6769822_PF02416_PF07544            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02416  is 6755055 with Jaccard = 0.9951	|PF02416|=406 [ 404	0	1099805	2 ]
 parent [ 6755055 ] : 6769822 0.00409531 (=154/(68*553)) 99.7048
  given [ 6755055 ] : 6755055 0.0130956 (=199/(29*524)) 98.9733
	best keyword for cluster 6755055 is PF02416 with Jaccard = 0.9951	[ 404	0	1099805	2 ]	1.0000 0.9951
sibling [ 6755055 ] : 6751265 0.0138408 (=12/(51*17)) 98.7155
	best keyword for cluster 6751265 is PF07544 with Jaccard = 0.9130	[ 21	0	1100188	2 ]	1.0000 0.9130

	SUGGESTING RELATEDNESS OF:   
	A> PF02416 ( PF02416   mttA/Hcf106 family )
	B> PF07544 ( PF07544   RNA polymerase II transcription mediator )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF02416 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 258 ) 6758546_PF02699_PF04085            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04085  is 6585954 with Jaccard = 0.9950	|PF04085|=199 [ 198	0	1100012	1 ]
 parent [ 6585954 ] : 6758546 0.0139672 (=832/(219*272)) 99.1852
  given [ 6585954 ] : 6585954 0.486239 (=106/(1*218)) 55.8544
	best keyword for cluster 6585954 is PF04085 with Jaccard = 0.9950	[ 198	0	1100012	1 ]	1.0000 0.9950
sibling [ 6585954 ] : 6737068 0.0417234 (=276/(245*27)) 97.4961
	best keyword for cluster 6737068 is PF02699 with Jaccard = 0.9087	[ 209	21	1099981	0 ]	0.9087 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF04085 ( PF04085   rod shape-determining protein MreC )
	B> PF02699 ( PF02699   Preprotein translocase subunit )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF04085 nor PF02699 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:
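The per-cluster scores in this record can be re-derived from the [ TP FP TN FN ] counts. The sketch below is only a check, not the script that produced the report. It assumes Jaccard = TP/(TP+FP+FN), and that the column the legend labels "Specificity" is TP/(TP+FP) (more often called precision), since that is the formula that reproduces the printed value on the PF02699 row. The function name is hypothetical.

```python
def keyword_scores(tp, fp, tn, fn):
    """Re-derive the three scores printed after a [ TP FP TN FN ] block.

    tn is accepted only so the four counts can be passed in report order;
    none of the three scores below uses it.
    """
    jaccard = tp / (tp + fp + fn)
    # Assumption: the report's "Specificity" column is TP/(TP+FP).
    printed_specificity = tp / (tp + fp)
    sensitivity = tp / (tp + fn)
    return jaccard, printed_specificity, sensitivity


# Counts copied from the sibling row of this record (cluster 6737068, PF02699).
j, spec, sens = keyword_scores(209, 21, 1099981, 0)
print(f"{j:.4f} {spec:.4f} {sens:.4f}")  # prints: 0.9087 0.9087 1.0000
```

The same function applied to the given row (198, 0, 1100012, 1) gives a Jaccard of 198/199, which rounds to the printed 0.9950.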






-------------------====== ( 259 ) 6729668_PF01758_PF03977            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01758  is 6699795 with Jaccard = 0.9947	|PF01758|=377 [ 376	1	1099833	1 ]
 parent [ 6699795 ] : 6729668 0.0405675 (=1324/(69*473)) 96.6876
  given [ 6699795 ] : 6699795 0.0941692 (=730/(17*456)) 92.1405
	best keyword for cluster 6699795 is PF01758 with Jaccard = 0.9947	[ 376	1	1099833	1 ]	0.9973 0.9973
sibling [ 6699795 ] : 6216906 1 (=320/(5*64)) 6.56969e-16
	best keyword for cluster 6216906 is PF03977 with Jaccard = 0.9839	[ 61	0	1100149	1 ]	1.0000 0.9839

	SUGGESTING RELATEDNESS OF:   
	A> PF01758 ( PF01758   Sodium Bile acid symporter family )
	B> PF03977 ( PF03977   Na+-transporting methylmalonyl-CoA/oxaloacetate decarboxylase, beta subunit )

	they come from the same clan: CL0064.7 : PF06826	PF03547	PF03601	PF05684	PF05982	PF03616	PF06965	PF00999	PF03977	PF01758
	the two keywords do not coincide on UniRef90 proteins

	Neither PF01758 nor PF03977 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 260 ) 6741096_PF00177_PF05549            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00177  is 6606993 with Jaccard = 0.9946	|PF00177|=367 [ 365	0	1099844	2 ]
 parent [ 6606993 ] : 6741096 0.0297619 (=140/(392*12)) 97.8852
  given [ 6606993 ] : 6606993 0.353846 (=276/(2*390)) 65.1253
	best keyword for cluster 6606993 is PF00177 with Jaccard = 0.9946	[ 365	0	1099844	2 ]	1.0000 0.9946
sibling [ 6606993 ] : 6668816 0.2 (=4/(10*2)) 85.195
	best keyword for cluster 6668816 is PF05549 with Jaccard = 1.0000	[ 9	0	1100202	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF00177 ( PF00177   Ribosomal protein S7p/S5e )
	B> PF05549 ( PF05549   Allexivirus 40kDa protein )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF00177 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:
     1	PF00177 SSF47973 0.955 (average over 1488 mutual instances, PF00177 1489 appearances, SSF47973 1493 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 261 ) 6754138_PF00881_PF02277            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02277  is 6696819 with Jaccard = 0.9945	|PF02277|=182 [ 181	0	1100029	1 ]
 parent [ 6696819 ] : 6754138 0.0116079 (=2488/(197*1088)) 98.9145
  given [ 6696819 ] : 6696819 0.0972222 (=665/(45*152)) 91.7
	best keyword for cluster 6696819 is PF02277 with Jaccard = 0.9945	[ 181	0	1100029	1 ]	1.0000 0.9945
sibling [ 6696819 ] : 6738255 0.0319249 (=2000/(61*1027)) 97.6168
	best keyword for cluster 6738255 is PF00881 with Jaccard = 0.9383	[ 821	44	1099336	10 ]	0.9491 0.9880

	SUGGESTING RELATEDNESS OF:   
	A> PF02277 ( PF02277   Phosphoribosyltransferase )
	B> PF00881 ( PF00881   Nitroreductase family )

	Neither A nor B are assigned a clan.

	the two keywords coincide on UniRef90 proteins: |PF00881| = 831 ,	 |PF02277| = 182 , 	|PF00881^PF02277| = 4	( 0.5% and 2.2% )

	both PF02277 and PF00881 have PDB structures
	PF02277 	   c.39.1.1
	
	SUPERFAM mapping significantly overlapping:
     1	PF02277 SSF52733 0.934 (average over 503 mutual instances, PF02277 510 appearances, SSF52733 515 appearances)
     2	PF00881 SSF55469 0.829 (average over 2724 mutual instances, PF00881 2740 appearances, SSF55469 3051 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:
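The two percentages on the "coincide" line of this record are the shared protein count divided by each keyword's own count. A minimal sketch, assuming that reading; the function name is hypothetical, and the set-level Jaccard it also returns is not printed anywhere in the report.

```python
def coincidence(size_a, size_b, shared):
    """Return (% of A that also has B, % of B that also has A, Jaccard of A and B)."""
    pct_a = 100.0 * shared / size_a
    pct_b = 100.0 * shared / size_b
    # Extra value, not in the report: overlap relative to the union of both sets.
    jaccard = shared / (size_a + size_b - shared)
    return pct_a, pct_b, jaccard


# |PF00881| = 831, |PF02277| = 182, |PF00881^PF02277| = 4 (from this record)
pct_00881, pct_02277, jac = coincidence(831, 182, 4)
print(f"{pct_00881:.1f}% and {pct_02277:.1f}%")  # prints: 0.5% and 2.2%
```

With only 4 shared proteins out of 1009 in the union, the two keywords are close to disjoint on UniRef90, so the merge is not explained by co-occurrence on the same proteins.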






-------------------====== ( 262 ) 6778875_PF03152_PF04203            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04203  is 6774760 with Jaccard = 0.9944	|PF04203|=178 [ 177	0	1100033	1 ]
 parent [ 6774760 ] : 6778875 0.000955779 (=30/(133*236)) 99.9354
  given [ 6774760 ] : 6774760 0.00224905 (=19/(44*192)) 99.8497
	best keyword for cluster 6774760 is PF04203 with Jaccard = 0.9944	[ 177	0	1100033	1 ]	1.0000 0.9944
sibling [ 6774760 ] : 6773392 0.00272603 (=12/(62*71)) 99.8155
	best keyword for cluster 6773392 is PF03152 with Jaccard = 0.6304	[ 58	24	1100119	10 ]	0.7073 0.8529

	SUGGESTING RELATEDNESS OF:   
	A> PF04203 ( PF04203   Sortase family )
	B> PF03152 ( PF03152   Ubiquitin fusion degradation protein UFD1 )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	both PF04203 and PF03152 have PDB structures
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 263 ) 6751374_PF01150_PF02541            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01150  is 6711614 with Jaccard = 0.9942	|PF01150|=173 [ 172	0	1100038	1 ]
 parent [ 6711614 ] : 6751374 0.0188701 (=1168/(331*187)) 98.7241
  given [ 6711614 ] : 6711614 0.0626984 (=79/(180*7)) 94.1834
	best keyword for cluster 6711614 is PF01150 with Jaccard = 0.9942	[ 172	0	1100038	1 ]	1.0000 0.9942
sibling [ 6711614 ] : 6707944 0.074159 (=97/(327*4)) 93.6222
	best keyword for cluster 6707944 is PF02541 with Jaccard = 0.9933	[ 297	0	1099912	2 ]	1.0000 0.9933

	SUGGESTING RELATEDNESS OF:   
	A> PF01150 ( PF01150   GDA1/CD39 (nucleoside phosphatase) family )
	B> PF02541 ( PF02541   Ppx/GppA phosphatase family )

	they come from the same clan: CL0108.10 : PF06406	PF00480	PF02541	PF00814	PF06723	PF05378	PF01968	PF00012	PF03727	PF00349	PF02685	PF01150	PF02491	PF00370	PF02782	PF02543	PF01869	PF00022	PF00871	PF03702
	the two keywords do not coincide on UniRef90 proteins

	only  PF01150 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 264 ) 6733370_PF01936_PF04396            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01936  is 6714532 with Jaccard = 0.9939	|PF01936|=164 [ 164	1	1100046	0 ]
 parent [ 6714532 ] : 6733370 0.0374665 (=546/(247*59)) 97.0951
  given [ 6714532 ] : 6714532 0.0716487 (=186/(11*236)) 94.6519
	best keyword for cluster 6714532 is PF01936 with Jaccard = 0.9939	[ 164	1	1100046	0 ]	0.9939 1.0000
sibling [ 6714532 ] : 6719105 0.0471698 (=15/(6*53)) 95.2927
	best keyword for cluster 6719105 is PF04396 with Jaccard = 0.9762	[ 41	0	1100169	1 ]	1.0000 0.9762

	SUGGESTING RELATEDNESS OF:   
	A> PF01936 ( PF01936   Protein of unknown function DUF88 )
	B> PF04396 ( PF04396   Protein of unknown function, DUF537 )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF01936 nor PF04396 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 265 ) 6705925_PF00950_PF01032            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01032  is 6676879 with Jaccard = 0.9938	|PF01032|=809 [ 804	0	1099402	5 ]
 parent [ 6676879 ] : 6705925 0.0807105 (=32731/(874*464)) 93.2869
  given [ 6676879 ] : 6676879 0.15729 (=411/(3*871)) 87.4365
	best keyword for cluster 6676879 is PF01032 with Jaccard = 0.9938	[ 804	0	1099402	5 ]	1.0000 0.9938
sibling [ 6676879 ] : 6697540 0.0822242 (=834/(441*23)) 91.8064
	best keyword for cluster 6697540 is PF00950 with Jaccard = 0.9878	[ 404	4	1099802	1 ]	0.9902 0.9975

	SUGGESTING RELATEDNESS OF:   
	A> PF01032 ( PF01032   FecCD transport family )
	B> PF00950 ( PF00950   ABC 3 transport family )

	they come from the same clan: CL0142.6 : PF00950	PF05145	PF02653	PF01032	PF01098
	the two keywords do not coincide on UniRef90 proteins

	only  PF01032 has a PDB structure (may not be up to date)
	PF01032 	   f.22.1.1
	
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 266 ) 6763029_PF01925_PF07290            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01925  is 6700986 with Jaccard = 0.9932	|PF01925|=877 [ 871	0	1099334	6 ]
 parent [ 6700986 ] : 6763029 0.00696362 (=343/(1048*47)) 99.4184
  given [ 6700986 ] : 6700986 0.0958065 (=17626/(223*825)) 92.3528
	best keyword for cluster 6700986 is PF01925 with Jaccard = 0.9932	[ 871	0	1099334	6 ]	1.0000 0.9932
sibling [ 6700986 ] : 6751882 0.0163043 (=9/(23*24)) 98.7609
	best keyword for cluster 6751882 is PF07290 with Jaccard = 0.9286	[ 13	1	1100197	0 ]	0.9286 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF01925 ( PF01925   Domain of unknown function DUF81 )
	B> PF07290 ( PF07290   Protein of unknown function (DUF1449) )

	Only B has a clan ( CL0252.2 ).
	the two keywords do not coincide on UniRef90 proteins

	Neither PF01925 nor PF07290 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 267 ) 6524525_PF00006_PF07497            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF07497  is 6391976 with Jaccard = 0.9932	|PF07497|=145 [ 145	1	1100065	0 ]
 parent [ 6391976 ] : 6524525 0.796193 (=152904/(164*1171)) 22.7046
  given [ 6391976 ] : 6391976 1 (=163/(1*163)) 0.00281036
	best keyword for cluster 6391976 is PF07497 with Jaccard = 0.9932	[ 145	1	1100065	0 ]	0.9932 1.0000
sibling [ 6391976 ] : 6500669 0.896913 (=5229/(5*1166)) 11.9606
	best keyword for cluster 6500669 is PF00006 with Jaccard = 0.8694	[ 1092	4	1098955	160 ]	0.9964 0.8722

	SUGGESTING RELATEDNESS OF:   
	A> PF07497 ( PF07497   Rho termination factor, RNA-binding domain )
	B> PF00006 ( PF00006   ATP synthase alpha/beta family, nucleotide-binding domain )

	Only A has a clan ( CL0021.12 ).

	the two keywords coincide on UniRef90 proteins: |PF00006| = 1252 ,	 |PF07497| = 145 , 	|PF00006^PF07497| = 144	( 11.5% and 99.3% )

	both PF07497 and PF00006 have PDB structures
		PF00006 	   b.86.1.2  c.37.1.11

	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:
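The %LINKAGE values in this record follow the legend formula |existing_edges| / (|cluster1| * |cluster2|), with cluster size |cluster1| + |cluster2|. The sketch below recomputes them from the numbers printed above. The function and variable names are hypothetical, and it is a check of the arithmetic only, not the original pipeline.

```python
def linkage(existing_edges, size1, size2):
    """Fraction of possible BLAST edges between the two child clusters that exist."""
    return existing_edges / (size1 * size2)


# (edges, |cluster1|, |cluster2|) copied from the parent / given / sibling lines.
rows = {
    "parent 6524525": (152904, 164, 1171),
    "given 6391976": (163, 1, 163),
    "sibling 6500669": (5229, 5, 1166),
}

for name, (edges, s1, s2) in rows.items():
    print(f"{name}: linkage={linkage(edges, s1, s2):.6g} size={s1 + s2}")
```

In this record the two factors of the parent's denominator (164 and 1171) equal the sizes of the given and sibling clusters (1 + 163 and 5 + 1166), which is consistent with the parent being the merge of those two clusters.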






-------------------====== ( 268 ) 6737226_PF04610_PF07863            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04610  is 6726881 with Jaccard = 0.9931	|PF04610|=144 [ 143	0	1100067	1 ]
 parent [ 6726881 ] : 6737226 0.0323643 (=167/(24*215)) 97.5038
  given [ 6726881 ] : 6726881 0.0467715 (=452/(64*151)) 96.3417
	best keyword for cluster 6726881 is PF04610 with Jaccard = 0.9931	[ 143	0	1100067	1 ]	1.0000 0.9931
sibling [ 6726881 ] : 6665083 0.181818 (=8/(2*22)) 84.2675
	best keyword for cluster 6665083 is PF07863 with Jaccard = 1.0000	[ 15	0	1100196	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF04610 ( PF04610   TrbL/VirB6 plasmid conjugal transfer protein )
	B> PF07863 ( PF07863   Homologues of TraJ from Bacteroides conjugative transposon )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF04610 nor PF07863 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 269 ) 6705892_PF02091_PF02092            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02091  is 6439843 with Jaccard = 0.9928	|PF02091|=138 [ 137	0	1100073	1 ]
 parent [ 6439843 ] : 6705892 0.0673119 (=2177/(157*206)) 93.279
  given [ 6439843 ] : 6439843 0.993506 (=459/(154*3)) 0.649351
	best keyword for cluster 6439843 is PF02091 with Jaccard = 0.9928	[ 137	0	1100073	1 ]	1.0000 0.9928
sibling [ 6439843 ] : 6672209 0.165854 (=34/(1*205)) 86.0968
	best keyword for cluster 6672209 is PF02092 with Jaccard = 0.9381	[ 182	2	1100017	10 ]	0.9891 0.9479

	SUGGESTING RELATEDNESS OF:   
	A> PF02091 ( PF02091   Glycyl-tRNA synthetase alpha subunit )
	B> PF02092 ( PF02092   Glycyl-tRNA synthetase beta subunit )

	Only A has a clan ( CL0040.10 ).

	the two keywords coincide on UniRef90 proteins: |PF02091| = 138 ,	 |PF02092| = 192 , 	|PF02091^PF02092| = 11	( 8.0% and 5.7% )

	only  PF02091 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 270 ) 6716140_PF00768_PF02113            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02113  is 6648149 with Jaccard = 0.9926	|PF02113|=136 [ 135	0	1100075	1 ]
 parent [ 6648149 ] : 6716140 0.0725591 (=5029/(151*459)) 94.907
  given [ 6648149 ] : 6648149 0.22973 (=102/(148*3)) 79.4139
	best keyword for cluster 6648149 is PF02113 with Jaccard = 0.9926	[ 135	0	1100075	1 ]	1.0000 0.9926
sibling [ 6648149 ] : 6675816 0.14442 (=132/(2*457)) 87.111
	best keyword for cluster 6675816 is PF00768 with Jaccard = 0.9881	[ 416	2	1099790	3 ]	0.9952 0.9928

	SUGGESTING RELATEDNESS OF:   
	A> PF02113 ( PF02113   D-Ala-D-Ala carboxypeptidase 3 (S13) family )
	B> PF00768 ( PF00768   D-alanyl-D-alanine carboxypeptidase )

	they come from the same clan: CL0013.12 : PF02113	PF00768	PF04960	PF00144	PF00905

	the two keywords coincide on UniRef90 proteins: |PF00768| = 419 ,	 |PF02113| = 136 , 	|PF00768^PF02113| = 1	( 0.2% and 0.7% )

	both PF02113 and PF00768 have PDB structures
		
	SUPERFAM mapping significantly overlapping:
     1	PF02113 SSF56601 0.850 (average over 447 mutual instances, PF02113 449 appearances, SSF56601 18812 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 271 ) 6723068_PF02485_PF03267            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02485  is 6601232 with Jaccard = 0.9924	|PF02485|=132 [ 131	0	1100079	1 ]
 parent [ 6601232 ] : 6723068 0.0507246 (=364/(52*138)) 95.8668
  given [ 6601232 ] : 6601232 0.402523 (=989/(21*117)) 62.3826
	best keyword for cluster 6601232 is PF02485 with Jaccard = 0.9924	[ 131	0	1100079	1 ]	1.0000 0.9924
sibling [ 6601232 ] : 6587498 0.490196 (=25/(1*51)) 56.3828
	best keyword for cluster 6587498 is PF03267 with Jaccard = 1.0000	[ 48	0	1100163	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF02485 ( PF02485   Core-2/I-Branching enzyme )
	B> PF03267 ( PF03267   Domain of unknown function, DUF266 )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF02485 nor PF03267 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 272 ) 6755148_PF03788_PF04172            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04172  is 6595757 with Jaccard = 0.9923	|PF04172|=129 [ 129	1	1100081	0 ]
 parent [ 6595757 ] : 6755148 0.0120537 (=262/(152*143)) 98.979
  given [ 6595757 ] : 6595757 0.420582 (=188/(149*3)) 59.9851
	best keyword for cluster 6595757 is PF04172 with Jaccard = 0.9923	[ 129	1	1100081	0 ]	0.9923 1.0000
sibling [ 6595757 ] : 6630440 0.282609 (=195/(138*5)) 74.829
	best keyword for cluster 6630440 is PF03788 with Jaccard = 0.9764	[ 124	0	1100084	3 ]	1.0000 0.9764

	SUGGESTING RELATEDNESS OF:   
	A> PF04172 ( PF04172   LrgB-like family )
	B> PF03788 ( PF03788   LrgA family )

	Neither A nor B are assigned a clan.

	the two keywords coincide on UniRef90 proteins: |PF03788| = 127 ,	 |PF04172| = 129 , 	|PF03788^PF04172| = 1	( 0.8% and 0.8% )

	Neither PF04172 nor PF03788 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 273 ) 6767657_PF00278_PF01168            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01168  is 6746182 with Jaccard = 0.9919	|PF01168|=613 [ 611	3	1099595	2 ]
 parent [ 6746182 ] : 6767657 0.00528002 (=3167/(781*768)) 99.625
  given [ 6746182 ] : 6746182 0.0207981 (=3023/(306*475)) 98.3253
	best keyword for cluster 6746182 is PF01168 with Jaccard = 0.9919	[ 611	3	1099595	2 ]	0.9951 0.9967
sibling [ 6746182 ] : 6766706 0.00502084 (=53/(14*754)) 99.5864
	best keyword for cluster 6766706 is PF00278 with Jaccard = 0.9248	[ 615	46	1099546	4 ]	0.9304 0.9935

	SUGGESTING RELATEDNESS OF:   
	A> PF01168 ( PF01168   Alanine racemase, N-terminal domain )
	B> PF00278 ( PF00278   Pyridoxal-dependent decarboxylase, C-terminal sheet domain )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	both PF01168 and PF00278 have PDB structures
	PF01168 	   c.1.6.1  c.1.6.2
	
	SUPERFAM mapping significantly overlapping:
     1	PF00278 SSF50621 0.705 (average over 1601 mutual instances, PF00278 1639 appearances, SSF50621 5076 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 274 ) 6763955_PF01292_PF04264            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01292  is 6758683 with Jaccard = 0.9918	|PF01292|=365 [ 362	0	1099846	3 ]
 parent [ 6758683 ] : 6763955 0.00549582 (=920/(270*620)) 99.4633
  given [ 6758683 ] : 6758683 0.00876913 (=165/(32*588)) 99.1933
	best keyword for cluster 6758683 is PF01292 with Jaccard = 0.9918	[ 362	0	1099846	3 ]	1.0000 0.9918
sibling [ 6758683 ] : 6712397 0.0656566 (=104/(6*264)) 94.3155
	best keyword for cluster 6712397 is PF04264 with Jaccard = 0.9674	[ 208	2	1099996	5 ]	0.9905 0.9765

	SUGGESTING RELATEDNESS OF:   
	A> PF01292 ( PF01292   Cytochrome b561 family )
	B> PF04264 ( PF04264   YceI-like domain )

	Neither A nor B are assigned a clan.

	the two keywords coincide on UniRef90 proteins: |PF01292| = 365 ,	 |PF04264| = 213 , 	|PF01292^PF04264| = 4	( 1.1% and 1.9% )

	only  PF01292 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:
     1	PF04264 SSF101874 0.935 (average over 775 mutual instances, PF04264 819 appearances, SSF101874 813 appearances)
     2	PF01292 SSF81342 0.948 (average over 1156 mutual instances, PF01292 1185 appearances, SSF81342 82802 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 275 ) 6728285_PF02127_PF05343            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02127  is 6527533 with Jaccard = 0.9917	|PF02127|=120 [ 119	0	1100091	1 ]
 parent [ 6527533 ] : 6728285 0.0466621 (=1082/(124*187)) 96.5157
  given [ 6527533 ] : 6527533 0.764228 (=94/(1*123)) 24.7605
	best keyword for cluster 6527533 is PF02127 with Jaccard = 0.9917	[ 119	0	1100091	1 ]	1.0000 0.9917
sibling [ 6527533 ] : 6712483 0.0628415 (=46/(4*183)) 94.332
	best keyword for cluster 6712483 is PF05343 with Jaccard = 0.9607	[ 171	0	1100033	7 ]	1.0000 0.9607

	SUGGESTING RELATEDNESS OF:   
	A> PF02127 ( PF02127   Aminopeptidase I zinc metalloprotease (M18) )
	B> PF05343 ( PF05343   M42 glutamyl aminopeptidase )

	they come from the same clan: CL0035.11 : PF05343	PF04389	PF01546	PF02127	PF00883	PF00246	PF05450	PF04952
	the two keywords do not coincide on UniRef90 proteins

	both PF02127 and PF05343 have PDB structures
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 276 ) 6678820_PF03710_PF08335            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF03710  is 6261092 with Jaccard = 0.9917	|PF03710|=121 [ 120	0	1100090	1 ]
 parent [ 6261092 ] : 6678820 0.15667 (=3448/(131*168)) 87.8852
  given [ 6261092 ] : 6261092 1 (=258/(129*2)) 1.9382e-12
	best keyword for cluster 6261092 is PF03710 with Jaccard = 0.9917	[ 120	0	1100090	1 ]	1.0000 0.9917
sibling [ 6261092 ] : 6665383 0.183735 (=61/(2*166)) 84.3405
	best keyword for cluster 6665383 is PF08335 with Jaccard = 0.7584	[ 113	35	1100062	1 ]	0.7635 0.9912

	SUGGESTING RELATEDNESS OF:   
	A> PF03710 ( PF03710   Glutamate-ammonia ligase adenylyltransferase )
	B> PF08335 ( PF08335   GlnD PII-uridylyltransferase )

	Only A has a clan ( CL0260.2 ).
	the two keywords do not coincide on UniRef90 proteins

	only  PF03710 has a PDB structure (may not be up to date)
	PF03710 	   d.218.1.9
	
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 277 ) 6745083_PF03901_PF04921            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF03901  is 6607080 with Jaccard = 0.9917	|PF03901|=120 [ 119	0	1100091	1 ]
 parent [ 6607080 ] : 6745083 0.0183206 (=84/(131*35)) 98.2412
  given [ 6607080 ] : 6607080 0.387906 (=789/(18*113)) 65.2303
	best keyword for cluster 6607080 is PF03901 with Jaccard = 0.9917	[ 119	0	1100091	1 ]	1.0000 0.9917
sibling [ 6607080 ] : 6731220 0.0402299 (=7/(29*6)) 96.8569
	best keyword for cluster 6731220 is PF04921 with Jaccard = 0.9333	[ 28	2	1100181	0 ]	0.9333 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF03901 ( PF03901   Alg9-like mannosyltransferase family )
	B> PF04921 ( PF04921   XAP5 protein )

	Only A has a clan ( CL0111.6 ).

	the two keywords coincide on UniRef90 proteins: |PF03901| = 120 ,	 |PF04921| = 28 , 	|PF03901^PF04921| = 1	( 0.8% and 3.6% )

	Neither PF03901 nor PF04921 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 278 ) 6755711_PF01323_PF06965            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF06965  is 6502272 with Jaccard = 0.9916	|PF06965|=119 [ 118	0	1100092	1 ]
 parent [ 6502272 ] : 6755711 0.0105609 (=1406/(133*1001)) 99.0137
  given [ 6502272 ] : 6502272 0.900763 (=236/(2*131)) 12.1763
	best keyword for cluster 6502272 is PF06965 with Jaccard = 0.9916	[ 118	0	1100092	1 ]	1.0000 0.9916
sibling [ 6502272 ] : 6743735 0.0261836 (=438/(984*17)) 98.1195
	best keyword for cluster 6743735 is PF01323 with Jaccard = 0.9719	[ 518	3	1099678	12 ]	0.9942 0.9774

	SUGGESTING RELATEDNESS OF:   
	A> PF06965 ( PF06965   Na+/H+ antiporter 1 )
	B> PF01323 ( PF01323   DSBA-like thioredoxin domain )

	A and B come from different clans ( CL0064.7 , CL0172.11 ).

	the two keywords coincide on UniRef90 proteins: |PF01323| = 530 ,	 |PF06965| = 119 , 	|PF01323^PF06965| = 4	( 0.8% and 3.4% )

	both PF06965 and PF01323 have PDB structures
		PF01323 	   c.47.1.13

	SUPERFAM mapping significantly overlapping:
     1	PF01323 SSF52833 0.887 (average over 1786 mutual instances, PF01323 1794 appearances, SSF52833 34965 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 279 ) 6745017_PF02321_PF07405            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02321  is 6742278 with Jaccard = 0.9915	|PF02321|=1170 [ 1164	4	1099037	6 ]
 parent [ 6742278 ] : 6745017 0.019257 (=339/(12*1467)) 98.2359
  given [ 6742278 ] : 6742278 0.0242461 (=283/(8*1459)) 97.9984
	best keyword for cluster 6742278 is PF02321 with Jaccard = 0.9915	[ 1164	4	1099037	6 ]	0.9966 0.9949
sibling [ 6742278 ] : 6723188 0.0571429 (=2/(7*5)) 95.8857
	best keyword for cluster 6723188 is PF07405 with Jaccard = 0.7500	[ 3	1	1100207	0 ]	0.7500 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF02321 ( PF02321   Outer membrane efflux protein )
	B> PF07405 ( PF07405   Protein of unknown function (DUF1506) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF02321 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 280 ) 6629352_PF01053_PF06838            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01053  is 6621731 with Jaccard = 0.9914	|PF01053|=815 [ 809	1	1099395	6 ]
 parent [ 6621731 ] : 6629352 0.29154 (=15721/(61*884)) 74.3192
  given [ 6621731 ] : 6621731 0.436014 (=385/(1*883)) 71.2134
	best keyword for cluster 6621731 is PF01053 with Jaccard = 0.9914	[ 809	1	1099395	6 ]	0.9988 0.9926
sibling [ 6621731 ] : 6560120 0.551515 (=182/(6*55)) 46.9849
	best keyword for cluster 6560120 is PF06838 with Jaccard = 0.9455	[ 52	3	1100156	0 ]	0.9455 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF01053 ( PF01053   Cys/Met metabolism PLP-dependent enzyme )
	B> PF06838 ( PF06838   Aluminium resistance protein )

	they come from the same clan: CL0061.8 : PF05889	PF00464	PF03841	PF00282	PF01276	PF02347	PF01041	PF01053	PF01212	PF00266	PF00202	PF00155	PF06838	PF04864
	the two keywords do not coincide on UniRef90 proteins

	only  PF01053 has a PDB structure (may not be up to date)
	PF01053 	   c.67.1.3
	
	SUPERFAM mapping significantly overlapping:
     1	PF06838 SSF53383 0.898 (average over 166 mutual instances, PF06838 167 appearances, SSF53383 34644 appearances)
     2	PF01053 SSF53383 0.965 (average over 2570 mutual instances, PF01053 2583 appearances, SSF53383 34644 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 281 ) 6768387_PF02221_PF06011            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02221  is 6758279 with Jaccard = 0.9912	|PF02221|=113 [ 112	0	1100098	1 ]
 parent [ 6758279 ] : 6768387 0.00479167 (=69/(160*90)) 99.6532
  given [ 6758279 ] : 6758279 0.0111989 (=34/(138*22)) 99.1694
	best keyword for cluster 6758279 is PF02221 with Jaccard = 0.9912	[ 112	0	1100098	1 ]	1.0000 0.9912
sibling [ 6758279 ] : 6767238 0.011236 (=1/(1*89)) 99.6067
	best keyword for cluster 6767238 is PF06011 with Jaccard = 0.9508	[ 58	3	1100150	0 ]	0.9508 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF02221 ( PF02221   ML domain )
	B> PF06011 ( PF06011   Transient receptor potential (TRP) ion channel )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF02221 has a PDB structure (may not be up to date)
	PF02221 	   b.1.18.7
	
	SUPERFAM mapping significantly overlapping:
     1	PF02221 SSF81296 0.978 (average over 173 mutual instances, PF02221 173 appearances, SSF81296 30857 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 282 ) 6758016_PF01026_PF02126            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01026  is 6741382 with Jaccard = 0.9907	|PF01026|=536 [ 532	1	1099674	4 ]
 parent [ 6741382 ] : 6758016 0.0114149 (=314/(46*598)) 99.1546
  given [ 6741382 ] : 6741382 0.022766 (=147/(587*11)) 97.9137
	best keyword for cluster 6741382 is PF01026 with Jaccard = 0.9907	[ 532	1	1099674	4 ]	0.9981 0.9925
sibling [ 6741382 ] : 6482848 0.94375 (=453/(30*16)) 6.35177
	best keyword for cluster 6482848 is PF02126 with Jaccard = 0.9333	[ 42	0	1100166	3 ]	1.0000 0.9333

	SUGGESTING RELATEDNESS OF:   
	A> PF01026 ( PF01026   TatD related DNase )
	B> PF02126 ( PF02126   Phosphotriesterase family )

	they come from the same clan: CL0034.9 : PF01979	PF04909	PF07969	PF00962	PF01244	PF02811	PF02126	PF01026
	the two keywords do not coincide on UniRef90 proteins

	both PF01026 and PF02126 have PDB structures
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 283 ) 6769239_PF02329_PF03637            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF03637  is 6549674 with Jaccard = 0.9906	|PF03637|=106 [ 105	0	1100105	1 ]
 parent [ 6549674 ] : 6769239 0.00431732 (=16/(109*34)) 99.6847
  given [ 6549674 ] : 6549674 0.643468 (=1054/(18*91)) 38.6649
	best keyword for cluster 6549674 is PF03637 with Jaccard = 0.9906	[ 105	0	1100105	1 ]	1.0000 0.9906
sibling [ 6549674 ] : 6747293 0.0166667 (=4/(24*10)) 98.4108
	best keyword for cluster 6747293 is PF02329 with Jaccard = 1.0000	[ 7	0	1100204	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF03637 ( PF03637   Mob1/phocein family )
	B> PF02329 ( PF02329   Histidine carboxylase PI chain )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	both PF03637 and PF02329 have PDB structures
		
	SUPERFAM mapping significantly overlapping:
     1	PF03637 SSF101152 0.918 (average over 261 mutual instances, PF03637 261 appearances, SSF101152 271 appearances)
     2	PF02329 SSF56271 0.959 (average over 22 mutual instances, PF02329 22 appearances, SSF56271 104 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 284 ) 6614036_PF04168_PF04169            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04169  is 6514081 with Jaccard = 0.9905	|PF04169|=104 [ 104	1	1100106	0 ]
 parent [ 6514081 ] : 6614036 0.322947 (=2674/(115*72)) 68.0623
  given [ 6514081 ] : 6514081 0.836283 (=189/(2*113)) 17.0344
	best keyword for cluster 6514081 is PF04169 with Jaccard = 0.9905	[ 104	1	1100106	0 ]	0.9905 1.0000
sibling [ 6514081 ] : 6554825 0.742857 (=104/(2*70)) 42.674
	best keyword for cluster 6554825 is PF04168 with Jaccard = 0.6633	[ 65	0	1100113	33 ]	1.0000 0.6633

	SUGGESTING RELATEDNESS OF:   
	A> PF04169 ( PF04169   Domain of unknown function (DUF404) )
	B> PF04168 ( PF04168   Bacterial domain of unknown function (DUF403) )

	Neither A nor B are assigned a clan.

	the two keywords coincide on Uniref90 proteins: |PF04168| = 98 ,	 |PF04169| = 104 , 	|PF04168^PF04169| = 33	( 33.7% and 31.7% )

	Neither PF04169 nor PF04168 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:
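The co-occurrence percentages reported for this pair follow directly from the three set sizes. The sketch below is a minimal check, assuming each percentage is the shared protein count divided by one keyword's total, in the order the keywords are listed. The keyword-to-keyword Jaccard it also returns is an extra derived quantity, not a value the report prints, and `cooccurrence` is a hypothetical name.

```python
def cooccurrence(n_a, n_b, n_both):
    """Share of each keyword's proteins that also carry the other keyword."""
    pct_a = 100.0 * n_both / n_a              # percent of A's proteins that also have B
    pct_b = 100.0 * n_both / n_b              # percent of B's proteins that also have A
    jaccard = n_both / (n_a + n_b - n_both)   # overlap / union of the two keyword sets
    return pct_a, pct_b, jaccard


# |PF04168| = 98, |PF04169| = 104, |PF04168^PF04169| = 33
pct_a, pct_b, j = cooccurrence(98, 104, 33)
print(f"{pct_a:.1f}% and {pct_b:.1f}%")       # 33.7% and 31.7%
print(f"{j:.4f}")
```

A pair in which one percentage is high and the other low (one keyword almost always accompanied by the other) reads differently from a pair like this one, where roughly a third of each family carries both signatures.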






-------------------====== ( 285 ) 6745410_PF00416_PF06831            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF06831  is 6553051 with Jaccard = 0.9904	|PF06831|=309 [ 309	3	1099899	0 ]
 parent [ 6553051 ] : 6745410 0.0267345 (=3320/(344*361)) 98.2648
  given [ 6553051 ] : 6553051 0.647455 (=2786/(13*331)) 41.1455
	best keyword for cluster 6553051 is PF06831 with Jaccard = 0.9904	[ 309	3	1099899	0 ]	0.9904 1.0000
sibling [ 6553051 ] : 6737622 0.0527778 (=19/(1*360)) 97.5483
	best keyword for cluster 6737622 is PF00416 with Jaccard = 0.9969	[ 320	0	1099890	1 ]	1.0000 0.9969

	SUGGESTING RELATEDNESS OF:   
	A> PF06831 ( PF06831   Formamidopyrimidine-DNA glycosylase H2TH domain )
	B> PF00416 ( PF00416   Ribosomal protein S13/S18 )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	both PF06831 and PF00416 have PDB structures
	PF06831 	   a.156.1.2
	
	SUPERFAM mapping significantly overlapping:
     1	PF00416 SSF46946 0.898 (average over 1258 mutual instances, PF00416 1260 appearances, SSF46946 3615 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 286 ) 6759015_PF01730_PF02814            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01730  is 6562784 with Jaccard = 0.9903	|PF01730|=103 [ 102	0	1100108	1 ]
 parent [ 6562784 ] : 6759015 0.00816143 (=127/(133*117)) 99.2108
  given [ 6562784 ] : 6562784 0.522901 (=137/(2*131)) 49.214
	best keyword for cluster 6562784 is PF01730 with Jaccard = 0.9903	[ 102	0	1100108	1 ]	1.0000 0.9903
sibling [ 6562784 ] : 6752344 0.0194175 (=28/(103*14)) 98.7923
	best keyword for cluster 6752344 is PF02814 with Jaccard = 0.9451	[ 86	5	1100120	0 ]	0.9451 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF01730 ( PF01730   UreF )
	B> PF02814 ( PF02814   UreE urease accessory protein, N-terminal domain )

	Neither A nor B are assigned a clan.

	the two keywords coincide on Uniref90 proteins: |PF01730| = 103 ,	 |PF02814| = 86 , 	|PF01730^PF02814| = 1	( 1.0% and 1.2% )

	only  PF01730 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 287 ) 6650867_PF01222_PF06966            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01222  is 6554368 with Jaccard = 0.9902	|PF01222|=102 [ 101	0	1100109	1 ]
 parent [ 6554368 ] : 6650867 0.250959 (=2355/(102*92)) 80.2748
  given [ 6554368 ] : 6554368 0.643564 (=65/(1*101)) 42.2253
	best keyword for cluster 6554368 is PF01222 with Jaccard = 0.9902	[ 101	0	1100109	1 ]	1.0000 0.9902
sibling [ 6554368 ] : 6598067 0.450893 (=303/(8*84)) 60.6577
	best keyword for cluster 6598067 is PF06966 with Jaccard = 0.9419	[ 81	2	1100125	3 ]	0.9759 0.9643

	SUGGESTING RELATEDNESS OF:   
	A> PF01222 ( PF01222   Ergosterol biosynthesis ERG4/ERG24 family )
	B> PF06966 ( PF06966   Protein of unknown function (DUF1295) )

	they come from the same clan: CL0115.7 : PF04191	PF04140	PF01222	PF06966	PF02544
	the two keywords do not coincide on UniRef90 proteins

	Neither PF01222 nor PF06966 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 288 ) 6770153_PF03006_PF04080            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF03006  is 6768942 with Jaccard = 0.9902	|PF03006|=305 [ 302	0	1099906	3 ]
 parent [ 6768942 ] : 6770153 0.00475135 (=135/(77*369)) 99.7158
  given [ 6768942 ] : 6768942 0.00340582 (=26/(22*347)) 99.6746
	best keyword for cluster 6768942 is PF03006 with Jaccard = 0.9902	[ 302	0	1099906	3 ]	1.0000 0.9902
sibling [ 6768942 ] : 6760055 0.0107962 (=16/(38*39)) 99.2667
	best keyword for cluster 6760055 is PF04080 with Jaccard = 1.0000	[ 34	0	1100177	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF03006 ( PF03006   Haemolysin-III related )
	B> PF04080 ( PF04080   Per1-like )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF03006 nor PF04080 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 289 ) 6750217_PF01187_PF01361            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01361  is 6738423 with Jaccard = 0.9897	|PF01361|=194 [ 192	0	1100017	2 ]
 parent [ 6738423 ] : 6750217 0.0180963 (=377/(83*251)) 98.6341
  given [ 6738423 ] : 6738423 0.0308642 (=60/(8*243)) 97.6298
	best keyword for cluster 6738423 is PF01361 with Jaccard = 0.9897	[ 192	0	1100017	2 ]	1.0000 0.9897
sibling [ 6738423 ] : 6652507 0.195833 (=47/(3*80)) 80.8467
	best keyword for cluster 6652507 is PF01187 with Jaccard = 0.9714	[ 68	1	1100141	1 ]	0.9855 0.9855

	SUGGESTING RELATEDNESS OF:   
	A> PF01361 ( PF01361   Tautomerase enzyme )
	B> PF01187 ( PF01187   Macrophage migration inhibitory factor (MIF) )

	they come from the same clan: CL0082.7 : PF02962	PF01187	PF01361
	the two keywords do not coincide on UniRef90 proteins

	both PF01361 and PF01187 have PDB structures
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 290 ) 6745290_PF00049_PF03488            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00049  is 6730824 with Jaccard = 0.9890	|PF00049|=182 [ 180	0	1100029	2 ]
 parent [ 6730824 ] : 6745290 0.0286305 (=291/(44*231)) 98.2546
  given [ 6730824 ] : 6730824 0.0505165 (=489/(55*176)) 96.8109
	best keyword for cluster 6730824 is PF00049 with Jaccard = 0.9890	[ 180	0	1100029	2 ]	1.0000 0.9890
sibling [ 6730824 ] : 6713740 0.0810811 (=21/(7*37)) 94.5119
	best keyword for cluster 6713740 is PF03488 with Jaccard = 1.0000	[ 25	0	1100186	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF00049 ( PF00049   Insulin/IGF/Relaxin family )
	B> PF03488 ( PF03488   Nematode insulin-related peptide beta type )

	they come from the same clan: CL0239.3 : PF00049	PF03488
	the two keywords do not coincide on UniRef90 proteins

	only  PF00049 has a PDB structure (may not be up to date)
	PF00049 	   g.1.1.1  j.75.1.1
	
	SUPERFAM mapping significantly overlapping:
     1	PF00049 SSF56994 0.776 (average over 578 mutual instances, PF00049 578 appearances, SSF56994 698 appearances)
     2	PF03488 SSF56994 0.852 (average over 26 mutual instances, PF03488 26 appearances, SSF56994 698 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 291 ) 6607664_PF02538_PF05378            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02538  is 6537248 with Jaccard = 0.9887	|PF02538|=177 [ 175	0	1100034	2 ]
 parent [ 6537248 ] : 6607664 0.346224 (=13402/(207*187)) 65.5373
  given [ 6537248 ] : 6537248 0.727027 (=269/(2*185)) 30.299
	best keyword for cluster 6537248 is PF02538 with Jaccard = 0.9887	[ 175	0	1100034	2 ]	1.0000 0.9887
sibling [ 6537248 ] : 6474067 0.958146 (=9821/(125*82)) 4.4726
	best keyword for cluster 6474067 is PF05378 with Jaccard = 0.7462	[ 194	3	1099951	63 ]	0.9848 0.7549

	SUGGESTING RELATEDNESS OF:   
	A> PF02538 ( PF02538   Hydantoinase B/oxoprolinase )
	B> PF05378 ( PF05378   Hydantoinase/oxoprolinase N-terminal region )

	Only B has a clan ( CL0108.10 ).

	the two keywords coincide on Uniref90 proteins: |PF02538| = 177 ,	 |PF05378| = 257 , 	|PF02538^PF05378| = 60	( 33.9% and 23.3% )

	Neither PF02538 nor PF05378 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:
     1	PF05378 SSF53383 0.793 (average over 1 mutual instances, PF05378 1 appearances, SSF53383 34644 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 292 ) 6755212_PF00891_PF02545            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02545  is 6674688 with Jaccard = 0.9887	|PF02545|=353 [ 349	0	1099858	4 ]
 parent [ 6674688 ] : 6755212 0.0103212 (=2268/(391*562)) 98.9836
  given [ 6674688 ] : 6674688 0.166667 (=194/(388*3)) 86.8243
	best keyword for cluster 6674688 is PF02545 with Jaccard = 0.9887	[ 349	0	1099858	4 ]	1.0000 0.9887
sibling [ 6674688 ] : 6751447 0.0172977 (=667/(80*482)) 98.7291
	best keyword for cluster 6751447 is PF00891 with Jaccard = 0.6996	[ 368	146	1099685	12 ]	0.7160 0.9684

	SUGGESTING RELATEDNESS OF:   
	A> PF02545 ( PF02545   Maf-like protein )
	B> PF00891 ( PF00891   O-methyltransferase )

	A and B come from different clans ( CL0269.2 , CL0102.14 ).

	the two keywords coincide on Uniref90 proteins: |PF00891| = 380 ,	 |PF02545| = 353 , 	|PF00891^PF02545| = 7	( 1.8% and 2.0% )

	both PF02545 and PF00891 have PDB structures
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 293 ) 6691574_PF02317_PF07479            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF07479  is 6652458 with Jaccard = 0.9884	|PF07479|=344 [ 342	2	1099865	2 ]
 parent [ 6652458 ] : 6691574 0.114798 (=2591/(370*61)) 90.5913
  given [ 6652458 ] : 6652458 0.203252 (=75/(1*369)) 80.8118
	best keyword for cluster 6652458 is PF07479 with Jaccard = 0.9884	[ 342	2	1099865	2 ]	0.9942 0.9942
sibling [ 6652458 ] : 6675162 0.132143 (=37/(56*5)) 86.9912
	best keyword for cluster 6675162 is PF02317 with Jaccard = 0.9818	[ 54	1	1100156	0 ]	0.9818 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF07479 ( PF07479   NAD-dependent glycerol-3-phosphate dehydrogenase C-terminus )
	B> PF02317 ( PF02317   NAD/NADP octopine/nopaline dehydrogenase, alpha-helical domain )

	Only A has a clan ( CL0106.7 ).
	the two keywords do not coincide on UniRef90 proteins

	both PF07479 and PF02317 have PDB structures
		
	SUPERFAM mapping significantly overlapping:
     1	PF02317 SSF48179 0.882 (average over 195 mutual instances, PF02317 361 appearances, SSF48179 20570 appearances)
     2	PF07479 SSF48179 0.946 (average over 1191 mutual instances, PF07479 2352 appearances, SSF48179 20570 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 294 ) 6758849_PF01757_PF04235            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01757  is 6750455 with Jaccard = 0.9877	|PF01757|=733 [ 724	0	1099478	9 ]
 parent [ 6750455 ] : 6758849 0.0118531 (=3730/(315*999)) 99.2011
  given [ 6750455 ] : 6750455 0.0144163 (=473/(34*965)) 98.6519
	best keyword for cluster 6750455 is PF01757 with Jaccard = 0.9877	[ 724	0	1099478	9 ]	1.0000 0.9877
sibling [ 6750455 ] : 6748206 0.0221728 (=449/(225*90)) 98.4834
	best keyword for cluster 6748206 is PF04235 with Jaccard = 0.6290	[ 78	46	1100087	0 ]	0.6290 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF01757 ( PF01757   Acyltransferase family )
	B> PF04235 ( PF04235   Protein of unknown function (DUF418) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF01757 nor PF04235 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 295 ) 6767541_PF02082_PF03631            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF03631  is 6730402 with Jaccard = 0.9873	|PF03631|=315 [ 311	0	1099896	4 ]
 parent [ 6730402 ] : 6767541 0.00478897 (=964/(368*547)) 99.6202
  given [ 6730402 ] : 6730402 0.0366692 (=144/(11*357)) 96.7636
	best keyword for cluster 6730402 is PF03631 with Jaccard = 0.9873	[ 311	0	1099896	4 ]	1.0000 0.9873
sibling [ 6730402 ] : 6766996 0.00633903 (=89/(520*27)) 99.598
	best keyword for cluster 6766996 is PF02082 with Jaccard = 0.9307	[ 470	3	1099706	32 ]	0.9937 0.9363

	SUGGESTING RELATEDNESS OF:   
	A> PF03631 ( PF03631   Ribonuclease BN-like family )
	B> PF02082 ( PF02082   Transcriptional regulator )

	Only B has a clan ( CL0123.12 ).

	the two keywords coincide on Uniref90 proteins: |PF02082| = 502 ,	 |PF03631| = 315 , 	|PF02082^PF03631| = 5	( 1.0% and 1.6% )

	only  PF03631 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 296 ) 6737647_PF01330_PF02132            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01330  is 6697737 with Jaccard = 0.9870	|PF01330|=228 [ 227	2	1099981	1 ]
 parent [ 6697737 ] : 6737647 0.0476279 (=2811/(227*260)) 97.5529
  given [ 6697737 ] : 6697737 0.111543 (=86/(257*3)) 91.8403
	best keyword for cluster 6697737 is PF01330 with Jaccard = 0.9870	[ 227	2	1099981	1 ]	0.9913 0.9956
sibling [ 6697737 ] : 6475873 0.969027 (=219/(1*226)) 4.80038
	best keyword for cluster 6475873 is PF02132 with Jaccard = 0.9507	[ 193	10	1100008	0 ]	0.9507 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF01330 ( PF01330   RuvA N terminal domain )
	B> PF02132 ( PF02132   RecR protein )

	Only A has a clan ( CL0021.12 ).
	the two keywords do not coincide on UniRef90 proteins

	both PF01330 and PF02132 have PDB structures
		
	SUPERFAM mapping significantly overlapping:
     1	PF01330 SSF50249 0.982 (average over 769 mutual instances, PF01330 2238 appearances, SSF50249 52669 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 297 ) 6682370_PF00636_PF02137            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02137  is 6551095 with Jaccard = 0.9870	|PF02137|=77 [ 76	0	1100134	1 ]
 parent [ 6551095 ] : 6682370 0.128951 (=4900/(79*481)) 88.8153
  given [ 6551095 ] : 6551095 0.609649 (=139/(3*76)) 39.8307
	best keyword for cluster 6551095 is PF02137 with Jaccard = 0.9870	[ 76	0	1100134	1 ]	1.0000 0.9870
sibling [ 6551095 ] : 6656333 0.215546 (=513/(5*476)) 82.0966
	best keyword for cluster 6656333 is PF00636 with Jaccard = 0.7120	[ 356	88	1099711	56 ]	0.8018 0.8641

	SUGGESTING RELATEDNESS OF:   
	A> PF02137 ( PF02137   Adenosine-deaminase (editase) domain )
	B> PF00636 ( PF00636   RNase3 domain )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	both PF02137 and PF00636 have PDB structures
		
	SUPERFAM mapping significantly overlapping:
     1	PF00636 SSF69065 0.604 (average over 1400 mutual instances, PF00636 1401 appearances, SSF69065 2883 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 298 ) 6673857_PF00313_PF06961            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF06961  is 6503442 with Jaccard = 0.9867	|PF06961|=75 [ 74	0	1100136	1 ]
 parent [ 6503442 ] : 6673857 0.146562 (=9794/(81*825)) 86.6022
  given [ 6503442 ] : 6503442 0.905063 (=143/(2*79)) 12.8527
	best keyword for cluster 6503442 is PF06961 with Jaccard = 0.9867	[ 74	0	1100136	1 ]	1.0000 0.9867
sibling [ 6503442 ] : 6673302 0.160171 (=526/(4*821)) 86.4364
	best keyword for cluster 6673302 is PF00313 with Jaccard = 0.9633	[ 709	5	1099475	22 ]	0.9930 0.9699

	SUGGESTING RELATEDNESS OF:   
	A> PF06961 ( PF06961   Protein of unknown function (DUF1294) )
	B> PF00313 ( PF00313   'Cold-shock' DNA-binding domain )

	Only B has a clan ( CL0021.12 ).

	the two keywords coincide on Uniref90 proteins: |PF00313| = 731 ,	 |PF06961| = 75 , 	|PF00313^PF06961| = 14	( 1.9% and 18.7% )

	only  PF06961 has a PDB structure (may not be up to date)
		PF00313 	   b.40.4.5

	SUPERFAM mapping significantly overlapping:
     1	PF00313 SSF50249 0.971 (average over 2759 mutual instances, PF00313 2782 appearances, SSF50249 52669 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 299 ) 6594634_PF00478_PF01070            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00478  is 6560532 with Jaccard = 0.9866	|PF00478|=374 [ 369	0	1099837	5 ]
 parent [ 6560532 ] : 6594634 0.486234 (=80513/(415*399)) 59.2515
  given [ 6560532 ] : 6560532 0.565823 (=894/(4*395)) 47.1569
	best keyword for cluster 6560532 is PF00478 with Jaccard = 0.9866	[ 369	0	1099837	5 ]	1.0000 0.9866
sibling [ 6560532 ] : 6553162 0.646349 (=22767/(119*296)) 41.2847
	best keyword for cluster 6553162 is PF01070 with Jaccard = 0.8529	[ 313	45	1099844	9 ]	0.8743 0.9720

	SUGGESTING RELATEDNESS OF:   
	A> PF00478 ( PF00478   IMP dehydrogenase / GMP reductase domain )
	B> PF01070 ( PF01070   FMN-dependent dehydrogenase )

	they come from the same clan: CL0036.17 : PF05690	PF01680	PF00834	PF01729	PF00697	PF03740	PF01884	PF00724	PF00215	PF03060	PF04095	PF04131	PF00478	PF00218	PF00977	PF01645	PF04309	PF01070	PF01207	PF04481	PF04476	PF01180	PF00701	PF01791	PF03932	PF03437	PF01081	PF00121	PF09370	PF02581	PF00290
	the two keywords do not coincide on UniRef90 proteins

	both PF00478 and PF01070 have PDB structures
	PF00478 	   c.1.5.1
	PF01070 	   c.1.4.1

	SUPERFAM mapping significantly overlapping:
     1	PF00478 SSF51621 0.696 (average over 2 mutual instances, PF00478 2 appearances, SSF51621 12495 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 300 ) 6752904_PF01311_PF01312            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01311  is 6624208 with Jaccard = 0.9866	|PF01311|=224 [ 221	0	1099987	3 ]
 parent [ 6624208 ] : 6752904 0.0119479 (=909/(317*240)) 98.8319
  given [ 6624208 ] : 6624208 0.287815 (=137/(2*238)) 72.1473
	best keyword for cluster 6624208 is PF01311 with Jaccard = 0.9866	[ 221	0	1099987	3 ]	1.0000 0.9866
sibling [ 6624208 ] : 6522545 0.813291 (=257/(1*316)) 21.4382
	best keyword for cluster 6522545 is PF01312 with Jaccard = 0.9861	[ 284	0	1099923	4 ]	1.0000 0.9861

	SUGGESTING RELATEDNESS OF:   
	A> PF01311 ( PF01311   Bacterial export proteins, family 1 )
	B> PF01312 ( PF01312   FlhB HrpN YscU SpaS Family )

	Neither A nor B are assigned a clan.

	the two keywords coincide on Uniref90 proteins: |PF01311| = 224 ,	 |PF01312| = 288 , 	|PF01311^PF01312| = 3	( 1.3% and 1.0% )

	Neither PF01311 nor PF01312 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 301 ) 6724834_PF00076_PF05383            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF05383  is 6549022 with Jaccard = 0.9864	|PF05383|=147 [ 145	0	1100064	2 ]
 parent [ 6549022 ] : 6724834 0.0475652 (=30729/(155*4168)) 96.088
  given [ 6549022 ] : 6549022 0.626172 (=1469/(17*138)) 38.033
	best keyword for cluster 6549022 is PF05383 with Jaccard = 0.9864	[ 145	0	1100064	2 ]	1.0000 0.9864
sibling [ 6549022 ] : 6716126 0.0578344 (=29681/(127*4041)) 94.9031
	best keyword for cluster 6716126 is PF00076 with Jaccard = 0.8118	[ 3459	217	1095950	585 ]	0.9410 0.8553

	SUGGESTING RELATEDNESS OF:   
	A> PF05383 ( PF05383   La domain )
	B> PF00076 ( PF00076   RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain) )

	Only B has a clan ( CL0221.5 ).

	the two keywords coincide on Uniref90 proteins: |PF00076| = 4044 ,	 |PF05383| = 147 , 	|PF00076^PF05383| = 35	( 0.9% and 23.8% )

	both PF05383 and PF00076 have PDB structures
	PF05383 	   a.4.5.46
	
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 302 ) 6644915_PF00112_PF03051            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF03051  is 6633203 with Jaccard = 0.9861	|PF03051|=72 [ 71	0	1100139	1 ]
 parent [ 6633203 ] : 6644915 0.288649 (=27663/(76*1261)) 78.4904
  given [ 6633203 ] : 6633203 0.306667 (=23/(1*75)) 75.3572
	best keyword for cluster 6633203 is PF03051 with Jaccard = 0.9861	[ 71	0	1100139	1 ]	1.0000 0.9861
sibling [ 6633203 ] : 6635277 0.28254 (=356/(1*1260)) 75.8182
	best keyword for cluster 6635277 is PF00112 with Jaccard = 0.9572	[ 1163	28	1098996	24 ]	0.9765 0.9798

	SUGGESTING RELATEDNESS OF:   
	A> PF03051 ( PF03051   Peptidase C1-like family )
	B> PF00112 ( PF00112   Papain family cysteine protease )

	they come from the same clan: CL0125.9 : PF08715	PF01707	PF03569	PF01830	PF00851	PF03543	PF03416	PF05543	PF05533	PF03412	PF05415	PF05412	PF05411	PF05410	PF05408	PF05407	PF05379	PF05381	PF00648	PF03051	PF01831	PF01088	PF01640	PF00112	PF00877	PF05257	PF05382
	the two keywords do not coincide on UniRef90 proteins

	both PF03051 and PF00112 have PDB structures
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 303 ) 6510097_PF00330_PF06434            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF06434  is 6449454 with Jaccard = 0.9861	|PF06434|=71 [ 71	1	1100139	0 ]
 parent [ 6449454 ] : 6510097 0.852824 (=44601/(79*662)) 15.4905
  given [ 6449454 ] : 6449454 0.987179 (=77/(1*78)) 1.28205
	best keyword for cluster 6449454 is PF06434 with Jaccard = 0.9861	[ 71	1	1100139	0 ]	0.9861 1.0000
sibling [ 6449454 ] : 6497283 0.896804 (=2946/(5*657)) 10.5348
	best keyword for cluster 6497283 is PF00330 with Jaccard = 0.8779	[ 604	4	1099523	80 ]	0.9934 0.8830
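
	note on the linkage column: per the legend, %LINKAGE = |existing_edges|/(|cluster1|*|cluster2|) and the
	cluster size is |cluster1| + |cluster2|. The Python sketch below (function and variable names are
	hypothetical) recomputes the three linkage values of this record and checks that the sizes of the
	given and sibling clusters add up to the two factors shown for the parent.

```python
def linkage(existing_edges, size1, size2):
    """Fraction of possible edges between two sub-clusters that actually exist."""
    return existing_edges / (size1 * size2)


# (edges, |cluster1|, |cluster2|) copied from the parent/given/sibling lines above.
parent = linkage(44601, 79, 662)
given = linkage(77, 1, 78)
sibling = linkage(2946, 5, 657)

# Cluster size is |cluster1| + |cluster2| (legend), so the given and sibling
# sizes should match the two factors on the parent line.
given_size = 1 + 78
sibling_size = 5 + 657

print(f"{parent:.6f} {given:.6f} {sibling:.6f}", given_size, sibling_size)
```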

	SUGGESTING RELATEDNESS OF:   
	A> PF06434 ( PF06434   Aconitate hydratase 2 N-terminus )
	B> PF00330 ( PF00330   Aconitase family (aconitate hydratase) )

	Neither A nor B are assigned a clan.

	the two keywords coincide on UniRef90 proteins: |PF00330| = 684 ,	 |PF06434| = 71 , 	|PF00330^PF06434| = 70	( 10.2% and 98.6% )
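
	note on the two percentages: they appear to be the shared protein count divided by each keyword's own
	count, in the order the keywords are listed on the line. A minimal Python sketch under that assumption
	(names are hypothetical):

```python
def coincidence_percentages(size_first, size_second, shared):
    """Share of each keyword's proteins that also carry the other keyword."""
    return 100.0 * shared / size_first, 100.0 * shared / size_second


# |PF00330| = 684, |PF06434| = 71, |PF00330^PF06434| = 70 (from the line above).
pct_pf00330, pct_pf06434 = coincidence_percentages(684, 71, 70)
print(f"( {pct_pf00330:.1f}% and {pct_pf06434:.1f}% )")
```

	i.e. nearly every PF06434 protein also carries PF00330, while only about a tenth of PF00330 proteins carry PF06434.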

	both PF06434 and PF00330 have PDB structures
		PF00330 	   c.83.1.1

	SUPERFAM mapping significantly overlapping:
     1	PF00330 SSF53732 0.902 (average over 2340 mutual instances, PF00330 4017 appearances, SSF53732 3709 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 304 ) 6766417_PF02535_PF03773            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02535  is 6756659 with Jaccard = 0.9858	|PF02535|=493 [ 487	1	1099717	6 ]
 parent [ 6756659 ] : 6766417 0.00559701 (=888/(592*268)) 99.5743
  given [ 6756659 ] : 6756659 0.0112069 (=78/(580*12)) 99.0707
	best keyword for cluster 6756659 is PF02535 with Jaccard = 0.9858	[ 487	1	1099717	6 ]	0.9980 0.9878
sibling [ 6756659 ] : 6764145 0.00674916 (=24/(254*14)) 99.4723
	best keyword for cluster 6764145 is PF03773 with Jaccard = 0.9854	[ 202	3	1100006	0 ]	0.9854 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF02535 ( PF02535   ZIP Zinc transporter )
	B> PF03773 ( PF03773   Predicted permease )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF02535 nor PF03773 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 305 ) 6733106_PF04237_PF04944            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04237  is 6600049 with Jaccard = 0.9855	|PF04237|=69 [ 68	0	1100142	1 ]
 parent [ 6600049 ] : 6733106 0.0350708 (=109/(84*37)) 97.0666
  given [ 6600049 ] : 6600049 0.416964 (=585/(23*61)) 61.5661
	best keyword for cluster 6600049 is PF04237 with Jaccard = 0.9855	[ 68	0	1100142	1 ]	1.0000 0.9855
sibling [ 6600049 ] : 6670205 0.175 (=28/(5*32)) 85.5747
	best keyword for cluster 6670205 is PF04944 with Jaccard = 0.9524	[ 20	1	1100190	0 ]	0.9524 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF04237 ( PF04237   Protein of unknown function (DUF419) )
	B> PF04944 ( PF04944   Uncharacterised BCR (COG3801) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF04237 has a PDB structure (may not be up to date)
	PF04237 	   d.198.3.1
	
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 306 ) 6768972_PF07396_PF07642            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF07396  is 6749690 with Jaccard = 0.9855	|PF07396|=69 [ 68	0	1100142	1 ]
 parent [ 6749690 ] : 6768972 0.0044603 (=40/(38*236)) 99.6757
  given [ 6749690 ] : 6749690 0.0190538 (=265/(114*122)) 98.599
	best keyword for cluster 6749690 is PF07396 with Jaccard = 0.9855	[ 68	0	1100142	1 ]	1.0000 0.9855
sibling [ 6749690 ] : 6743610 0.027027 (=1/(1*37)) 98.1081
	best keyword for cluster 6743610 is PF07642 with Jaccard = 1.0000	[ 10	0	1100201	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF07396 ( PF07396   Phosphate-selective porin O and P )
	B> PF07642 ( PF07642   Protein of unknown function (DUF1597) )

	Only A has a clan ( CL0193.8 ).
	the two keywords do not coincide on UniRef90 proteins

	Neither PF07396 nor PF07642 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 307 ) 6761897_PF02949_PF08395            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02949  is 6645471 with Jaccard = 0.9852	|PF02949|=202 [ 200	1	1100008	2 ]
 parent [ 6645471 ] : 6761897 0.00846702 (=399/(204*231)) 99.365
  given [ 6645471 ] : 6645471 0.218905 (=132/(3*201)) 78.6399
	best keyword for cluster 6645471 is PF02949 with Jaccard = 0.9852	[ 200	1	1100008	2 ]	0.9950 0.9901
sibling [ 6645471 ] : 6760503 0.00888743 (=27/(14*217)) 99.2911
	best keyword for cluster 6760503 is PF08395 with Jaccard = 0.8587	[ 158	20	1100027	6 ]	0.8876 0.9634

	SUGGESTING RELATEDNESS OF:   
	A> PF02949 ( PF02949   7tm Odorant receptor )
	B> PF08395 ( PF08395   7tm Chemosensory receptor )

	they come from the same clan: CL0176.5 : PF02949	PF08395	PF03268	PF06151

	the two keywords coincide on UniRef90 proteins: |PF02949| = 202 ,	 |PF08395| = 164 , 	|PF02949^PF08395| = 1	( 0.5% and 0.6% )

	Neither PF02949 nor PF08395 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 308 ) 6719873_PF01199_PF07891            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01199  is 6533282 with Jaccard = 0.9851	|PF01199|=67 [ 66	0	1100144	1 ]
 parent [ 6533282 ] : 6719873 0.0464345 (=56/(67*18)) 95.397
  given [ 6533282 ] : 6533282 0.742424 (=49/(1*66)) 27.9689
	best keyword for cluster 6533282 is PF01199 with Jaccard = 0.9851	[ 66	0	1100144	1 ]	1.0000 0.9851
sibling [ 6533282 ] : 6677252 0.125 (=10/(8*10)) 87.5001
	best keyword for cluster 6677252 is PF07891 with Jaccard = 0.9167	[ 11	0	1100199	1 ]	1.0000 0.9167

	SUGGESTING RELATEDNESS OF:   
	A> PF01199 ( PF01199   Ribosomal protein L34e )
	B> PF07891 ( PF07891   Protein of unknown function (DUF1666) )

	Neither A nor B are assigned a clan.

	the two keywords coincide on UniRef90 proteins: |PF01199| = 67 ,	 |PF07891| = 12 , 	|PF01199^PF07891| = 1	( 1.5% and 8.3% )

	Neither PF01199 nor PF07891 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 309 ) 6763000_PF00210_PF00301            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00210  is 6725009 with Jaccard = 0.9850	|PF00210|=657 [ 655	8	1099546	2 ]
 parent [ 6725009 ] : 6763000 0.0082766 (=3783/(742*616)) 99.4169
  given [ 6725009 ] : 6725009 0.0485667 (=6433/(443*299)) 96.1086
	best keyword for cluster 6725009 is PF00210 with Jaccard = 0.9850	[ 655	8	1099546	2 ]	0.9879 0.9970
sibling [ 6725009 ] : 6762149 0.00847315 (=101/(20*596)) 99.3782
	best keyword for cluster 6762149 is PF00301 with Jaccard = 0.6096	[ 278	144	1099755	34 ]	0.6588 0.8910

	SUGGESTING RELATEDNESS OF:   
	A> PF00210 ( PF00210   Ferritin-like domain )
	B> PF00301 ( PF00301   Rubredoxin )

	A and B come from different clans ( CL0044.8 , CL0045.7 ).
	the two keywords do not coincide on UniRef90 proteins

	both PF00210 and PF00301 have PDB structures
	PF00210 	   a.25.1.1
	
	SUPERFAM mapping significantly overlapping:
     1	PF00210 SSF47240 0.870 (average over 2111 mutual instances, PF00210 2114 appearances, SSF47240 6970 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 310 ) 6735487_PF02613_PF06192            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02613  is 6514361 with Jaccard = 0.9844	|PF02613|=64 [ 63	0	1100147	1 ]
 parent [ 6514361 ] : 6735487 0.0338066 (=400/(68*174)) 97.3208
  given [ 6514361 ] : 6514361 0.831746 (=262/(5*63)) 17.2396
	best keyword for cluster 6514361 is PF02613 with Jaccard = 0.9844	[ 63	0	1100147	1 ]	1.0000 0.9844
sibling [ 6514361 ] : 6734154 0.037639 (=44/(167*7)) 97.1834
	best keyword for cluster 6734154 is PF06192 with Jaccard = 1.0000	[ 112	0	1100099	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF02613 ( PF02613   Nitrate reductase delta subunit )
	B> PF06192 ( PF06192   Cytoplasmic chaperone TorD )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF02613 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 311 ) 6746560_PF01248_PF04296            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04296  is 6662780 with Jaccard = 0.9843	|PF04296|=127 [ 125	0	1100084	2 ]
 parent [ 6662780 ] : 6746560 0.0190943 (=1197/(139*451)) 98.3545
  given [ 6662780 ] : 6662780 0.203463 (=188/(7*132)) 83.8411
	best keyword for cluster 6662780 is PF04296 with Jaccard = 0.9843	[ 125	0	1100084	2 ]	1.0000 0.9843
sibling [ 6662780 ] : 6745249 0.0222222 (=10/(1*450)) 98.2502
	best keyword for cluster 6745249 is PF01248 with Jaccard = 0.9758	[ 403	1	1099798	9 ]	0.9975 0.9782

	SUGGESTING RELATEDNESS OF:   
	A> PF04296 ( PF04296   Protein of unknown function (DUF448) )
	B> PF01248 ( PF01248   Ribosomal protein L7Ae/L30e/S12e/Gadd45 family )

	Only B has a clan ( CL0101.7 ).

	the two keywords coincide on UniRef90 proteins: |PF01248| = 412 ,	 |PF04296| = 127 , 	|PF01248^PF04296| = 6	( 1.5% and 4.7% )

	both PF04296 and PF01248 have PDB structures
	PF04296 	   d.192.1.1
	
	SUPERFAM mapping significantly overlapping:
     1	PF04296 SSF64376 0.891 (average over 378 mutual instances, PF04296 378 appearances, SSF64376 390 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 312 ) 6752660_PF01580_PF02534            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02534  is 6745172 with Jaccard = 0.9841	|PF02534|=189 [ 186	0	1100022	3 ]
 parent [ 6745172 ] : 6752660 0.0166277 (=6394/(290*1326)) 98.8145
  given [ 6745172 ] : 6745172 0.0186667 (=77/(15*275)) 98.2469
	best keyword for cluster 6745172 is PF02534 with Jaccard = 0.9841	[ 186	0	1100022	3 ]	1.0000 0.9841
sibling [ 6745172 ] : 6752078 0.0193745 (=381/(15*1311)) 98.7745
	best keyword for cluster 6752078 is PF01580 with Jaccard = 0.6172	[ 424	260	1099524	3 ]	0.6199 0.9930

	SUGGESTING RELATEDNESS OF:   
	A> PF02534 ( PF02534   TraG/TraD family )
	B> PF01580 ( PF01580   FtsK/SpoIIIE family )

	Only A has a clan ( CL0023.26 ).
	the two keywords do not coincide on UniRef90 proteins

	only  PF02534 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 313 ) 6734435_PF00379_PF07912            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00379  is 6708405 with Jaccard = 0.9840	|PF00379|=374 [ 368	0	1099837	6 ]
 parent [ 6708405 ] : 6734435 0.0299226 (=205/(17*403)) 97.2188
  given [ 6708405 ] : 6708405 0.0690919 (=245/(394*9)) 93.7012
	best keyword for cluster 6708405 is PF00379 with Jaccard = 0.9840	[ 368	0	1099837	6 ]	1.0000 0.9840
sibling [ 6708405 ] : 6699062 0.0857143 (=6/(10*7)) 92.0091
	best keyword for cluster 6699062 is PF07912 with Jaccard = 0.7273	[ 8	3	1100200	0 ]	0.7273 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF00379 ( PF00379   Insect cuticle protein )
	B> PF07912 ( PF07912   ERp29, N-terminal domain )

	Only B has a clan ( CL0172.11 ).
	the two keywords do not coincide on UniRef90 proteins

	only  PF00379 has a PDB structure (may not be up to date)
		PF07912 	   c.47.1.7

	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 314 ) 6750301_PF00246_PF04952            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04952  is 6716227 with Jaccard = 0.9840	|PF04952|=187 [ 185	1	1100023	2 ]
 parent [ 6716227 ] : 6750301 0.0184347 (=2452/(235*566)) 98.6405
  given [ 6716227 ] : 6716227 0.0616883 (=57/(4*231)) 94.9217
	best keyword for cluster 6716227 is PF04952 with Jaccard = 0.9840	[ 185	1	1100023	2 ]	0.9946 0.9893
sibling [ 6716227 ] : 6736069 0.0278559 (=109/(7*559)) 97.3805
	best keyword for cluster 6736069 is PF00246 with Jaccard = 0.9802	[ 494	2	1099707	8 ]	0.9960 0.9841

	SUGGESTING RELATEDNESS OF:   
	A> PF04952 ( PF04952   Succinylglutamate desuccinylase / Aspartoacylase family )
	B> PF00246 ( PF00246   Zinc carboxypeptidase )

	they come from the same clan: CL0035.11 : PF05343	PF04389	PF01546	PF02127	PF00883	PF00246	PF05450	PF04952
	the two keywords do not coincide on UniRef90 proteins

	both PF04952 and PF00246 have PDB structures
	PF04952 	   c.56.5.7
	
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 315 ) 6697008_PF00351_PF00800            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00351  is 6592298 with Jaccard = 0.9836	|PF00351|=122 [ 120	0	1100089	2 ]
 parent [ 6592298 ] : 6697008 0.0939436 (=5308/(129*438)) 91.7375
  given [ 6592298 ] : 6592298 0.448819 (=114/(2*127)) 58.1228
	best keyword for cluster 6592298 is PF00351 with Jaccard = 0.9836	[ 120	0	1100089	2 ]	1.0000 0.9836
sibling [ 6592298 ] : 6670139 0.154473 (=6370/(301*137)) 85.5298
	best keyword for cluster 6670139 is PF00800 with Jaccard = 0.6867	[ 274	123	1099812	2 ]	0.6902 0.9928

	SUGGESTING RELATEDNESS OF:   
	A> PF00351 ( PF00351   Biopterin-dependent aromatic amino acid hydroxylase )
	B> PF00800 ( PF00800   Prephenate dehydratase )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	both PF00351 and PF00800 have PDB structures
	PF00351 	   d.178.1.1
	
	SUPERFAM mapping significantly overlapping:
     1	PF00351 SSF56534 0.655 (average over 406 mutual instances, PF00351 407 appearances, SSF56534 477 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 316 ) 6746019_PF01111_PF03657            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01111  is 6557550 with Jaccard = 0.9833	|PF01111|=60 [ 59	0	1100151	1 ]
 parent [ 6557550 ] : 6746019 0.0169697 (=56/(66*50)) 98.3115
  given [ 6557550 ] : 6557550 0.570312 (=73/(2*64)) 44.8558
	best keyword for cluster 6557550 is PF01111 with Jaccard = 0.9833	[ 59	0	1100151	1 ]	1.0000 0.9833
sibling [ 6557550 ] : 6680536 0.126819 (=61/(37*13)) 88.3219
	best keyword for cluster 6680536 is PF03657 with Jaccard = 0.8000	[ 20	5	1100186	0 ]	0.8000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF01111 ( PF01111   Cyclin-dependent kinase regulatory subunit )
	B> PF03657 ( PF03657   Uncharacterised protein family (UPF0113) )

	Only B has a clan ( CL0178.11 ).
	the two keywords do not coincide on UniRef90 proteins

	only  PF01111 has a PDB structure (may not be up to date)
	PF01111 	   d.97.1.1
	
	SUPERFAM mapping significantly overlapping:
     1	PF01111 SSF55637 0.893 (average over 132 mutual instances, PF01111 134 appearances, SSF55637 134 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 317 ) 6737467_PF00884_PF02995            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02995  is 6562652 with Jaccard = 0.9833	|PF02995|=60 [ 59	0	1100151	1 ]
 parent [ 6562652 ] : 6737467 0.0356349 (=2245/(60*1050)) 97.531
  given [ 6562652 ] : 6562652 0.538462 (=224/(8*52)) 49.051
	best keyword for cluster 6562652 is PF02995 with Jaccard = 0.9833	[ 59	0	1100151	1 ]	1.0000 0.9833
sibling [ 6562652 ] : 6725365 0.0522488 (=273/(5*1045)) 96.1516
	best keyword for cluster 6725365 is PF00884 with Jaccard = 0.9673	[ 888	1	1099293	29 ]	0.9989 0.9684

	SUGGESTING RELATEDNESS OF:   
	A> PF02995 ( PF02995   Protein of unknown function (DUF229) )
	B> PF00884 ( PF00884   Sulfatase )

	they come from the same clan: CL0088.10 : PF00884	PF01663	PF08665	PF01676	PF02995	PF07394	PF00245
	the two keywords do not coincide on UniRef90 proteins

	only  PF02995 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 318 ) 6670998_PF00764_PF06508            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00764  is 6536206 with Jaccard = 0.9831	|PF00764|=236 [ 232	0	1099975	4 ]
 parent [ 6536206 ] : 6670998 0.190128 (=10562/(256*217)) 85.7853
  given [ 6536206 ] : 6536206 0.713725 (=182/(1*255)) 29.72
	best keyword for cluster 6536206 is PF00764 with Jaccard = 0.9831	[ 232	0	1099975	4 ]	1.0000 0.9831
sibling [ 6536206 ] : 6606478 0.407407 (=88/(1*216)) 64.9645
	best keyword for cluster 6606478 is PF06508 with Jaccard = 0.7606	[ 197	1	1099952	61 ]	0.9949 0.7636

	SUGGESTING RELATEDNESS OF:   
	A> PF00764 ( PF00764   Arginosuccinate synthase )
	B> PF06508 ( PF06508   ExsB )

	they come from the same clan: CL0039.7 : PF00764	PF00733	PF01171	PF01902	PF06508	PF02540	PF01507	PF02568	PF03054
	the two keywords do not coincide on UniRef90 proteins

	only  PF00764 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 319 ) 6610063_PF01159_PF01777            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01159  is 6549264 with Jaccard = 0.9831	|PF01159|=58 [ 58	1	1100152	0 ]
 parent [ 6549264 ] : 6610063 0.431795 (=1703/(68*58)) 66.7831
  given [ 6549264 ] : 6549264 0.630769 (=123/(3*65)) 38.2656
	best keyword for cluster 6549264 is PF01159 with Jaccard = 0.9831	[ 58	1	1100152	0 ]	0.9831 1.0000
sibling [ 6549264 ] : 6601011 0.401786 (=45/(2*56)) 62.1429
	best keyword for cluster 6601011 is PF01777 with Jaccard = 0.9818	[ 54	1	1100156	0 ]	0.9818 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF01159 ( PF01159   Ribosomal protein L6e )
	B> PF01777 ( PF01777   Ribosomal L27e protein family )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF01159 nor PF01777 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 320 ) 6745432_PF01984_PF05024            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01984  is 6677614 with Jaccard = 0.9831	|PF01984|=59 [ 58	0	1100152	1 ]
 parent [ 6677614 ] : 6745432 0.0177404 (=57/(51*63)) 98.2659
  given [ 6677614 ] : 6677614 0.129032 (=8/(1*62)) 87.6037
	best keyword for cluster 6677614 is PF01984 with Jaccard = 0.9831	[ 58	0	1100152	1 ]	1.0000 0.9831
sibling [ 6677614 ] : 6661139 0.180851 (=34/(4*47)) 83.5267
	best keyword for cluster 6661139 is PF05024 with Jaccard = 0.9556	[ 43	1	1100166	1 ]	0.9773 0.9773

	SUGGESTING RELATEDNESS OF:   
	A> PF01984 ( PF01984   Double-stranded DNA-binding domain )
	B> PF05024 ( PF05024   N-acetylglucosaminyl transferase component (Gpi1) )

	Neither A nor B are assigned a clan.

	the two keywords coincide on UniRef90 proteins: |PF01984| = 59 ,	 |PF05024| = 44 , 	|PF01984^PF05024| = 1	( 1.7% and 2.3% )

	only  PF01984 has a PDB structure (may not be up to date)
	PF01984 	   a.5.6.1
	
	SUPERFAM mapping significantly overlapping:
     1	PF01984 SSF46950 0.690 (average over 132 mutual instances, PF01984 133 appearances, SSF46950 134 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 321 ) 6736703_PF00731_PF01259            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01259  is 6727661 with Jaccard = 0.9830	|PF01259|=293 [ 289	1	1099917	4 ]
 parent [ 6727661 ] : 6736703 0.026216 (=3048/(337*345)) 97.4498
  given [ 6727661 ] : 6727661 0.0356394 (=306/(27*318)) 96.4414
	best keyword for cluster 6727661 is PF01259 with Jaccard = 0.9830	[ 289	1	1099917	4 ]	0.9966 0.9863
sibling [ 6727661 ] : 6542403 0.724826 (=13547/(70*267)) 33.7314
	best keyword for cluster 6542403 is PF00731 with Jaccard = 0.8664	[ 253	1	1099919	38 ]	0.9961 0.8694

	SUGGESTING RELATEDNESS OF:   
	A> PF01259 ( PF01259   SAICAR synthetase )
	B> PF00731 ( PF00731   AIR carboxylase )

	Neither A nor B are assigned a clan.

	the two keywords coincide on UniRef90 proteins: |PF00731| = 291 ,	 |PF01259| = 293 , 	|PF00731^PF01259| = 11	( 3.8% and 3.8% )

	both PF01259 and PF00731 have PDB structures
		PF00731 	   c.23.8.1

	SUPERFAM mapping significantly overlapping:
     1	PF00731 SSF52255 0.971 (average over 914 mutual instances, PF00731 1034 appearances, SSF52255 1010 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 322 ) 6689461_PF00860_PF00916            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00860  is 6658329 with Jaccard = 0.9828	|PF00860|=578 [ 570	2	1099631	8 ]
 parent [ 6658329 ] : 6689461 0.114728 (=51742/(637*708)) 90.1518
  given [ 6658329 ] : 6658329 0.194574 (=19299/(366*271)) 82.808
	best keyword for cluster 6658329 is PF00860 with Jaccard = 0.9828	[ 570	2	1099631	8 ]	0.9965 0.9862
sibling [ 6658329 ] : 6677780 0.12973 (=456/(5*703)) 87.6633
	best keyword for cluster 6677780 is PF00916 with Jaccard = 0.9812	[ 626	6	1099573	6 ]	0.9905 0.9905

	SUGGESTING RELATEDNESS OF:   
	A> PF00860 ( PF00860   Permease family )
	B> PF00916 ( PF00916   Sulfate transporter family )

	they come from the same clan: CL0062.8 : PF00860	PF03222	PF02133	PF00916	PF00474	PF03845	PF01235	PF00955	PF07331	PF02361	PF05525	PF03594	PF01490	PF00324
	the two keywords do not coincide on UniRef90 proteins

	Neither PF00860 nor PF00916 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 323 ) 6733526_PF00324_PF03845            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF03845  is 6703700 with Jaccard = 0.9826	|PF03845|=115 [ 113	0	1100096	2 ]
 parent [ 6703700 ] : 6733526 0.0355525 (=10998/(123*2515)) 97.1121
  given [ 6703700 ] : 6703700 0.0779661 (=46/(5*118)) 92.8466
	best keyword for cluster 6703700 is PF03845 with Jaccard = 0.9826	[ 113	0	1100096	2 ]	1.0000 0.9826
sibling [ 6703700 ] : 6727943 0.0502552 (=31744/(283*2232)) 96.478
	best keyword for cluster 6727943 is PF00324 with Jaccard = 0.8716	[ 2016	261	1097898	36 ]	0.8854 0.9825

	SUGGESTING RELATEDNESS OF:   
	A> PF03845 ( PF03845   Spore germination protein )
	B> PF00324 ( PF00324   Amino acid permease )

	they come from the same clan: CL0062.8 : PF00860	PF03222	PF02133	PF00916	PF00474	PF03845	PF01235	PF00955	PF07331	PF02361	PF05525	PF03594	PF01490	PF00324
	the two keywords do not coincide on UniRef90 proteins

	Neither PF03845 nor PF00324 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 324 ) 6745291_PF01333_PF05896            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01333  is 6448190 with Jaccard = 0.9825	|PF01333|=57 [ 56	0	1100154	1 ]
 parent [ 6448190 ] : 6745291 0.0301637 (=105/(59*59)) 98.2552
  given [ 6448190 ] : 6448190 1 (=58/(1*58)) 1.18178
	best keyword for cluster 6448190 is PF01333 with Jaccard = 0.9825	[ 56	0	1100154	1 ]	1.0000 0.9825
sibling [ 6448190 ] : 6691814 0.131579 (=15/(57*2)) 90.6596
	best keyword for cluster 6691814 is PF05896 with Jaccard = 1.0000	[ 48	0	1100163	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF01333 ( PF01333   Apocytochrome F, C-terminal )
	B> PF05896 ( PF05896   Na(+)-translocating NADH-quinone reductase subunit A (NQRA) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF01333 has a PDB structure (may not be up to date)
	PF01333 	   b.84.2.2  i.4.1.1
	
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 325 ) 6758782_PF01774_PF08514            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01774  is 6594794 with Jaccard = 0.9825	|PF01774|=114 [ 112	0	1100097	2 ]
 parent [ 6594794 ] : 6758782 0.00815719 (=115/(133*106)) 99.1982
  given [ 6594794 ] : 6594794 0.441463 (=543/(10*123)) 59.4299
	best keyword for cluster 6594794 is PF01774 with Jaccard = 0.9825	[ 112	0	1100097	2 ]	1.0000 0.9825
sibling [ 6594794 ] : 6745117 0.0177536 (=49/(60*46)) 98.2436
	best keyword for cluster 6745117 is PF08514 with Jaccard = 0.8913	[ 41	0	1100165	5 ]	1.0000 0.8913

	SUGGESTING RELATEDNESS OF:   
	A> PF01774 ( PF01774   UreD urease accessory protein )
	B> PF08514 ( PF08514   STAG domain )

	Neither A nor B are assigned a clan.

	the two keywords coincide on UniRef90 proteins: |PF01774| = 114 ,	 |PF08514| = 46 , 	|PF01774^PF08514| = 1	( 0.9% and 2.2% )

	Neither PF01774 nor PF08514 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 326 ) 6612606_PF04632_PF05976            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF05976  is 6577279 with Jaccard = 0.9825	|PF05976|=57 [ 56	0	1100154	1 ]
 parent [ 6577279 ] : 6612606 0.367611 (=6617/(120*150)) 67.5995
  given [ 6577279 ] : 6577279 0.502356 (=1386/(31*89)) 52.8915
	best keyword for cluster 6577279 is PF05976 with Jaccard = 0.9825	[ 56	0	1100154	1 ]	1.0000 0.9825
sibling [ 6577279 ] : 6603184 0.401932 (=1373/(28*122)) 63.1805
	best keyword for cluster 6603184 is PF04632 with Jaccard = 0.9861	[ 71	0	1100139	1 ]	1.0000 0.9861

	SUGGESTING RELATEDNESS OF:   
	A> PF05976 ( PF05976   Bacterial membrane protein of unknown function (DUF893) )
	B> PF04632 ( PF04632   Fusaric acid resistance protein conserved region )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF05976 nor PF04632 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:
     1	PF05976 SSF103473 0.754 (average over 1 mutual instances, PF05976 1 appearances, SSF103473 39293 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 327 ) 6779061_PF04833_PF06568            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF06568  is 6756758 with Jaccard = 0.9825	|PF06568|=56 [ 56	1	1100154	0 ]
 parent [ 6756758 ] : 6779061 0.00093985 (=10/(112*95)) 99.9384
  given [ 6756758 ] : 6756758 0.0151203 (=22/(97*15)) 99.0763
	best keyword for cluster 6756758 is PF06568 with Jaccard = 0.9825	[ 56	1	1100154	0 ]	0.9825 1.0000
sibling [ 6756758 ] : 6773606 0.00261233 (=5/(66*29)) 99.821
	best keyword for cluster 6773606 is PF04833 with Jaccard = 0.9677	[ 30	1	1100180	0 ]	0.9677 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF06568 ( PF06568   Domain of unknown function (DUF1127) )
	B> PF04833 ( PF04833   Phytochelatin synthetase-like conserved region )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF06568 nor PF04833 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 328 ) 6769321_PF01746_PF02590            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02590  is 6717257 with Jaccard = 0.9824	|PF02590|=170 [ 167	0	1100041	3 ]
 parent [ 6717257 ] : 6769321 0.00484979 (=257/(192*276)) 99.6875
  given [ 6717257 ] : 6717257 0.0511464 (=29/(3*189)) 95.0562
	best keyword for cluster 6717257 is PF02590 with Jaccard = 0.9824	[ 167	0	1100041	3 ]	1.0000 0.9824
sibling [ 6717257 ] : 6768473 0.00363636 (=1/(1*275)) 99.6567
	best keyword for cluster 6768473 is PF01746 with Jaccard = 0.7628	[ 238	0	1099899	74 ]	1.0000 0.7628

	SUGGESTING RELATEDNESS OF:   
	A> PF02590 ( PF02590   Uncharacterized ACR, COG1576 )
	B> PF01746 ( PF01746   tRNA (Guanine-1)-methyltransferase )

	they come from the same clan: CL0098.7 : PF02590	PF02598	PF04013	PF04452	PF00588	PF01746
	the two keywords do not coincide on UniRef90 proteins

	both PF02590 and PF01746 have PDB structures
	PF02590 	   c.116.1.3
	
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 329 ) 6676104_PF04131_PF05690            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04131  is 6218438 with Jaccard = 0.9818	|PF04131|=55 [ 54	0	1100156	1 ]
 parent [ 6218438 ] : 6676104 0.166173 (=3031/(60*304)) 87.1964
  given [ 6218438 ] : 6218438 1 (=611/(13*47)) 8.88923e-16
	best keyword for cluster 6218438 is PF04131 with Jaccard = 0.9818	[ 54	0	1100156	1 ]	1.0000 0.9818
sibling [ 6218438 ] : 6663867 0.165017 (=50/(1*303)) 84.0322
	best keyword for cluster 6663867 is PF05690 with Jaccard = 0.6281	[ 179	104	1099926	2 ]	0.6325 0.9890

	SUGGESTING RELATEDNESS OF:   
	A> PF04131 ( PF04131   Putative N-acetylmannosamine-6-phosphate epimerase )
	B> PF05690 ( PF05690   Thiazole biosynthesis protein ThiG )

	they come from the same clan: CL0036.17 : PF05690	PF01680	PF00834	PF01729	PF00697	PF03740	PF01884	PF00724	PF00215	PF03060	PF04095	PF04131	PF00478	PF00218	PF00977	PF01645	PF04309	PF01070	PF01207	PF04481	PF04476	PF01180	PF00701	PF01791	PF03932	PF03437	PF01081	PF00121	PF09370	PF02581	PF00290
	the two keywords do not coincide on UniRef90 proteins

	both PF04131 and PF05690 have PDB structures
		
	SUPERFAM mapping significantly overlapping:
     1	PF05690 SSF110399 0.947 (average over 551 mutual instances, PF05690 572 appearances, SSF110399 583 appearances)
     2	PF04131 SSF51366 0.852 (average over 132 mutual instances, PF04131 147 appearances, SSF51366 8168 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 330 ) 6547153_PF01680_PF05690            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01680  is 6507898 with Jaccard = 0.9817	|PF01680|=109 [ 107	0	1100102	2 ]
 parent [ 6507898 ] : 6547153 0.673611 (=14356/(192*111)) 36.8294
  given [ 6507898 ] : 6507898 0.936364 (=103/(1*110)) 14.5589
	best keyword for cluster 6507898 is PF01680 with Jaccard = 0.9817	[ 107	0	1100102	2 ]	1.0000 0.9817
sibling [ 6507898 ] : 6404184 0.999859 (=7099/(50*142)) 0.0148517
	best keyword for cluster 6404184 is PF05690 with Jaccard = 0.9724	[ 176	0	1100030	5 ]	1.0000 0.9724

	SUGGESTING RELATEDNESS OF:   
	A> PF01680 ( PF01680   SOR/SNZ family )
	B> PF05690 ( PF05690   Thiazole biosynthesis protein ThiG )

	they come from the same clan: CL0036.17 : PF05690	PF01680	PF00834	PF01729	PF00697	PF03740	PF01884	PF00724	PF00215	PF03060	PF04095	PF04131	PF00478	PF00218	PF00977	PF01645	PF04309	PF01070	PF01207	PF04481	PF04476	PF01180	PF00701	PF01791	PF03932	PF03437	PF01081	PF00121	PF09370	PF02581	PF00290

	the two keywords coincide on Uniref90 proteins: |PF01680| = 109 ,	 |PF05690| = 181 , 	|PF01680^PF05690| = 4	( 3.7% and 2.2% )

	only  PF01680 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:
     1	PF05690 SSF110399 0.947 (average over 551 mutual instances, PF05690 572 appearances, SSF110399 583 appearances)
     2	PF01680 SSF51366 0.778 (average over 331 mutual instances, PF01680 336 appearances, SSF51366 8168 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 331 ) 6777157_PF01801_PF04089            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04089  is 6762730 with Jaccard = 0.9811	|PF04089|=53 [ 52	0	1100158	1 ]
 parent [ 6762730 ] : 6777157 0.00161179 (=7/(101*43)) 99.9044
  given [ 6762730 ] : 6762730 0.00690449 (=12/(22*79)) 99.4054
	best keyword for cluster 6762730 is PF04089 with Jaccard = 0.9811	[ 52	0	1100158	1 ]	1.0000 0.9811
sibling [ 6762730 ] : 6769411 0.004329 (=2/(21*22)) 99.6905
	best keyword for cluster 6769411 is PF01801 with Jaccard = 0.9000	[ 9	1	1100201	0 ]	0.9000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF04089 ( PF04089   BRICHOS domain )
	B> PF01801 ( PF01801   Cytomegalovirus glycoprotein L )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF04089 nor PF01801 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 332 ) 6754109_PF03932_PF06089            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF06089  is 6386946 with Jaccard = 0.9811	|PF06089|=53 [ 52	0	1100158	1 ]
 parent [ 6386946 ] : 6754109 0.0108696 (=54/(54*92)) 98.913
  given [ 6386946 ] : 6386946 1 (=53/(1*53)) 0.00126453
	best keyword for cluster 6386946 is PF06089 with Jaccard = 0.9811	[ 52	0	1100158	1 ]	1.0000 0.9811
sibling [ 6386946 ] : 6675070 0.138577 (=37/(3*89)) 86.9357
	best keyword for cluster 6675070 is PF03932 with Jaccard = 0.9880	[ 82	0	1100128	1 ]	1.0000 0.9880

	SUGGESTING RELATEDNESS OF:   
	A> PF06089 ( PF06089   L-asparaginase II )
	B> PF03932 ( PF03932   CutC family )

	Only B has a clan ( CL0036.17 ).

	the two keywords coincide on Uniref90 proteins: |PF03932| = 83 ,	 |PF06089| = 53 , 	|PF03932^PF06089| = 1	( 1.2% and 1.9% )

	only  PF03932 has a PDB structure (may not be up to date)
		PF03932 	   c.1.30.1

	SUPERFAM mapping significantly overlapping:
     1	PF03932 SSF110395 0.885 (average over 322 mutual instances, PF03932 322 appearances, SSF110395 322 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 333 ) 6743873_PF01987_PF02342            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01987  is 6734663 with Jaccard = 0.9809	|PF01987|=157 [ 154	0	1100054	3 ]
 parent [ 6734663 ] : 6743873 0.0188837 (=653/(182*190)) 98.1331
  given [ 6734663 ] : 6734663 0.0377785 (=117/(19*163)) 97.2396
	best keyword for cluster 6734663 is PF01987 with Jaccard = 0.9809	[ 154	0	1100054	3 ]	1.0000 0.9809
sibling [ 6734663 ] : 6737941 0.031746 (=6/(1*189)) 97.5831
	best keyword for cluster 6737941 is PF02342 with Jaccard = 0.8917	[ 140	4	1100054	13 ]	0.9722 0.9150

	SUGGESTING RELATEDNESS OF:   
	A> PF01987 ( PF01987   Protein of unknown function DUF124 )
	B> PF02342 ( PF02342   Bacterial stress protein )

	Only B has a clan ( CL0128.6 ).

	the two keywords coincide on Uniref90 proteins: |PF01987| = 157 ,	 |PF02342| = 153 , 	|PF01987^PF02342| = 5	( 3.2% and 3.3% )

	only  PF01987 has a PDB structure (may not be up to date)
	PF01987 	   b.82.5.2
	
	SUPERFAM mapping significantly overlapping:
     1	PF01987 SSF51219 0.979 (average over 382 mutual instances, PF01987 383 appearances, SSF51219 425 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 334 ) 6720443_PF00214_PF02039            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00214  is 6607292 with Jaccard = 0.9808	|PF00214|=52 [ 51	0	1100159	1 ]
 parent [ 6607292 ] : 6720443 0.055668 (=55/(19*52)) 95.4812
  given [ 6607292 ] : 6607292 0.34955 (=194/(15*37)) 65.4522
	best keyword for cluster 6607292 is PF00214 with Jaccard = 0.9808	[ 51	0	1100159	1 ]	1.0000 0.9808
sibling [ 6607292 ] : 6526218 0.77381 (=65/(7*12)) 23.8695
	best keyword for cluster 6526218 is PF02039 with Jaccard = 1.0000	[ 9	0	1100202	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF00214 ( PF00214   Calcitonin / CGRP / IAPP family )
	B> PF02039 ( PF02039   Adrenomedullin )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF00214 has a PDB structure (may not be up to date)
	PF00214 	   j.42.1.1  j.6.1.1
	
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 335 ) 6717053_PF01087_PF01230            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01230  is 6708055 with Jaccard = 0.9807	|PF01230|=622 [ 611	1	1099588	11 ]
 parent [ 6708055 ] : 6717053 0.0614693 (=6374/(139*746)) 95.0262
  given [ 6708055 ] : 6708055 0.0708838 (=158/(3*743)) 93.6409
	best keyword for cluster 6708055 is PF01230 with Jaccard = 0.9807	[ 611	1	1099588	11 ]	0.9984 0.9823
sibling [ 6708055 ] : 6651712 0.229323 (=183/(6*133)) 80.5232
	best keyword for cluster 6651712 is PF01087 with Jaccard = 0.6316	[ 84	7	1100078	42 ]	0.9231 0.6667

	SUGGESTING RELATEDNESS OF:   
	A> PF01230 ( PF01230   HIT domain )
	B> PF01087 ( PF01087   Galactose-1-phosphate uridyl transferase, N-terminal domain )

	Only A has a clan ( CL0265.2 ).

	the two keywords coincide on Uniref90 proteins: |PF01087| = 126 ,	 |PF01230| = 622 , 	|PF01087^PF01230| = 1	( 0.8% and 0.2% )

	both PF01230 and PF01087 have PDB structures
	PF01230 	   d.13.1.1
	PF01087 	   d.13.1.2

	SUPERFAM mapping significantly overlapping:
     1	PF01230 SSF54197 0.754 (average over 1865 mutual instances, PF01230 1906 appearances, SSF54197 2604 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 336 ) 6734162_PF02661_PF05012            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF05012  is 6689837 with Jaccard = 0.9804	|PF05012|=101 [ 100	1	1100109	1 ]
 parent [ 6689837 ] : 6734162 0.0426443 (=3915/(214*429)) 97.185
  given [ 6689837 ] : 6689837 0.0985169 (=1116/(96*118)) 90.2336
	best keyword for cluster 6689837 is PF05012 with Jaccard = 0.9804	[ 100	1	1100109	1 ]	0.9901 0.9901
sibling [ 6689837 ] : 6733450 0.0341909 (=101/(7*422)) 97.1024
	best keyword for cluster 6733450 is PF02661 with Jaccard = 0.9878	[ 323	0	1099884	4 ]	1.0000 0.9878

	SUGGESTING RELATEDNESS OF:   
	A> PF05012 ( PF05012   Prophage maintenance system killer protein )
	B> PF02661 ( PF02661   Fic protein family )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF05012 nor PF02661 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 337 ) 6773633_PF00424_PF00539            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00424  is 6771379 with Jaccard = 0.9801	|PF00424|=704 [ 690	0	1099507	14 ]
 parent [ 6771379 ] : 6773633 0.00216402 (=2092/(763*1267)) 99.8217
  given [ 6771379 ] : 6771379 0.00262467 (=2/(1*762)) 99.7559
	best keyword for cluster 6771379 is PF00424 with Jaccard = 0.9801	[ 690	0	1099507	14 ]	1.0000 0.9801
sibling [ 6771379 ] : 6772814 0.00316957 (=20/(5*1262)) 99.7995
	best keyword for cluster 6772814 is PF00539 with Jaccard = 0.8292	[ 743	152	1099315	1 ]	0.8302 0.9987

	SUGGESTING RELATEDNESS OF:   
	A> PF00424 ( PF00424   REV protein (anti-repression trans-activator protein) )
	B> PF00539 ( PF00539   Transactivating regulatory protein (Tat) )

	Neither A nor B are assigned a clan.

	the two keywords coincide on Uniref90 proteins: |PF00424| = 704 ,	 |PF00539| = 744 , 	|PF00424^PF00539| = 3	( 0.4% and 0.4% )

	both PF00424 and PF00539 have PDB structures
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 338 ) 6756753_PF01578_PF05140            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF05140  is 6739787 with Jaccard = 0.9798	|PF05140|=99 [ 97	0	1100112	2 ]
 parent [ 6739787 ] : 6756753 0.0110831 (=1187/(153*700)) 99.0758
  given [ 6739787 ] : 6739787 0.0241379 (=28/(145*8)) 97.7558
	best keyword for cluster 6739787 is PF05140 with Jaccard = 0.9798	[ 97	0	1100112	2 ]	1.0000 0.9798
sibling [ 6739787 ] : 6748536 0.0195652 (=135/(10*690)) 98.5049
	best keyword for cluster 6748536 is PF01578 with Jaccard = 0.9609	[ 540	21	1099649	1 ]	0.9626 0.9982

	SUGGESTING RELATEDNESS OF:   
	A> PF05140 ( PF05140   ResB-like family )
	B> PF01578 ( PF01578   Cytochrome C assembly protein )

	Neither A nor B are assigned a clan.

	the two keywords coincide on Uniref90 proteins: |PF01578| = 541 ,	 |PF05140| = 99 , 	|PF01578^PF05140| = 2	( 0.4% and 2.0% )

	Neither PF05140 nor PF01578 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 339 ) 6688990_PF02065_PF05691            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF05691  is 6658986 with Jaccard = 0.9796	|PF05691|=49 [ 48	0	1100162	1 ]
 parent [ 6658986 ] : 6688990 0.118551 (=1466/(229*54)) 90.0517
  given [ 6658986 ] : 6658986 0.169935 (=26/(51*3)) 83.0067
	best keyword for cluster 6658986 is PF05691 with Jaccard = 0.9796	[ 48	0	1100162	1 ]	1.0000 0.9796
sibling [ 6658986 ] : 6658269 0.201794 (=270/(6*223)) 82.7771
	best keyword for cluster 6658269 is PF02065 with Jaccard = 0.9788	[ 185	4	1100022	0 ]	0.9788 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF05691 ( PF05691   Raffinose synthase or seed imbibition protein Sip1 )
	B> PF02065 ( PF02065   Melibiase )

	Only B has a clan ( CL0058.10 ).
	the two keywords do not coincide on UniRef90 proteins

	only  PF05691 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 340 ) 6741149_PF00226_PF06386            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF06386  is 6598397 with Jaccard = 0.9792	|PF06386|=48 [ 47	0	1100163	1 ]
 parent [ 6598397 ] : 6741149 0.0238828 (=4425/(60*3088)) 97.89
  given [ 6598397 ] : 6598397 0.426901 (=73/(3*57)) 60.9933
	best keyword for cluster 6598397 is PF06386 with Jaccard = 0.9792	[ 47	0	1100163	1 ]	1.0000 0.9792
sibling [ 6598397 ] : 6737905 0.0323698 (=897/(9*3079)) 97.5773
	best keyword for cluster 6737905 is PF00226 with Jaccard = 0.9188	[ 2468	109	1097525	109 ]	0.9577 0.9577

	SUGGESTING RELATEDNESS OF:   
	A> PF06386 ( PF06386   Gas vesicle synthesis protein GvpL/GvpF )
	B> PF00226 ( PF00226   DnaJ domain )

	Neither A nor B are assigned a clan.

	the two keywords coincide on Uniref90 proteins: |PF00226| = 2577 ,	 |PF06386| = 48 , 	|PF00226^PF06386| = 3	( 0.1% and 6.2% )

	only  PF00226 has a PDB structure (may not be up to date)
		PF00226 	   a.2.3.1

	SUPERFAM mapping significantly overlapping:
     1	PF00226 SSF46565 0.626 (average over 6995 mutual instances, PF00226 11372 appearances, SSF46565 12650 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 341 ) 6727943_PF00324_PF01235            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00324  is 6725973 with Jaccard = 0.9791	|PF00324|=2052 [ 2016	7	1098152	36 ]
 parent [ 6725973 ] : 6727943 0.0502552 (=31744/(283*2232)) 96.478
  given [ 6725973 ] : 6725973 0.0417415 (=372/(4*2228)) 96.2301
	best keyword for cluster 6725973 is PF00324 with Jaccard = 0.9791	[ 2016	7	1098152	36 ]	0.9965 0.9825
sibling [ 6725973 ] : 6675002 0.137993 (=154/(4*279)) 86.9053
	best keyword for cluster 6675002 is PF01235 with Jaccard = 0.9961	[ 254	0	1099956	1 ]	1.0000 0.9961

	SUGGESTING RELATEDNESS OF:   
	A> PF00324 ( PF00324   Amino acid permease )
	B> PF01235 ( PF01235   Sodium:alanine symporter family )

	they come from the same clan: CL0062.8 : PF00860	PF03222	PF02133	PF00916	PF00474	PF03845	PF01235	PF00955	PF07331	PF02361	PF05525	PF03594	PF01490	PF00324
	the two keywords do not coincide on UniRef90 proteins

	Neither PF00324 nor PF01235 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 342 ) 6749989_PF01694_PF04511            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01694  is 6743480 with Jaccard = 0.9791	|PF01694|=574 [ 563	1	1099636	11 ]
 parent [ 6743480 ] : 6749989 0.0176152 (=1321/(109*688)) 98.6183
  given [ 6743480 ] : 6743480 0.0210166 (=86/(6*682)) 98.0967
	best keyword for cluster 6743480 is PF01694 with Jaccard = 0.9791	[ 563	1	1099636	11 ]	0.9982 0.9808
sibling [ 6743480 ] : 6718815 0.0498282 (=58/(12*97)) 95.2505
	best keyword for cluster 6718815 is PF04511 with Jaccard = 0.9239	[ 85	3	1100119	4 ]	0.9659 0.9551

	SUGGESTING RELATEDNESS OF:   
	A> PF01694 ( PF01694   Rhomboid family )
	B> PF04511 ( PF04511   Der1-like family )

	they come from the same clan: CL0207.4 : PF04511	PF08551	PF01694
	the two keywords do not coincide on UniRef90 proteins

	Neither PF01694 nor PF04511 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 343 ) 6731304_PF00893_PF02694            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00893  is 6678461 with Jaccard = 0.9787	|PF00893|=327 [ 321	1	1099883	6 ]
 parent [ 6678461 ] : 6731304 0.0412858 (=1337/(368*88)) 96.8609
  given [ 6678461 ] : 6678461 0.122569 (=353/(8*360)) 87.8186
	best keyword for cluster 6678461 is PF00893 with Jaccard = 0.9787	[ 321	1	1099883	6 ]	0.9969 0.9817
sibling [ 6678461 ] : 6723718 0.047619 (=16/(84*4)) 95.9557
	best keyword for cluster 6723718 is PF02694 with Jaccard = 1.0000	[ 71	0	1100140	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF00893 ( PF00893   Small Multidrug Resistance protein )
	B> PF02694 ( PF02694   Uncharacterised BCR, YnfA/UPF0060 family )

	they come from the same clan: CL0184.5 : PF07857	PF04342	PF00892	PF05653	PF06027	PF00893	PF04142	PF06379	PF06800	PF03151	PF08449	PF02694
	the two keywords do not coincide on UniRef90 proteins

	only  PF00893 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 344 ) 6673941_PF03150_PF06537            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF06537  is 6484455 with Jaccard = 0.9783	|PF06537|=46 [ 45	0	1100165	1 ]
 parent [ 6484455 ] : 6673941 0.169163 (=2101/(54*230)) 86.6617
  given [ 6484455 ] : 6484455 0.95 (=190/(4*50)) 6.77325
	best keyword for cluster 6484455 is PF06537 with Jaccard = 0.9783	[ 45	0	1100165	1 ]	1.0000 0.9783
sibling [ 6484455 ] : 6631950 0.299808 (=468/(7*223)) 75.1271
	best keyword for cluster 6631950 is PF03150 with Jaccard = 1.0000	[ 156	0	1100055	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF06537 ( PF06537   Protein of unknown function (DUF1111) )
	B> PF03150 ( PF03150   Di-haem cytochrome c peroxidase )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF03150 has a PDB structure (may not be up to date)
		PF03150 	   a.3.1.5

	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 345 ) 6752337_PF01923_PF03928            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF03928  is 6729997 with Jaccard = 0.9779	|PF03928|=181 [ 177	0	1100030	4 ]
 parent [ 6729997 ] : 6752337 0.0124398 (=413/(166*200)) 98.7917
  given [ 6729997 ] : 6729997 0.0455623 (=269/(36*164)) 96.7207
	best keyword for cluster 6729997 is PF03928 with Jaccard = 0.9779	[ 177	0	1100030	4 ]	1.0000 0.9779
sibling [ 6729997 ] : 6676336 0.145455 (=24/(1*165)) 87.2621
	best keyword for cluster 6676336 is PF01923 with Jaccard = 0.9329	[ 153	0	1100047	11 ]	1.0000 0.9329

	SUGGESTING RELATEDNESS OF:   
	A> PF03928 ( PF03928   Domain of unknown function (DUF336) )
	B> PF01923 ( PF01923   Cobalamin adenosyltransferase )

	Neither A nor B are assigned a clan.

	the two keywords coincide on Uniref90 proteins: |PF01923| = 164 ,	 |PF03928| = 181 , 	|PF01923^PF03928| = 2	( 1.2% and 1.1% )

	both PF03928 and PF01923 have PDB structures
	PF03928 	   d.110.9.1
	PF01923 	   a.25.2.2

	SUPERFAM mapping significantly overlapping:
     1	PF01923 SSF89028 0.931 (average over 509 mutual instances, PF01923 510 appearances, SSF89028 542 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 346 ) 6760615_PF05489_PF06893            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF05489  is 6668813 with Jaccard = 0.9778	|PF05489|=45 [ 44	0	1100166	1 ]
 parent [ 6668813 ] : 6760615 0.00887949 (=21/(55*43)) 99.2972
  given [ 6668813 ] : 6668813 0.177177 (=118/(37*18)) 85.1917
	best keyword for cluster 6668813 is PF05489 with Jaccard = 0.9778	[ 44	0	1100166	1 ]	1.0000 0.9778
sibling [ 6668813 ] : 6755880 0.0238095 (=1/(1*42)) 99.0238
	best keyword for cluster 6755880 is PF06893 with Jaccard = 0.9286	[ 26	2	1100183	0 ]	0.9286 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF05489 ( PF05489   Phage Tail Protein X )
	B> PF06893 ( PF06893   Bacteriophage Mu P protein )

	Only A has a clan ( CL0187.6 ).
	the two keywords do not coincide on UniRef90 proteins

	Neither PF05489 nor PF06893 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 347 ) 6764035_PF01157_PF03524            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF03524  is 6733553 with Jaccard = 0.9776	|PF03524|=134 [ 131	0	1100077	3 ]
 parent [ 6733553 ] : 6764035 0.00624117 (=106/(193*88)) 99.4673
  given [ 6733553 ] : 6733553 0.0349138 (=243/(48*145)) 97.1173
	best keyword for cluster 6733553 is PF03524 with Jaccard = 0.9776	[ 131	0	1100077	3 ]	1.0000 0.9776
sibling [ 6733553 ] : 6762942 0.0114943 (=1/(1*87)) 99.4138
	best keyword for cluster 6762942 is PF01157 with Jaccard = 1.0000	[ 81	0	1100130	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF03524 ( PF03524   Conjugal transfer protein )
	B> PF01157 ( PF01157   Ribosomal protein L21e )

	Only B has a clan ( CL0107.7 ).
	the two keywords do not coincide on UniRef90 proteins

	only  PF03524 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:
     1	PF01157 SSF50104 0.997 (average over 235 mutual instances, PF01157 236 appearances, SSF50104 9220 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 348 ) 6670696_PF01041_PF01276            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01041  is 6650488 with Jaccard = 0.9774	|PF01041|=654 [ 649	10	1099547	5 ]
 parent [ 6650488 ] : 6670696 0.163765 (=18852/(159*724)) 85.6823
  given [ 6650488 ] : 6650488 0.235131 (=170/(1*723)) 80.2206
	best keyword for cluster 6650488 is PF01041 with Jaccard = 0.9774	[ 649	10	1099547	5 ]	0.9848 0.9924
sibling [ 6650488 ] : 6532901 0.760684 (=356/(3*156)) 27.5861
	best keyword for cluster 6532901 is PF01276 with Jaccard = 0.9605	[ 146	5	1100059	1 ]	0.9669 0.9932

	SUGGESTING RELATEDNESS OF:   
	A> PF01041 ( PF01041   DegT/DnrJ/EryC1/StrS aminotransferase family )
	B> PF01276 ( PF01276   Orn/Lys/Arg decarboxylase, major domain )

	they come from the same clan: CL0061.8 : PF05889	PF00464	PF03841	PF00282	PF01276	PF02347	PF01041	PF01053	PF01212	PF00266	PF00202	PF00155	PF06838	PF04864
	the two keywords do not coincide on UniRef90 proteins

	both PF01041 and PF01276 have PDB structures
	PF01041 	   c.67.1.4
	
	SUPERFAM mapping significantly overlapping:
     1	PF01041 SSF53383 0.932 (average over 1805 mutual instances, PF01041 1817 appearances, SSF53383 34644 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 349 ) 6599234_PF04101_PF06925            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF06925  is 6478966 with Jaccard = 0.9773	|PF06925|=43 [ 43	1	1100167	0 ]
 parent [ 6478966 ] : 6599234 0.41884 (=5927/(267*53)) 61.167
  given [ 6478966 ] : 6478966 0.945833 (=227/(5*48)) 5.4698
	best keyword for cluster 6478966 is PF06925 with Jaccard = 0.9773	[ 43	1	1100167	0 ]	0.9773 1.0000
sibling [ 6478966 ] : 6375579 1 (=16892/(103*164)) 0.000250955
	best keyword for cluster 6375579 is PF04101 with Jaccard = 0.6096	[ 242	0	1099814	155 ]	1.0000 0.6096

	SUGGESTING RELATEDNESS OF:   
	A> PF06925 ( PF06925   Monogalactosyldiacylglycerol (MGDG) synthase )
	B> PF04101 ( PF04101   Glycosyltransferase family 28 C-terminal domain )

	they come from the same clan: CL0113.8 : PF06925	PF02684	PF04464	PF04101	PF01075	PF03033	PF00982	PF00534	PF05693	PF02350	PF04007	PF06722	PF05159	PF08660	PF00343	PF00201

	the two keywords coincide on UniRef90 proteins: |PF04101| = 397 ,	 |PF06925| = 43 , 	|PF04101^PF06925| = 19	( 4.8% and 44.2% )

	only  PF06925 has a PDB structure (may not be up to date)
		PF04101 	   c.87.1.2

	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 350 ) 6736629_PF00823_PF08237            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF08237  is 6681375 with Jaccard = 0.9773	|PF08237|=44 [ 43	0	1100167	1 ]
 parent [ 6681375 ] : 6736629 0.0264975 (=403/(67*227)) 97.439
  given [ 6681375 ] : 6681375 0.121118 (=117/(21*46)) 88.525
	best keyword for cluster 6681375 is PF08237 with Jaccard = 0.9773	[ 43	0	1100167	1 ]	1.0000 0.9773
sibling [ 6681375 ] : 6716602 0.0560538 (=50/(4*223)) 94.993
	best keyword for cluster 6716602 is PF00823 with Jaccard = 0.7875	[ 126	27	1100051	7 ]	0.8235 0.9474

	SUGGESTING RELATEDNESS OF:   
	A> PF08237 ( PF08237   PE-PPE domain )
	B> PF00823 ( PF00823   PPE family )

	Neither A nor B are assigned a clan.

	the two keywords coincide on UniRef90 proteins: |PF00823| = 133 ,	 |PF08237| = 44 , 	|PF00823^PF08237| = 3	( 2.3% and 6.8% )

	Neither PF08237 nor PF00823 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 351 ) 6750901_PF01722_PF02657            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02657  is 6551215 with Jaccard = 0.9769	|PF02657|=130 [ 127	0	1100081	3 ]
 parent [ 6551215 ] : 6750901 0.0133596 (=645/(142*340)) 98.6871
  given [ 6551215 ] : 6551215 0.601918 (=251/(3*139)) 39.9631
	best keyword for cluster 6551215 is PF02657 with Jaccard = 0.9769	[ 127	0	1100081	3 ]	1.0000 0.9769
sibling [ 6551215 ] : 6717119 0.0530973 (=18/(1*339)) 95.0367
	best keyword for cluster 6717119 is PF01722 with Jaccard = 0.9810	[ 310	0	1099895	6 ]	1.0000 0.9810

	SUGGESTING RELATEDNESS OF:   
	A> PF02657 ( PF02657   Fe-S metabolism associated domain )
	B> PF01722 ( PF01722   BolA-like protein )

	Only A has a clan ( CL0233.3 ).

	the two keywords coincide on UniRef90 proteins: |PF01722| = 316 ,	 |PF02657| = 130 , 	|PF01722^PF02657| = 2	( 0.6% and 1.5% )

	both PF02657 and PF01722 have PDB structures
		PF01722 	   d.52.6.1

	SUPERFAM mapping significantly overlapping:
     1	PF01722 SSF82657 0.885 (average over 968 mutual instances, PF01722 976 appearances, SSF82657 980 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 352 ) 6752913_PF00817_PF02961            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00817  is 6743482 with Jaccard = 0.9768	|PF00817|=516 [ 505	1	1099694	11 ]
 parent [ 6743482 ] : 6752913 0.0182865 (=1162/(94*676)) 98.8327
  given [ 6743482 ] : 6743482 0.0211403 (=99/(7*669)) 98.0969
	best keyword for cluster 6743482 is PF00817 with Jaccard = 0.9768	[ 505	1	1099694	11 ]	0.9980 0.9787
sibling [ 6743482 ] : 6742618 0.0300065 (=46/(73*21)) 98.0191
	best keyword for cluster 6742618 is PF02961 with Jaccard = 0.7857	[ 11	3	1100197	0 ]	0.7857 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF00817 ( PF00817   impB/mucB/samB family )
	B> PF02961 ( PF02961   Barrier to autointegration factor )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	both PF00817 and PF02961 have PDB structures
	PF00817 	   d.240.1.1  e.8.1.7
	PF02961 	   a.60.5.1

	SUPERFAM mapping significantly overlapping:
     1	PF02961 SSF47798 0.975 (average over 29 mutual instances, PF02961 29 appearances, SSF47798 29 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 353 ) 6750980_PF00639_PF04319            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04319  is 6524652 with Jaccard = 0.9767	|PF04319|=43 [ 42	0	1100168	1 ]
 parent [ 6524652 ] : 6750980 0.0135312 (=506/(45*831)) 98.6941
  given [ 6524652 ] : 6524652 0.792683 (=130/(4*41)) 22.8423
	best keyword for cluster 6524652 is PF04319 with Jaccard = 0.9767	[ 42	0	1100168	1 ]	1.0000 0.9767
sibling [ 6524652 ] : 6747622 0.019593 (=129/(8*823)) 98.4372
	best keyword for cluster 6747622 is PF00639 with Jaccard = 0.9720	[ 520	5	1099676	10 ]	0.9905 0.9811

	SUGGESTING RELATEDNESS OF:   
	A> PF04319 ( PF04319   NifZ domain )
	B> PF00639 ( PF00639   PPIC-type PPIASE domain )

	Neither A nor B are assigned a clan.

	the two keywords coincide on UniRef90 proteins: |PF00639| = 530 ,	 |PF04319| = 43 , 	|PF00639^PF04319| = 1	( 0.2% and 2.3% )

	only  PF04319 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 354 ) 6751748_PF03692_PF05779            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF03692  is 6748798 with Jaccard = 0.9758	|PF03692|=289 [ 282	0	1099922	7 ]
 parent [ 6748798 ] : 6751748 0.0190383 (=700/(96*383)) 98.7501
  given [ 6748798 ] : 6748798 0.0207989 (=151/(20*363)) 98.5273
	best keyword for cluster 6748798 is PF03692 with Jaccard = 0.9758	[ 282	0	1099922	7 ]	1.0000 0.9758
sibling [ 6748798 ] : 6743017 0.035313 (=22/(89*7)) 98.0547
	best keyword for cluster 6743017 is PF05779 with Jaccard = 0.9868	[ 75	1	1100135	0 ]	0.9868 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF03692 ( PF03692   Uncharacterised protein family (UPF0153) )
	B> PF05779 (  )

	Neither A nor B are assigned a clan.

	the two keywords coincide on UniRef90 proteins: |PF03692| = 289 ,	 |PF05779| = 75 , 	|PF03692^PF05779| = 1	( 0.3% and 1.3% )

	Neither PF03692 nor PF05779 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 355 ) 6746429_PF01763_PF03271            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01763  is 6488277 with Jaccard = 0.9756	|PF01763|=41 [ 40	0	1100170	1 ]
 parent [ 6488277 ] : 6746429 0.0265766 (=118/(40*111)) 98.3434
  given [ 6488277 ] : 6488277 0.923077 (=36/(1*39)) 7.76623
	best keyword for cluster 6488277 is PF01763 with Jaccard = 0.9756	[ 40	0	1100170	1 ]	1.0000 0.9756
sibling [ 6488277 ] : 6740396 0.0263736 (=48/(91*20)) 97.8172
	best keyword for cluster 6740396 is PF03271 with Jaccard = 0.7922	[ 61	15	1100134	1 ]	0.8026 0.9839

	SUGGESTING RELATEDNESS OF:   
	A> PF01763 ( PF01763   Herpesvirus UL6 like )
	B> PF03271 ( PF03271   EB1-like C-terminal motif )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF01763 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 356 ) 6762508_PF03968_PF06835            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF06835  is 6748807 with Jaccard = 0.9756	|PF06835|=81 [ 80	1	1100129	1 ]
 parent [ 6748807 ] : 6762508 0.00917447 (=549/(160*374)) 99.3945
  given [ 6748807 ] : 6748807 0.0194892 (=87/(124*36)) 98.5286
	best keyword for cluster 6748807 is PF06835 with Jaccard = 0.9756	[ 80	1	1100129	1 ]	0.9877 0.9877
sibling [ 6748807 ] : 6752400 0.0142857 (=52/(364*10)) 98.7961
	best keyword for cluster 6752400 is PF03968 with Jaccard = 0.7921	[ 221	46	1099932	12 ]	0.8277 0.9485

	SUGGESTING RELATEDNESS OF:   
	A> PF06835 ( PF06835   Protein of unknown function (DUF1239) )
	B> PF03968 ( PF03968   OstA-like protein )

	they come from the same clan: CL0259.2 : PF06835	PF03968
	the two keywords do not coincide on UniRef90 proteins

	Neither PF06835 nor PF03968 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 357 ) 6754277_PF01206_PF02635            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02635  is 6742716 with Jaccard = 0.9755	|PF02635|=161 [ 159	2	1100048	2 ]
 parent [ 6742716 ] : 6754277 0.0119638 (=1584/(313*423)) 98.9237
  given [ 6742716 ] : 6742716 0.0233392 (=397/(243*70)) 98.026
	best keyword for cluster 6742716 is PF02635 with Jaccard = 0.9755	[ 159	2	1100048	2 ]	0.9876 0.9876
sibling [ 6742716 ] : 6748727 0.0220923 (=612/(81*342)) 98.5214
	best keyword for cluster 6748727 is PF01206 with Jaccard = 0.9051	[ 248	7	1099937	19 ]	0.9725 0.9288

	SUGGESTING RELATEDNESS OF:   
	A> PF02635 ( PF02635   DsrE/DsrF-like family )
	B> PF01206 ( PF01206   SirA-like protein )

	Neither A nor B are assigned a clan.

	the two keywords coincide on UniRef90 proteins: |PF01206| = 267 ,	 |PF02635| = 161 , 	|PF01206^PF02635| = 4	( 1.5% and 2.5% )

	both PF02635 and PF01206 have PDB structures
	PF02635 	   c.114.1.1
	
	SUPERFAM mapping significantly overlapping:
     1	PF01206 SSF64307 0.950 (average over 849 mutual instances, PF01206 954 appearances, SSF64307 1042 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 358 ) 6542083_PF00393_PF03446            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00393  is 6532366 with Jaccard = 0.9753	|PF00393|=277 [ 276	6	1099928	1 ]
 parent [ 6532366 ] : 6542083 0.685969 (=116837/(308*553)) 33.5237
  given [ 6532366 ] : 6532366 0.73768 (=2410/(11*297)) 27.0546
	best keyword for cluster 6532366 is PF00393 with Jaccard = 0.9753	[ 276	6	1099928	1 ]	0.9787 0.9964
sibling [ 6532366 ] : 6532887 0.746717 (=7960/(20*533)) 27.5737
	best keyword for cluster 6532887 is PF03446 with Jaccard = 0.6203	[ 490	3	1099421	297 ]	0.9939 0.6226

	SUGGESTING RELATEDNESS OF:   
	A> PF00393 ( PF00393   6-phosphogluconate dehydrogenase, C-terminal domain )
	B> PF03446 ( PF03446   NAD binding domain of 6-phosphogluconate dehydrogenase )

	A and B come from different clans ( CL0106.7 , CL0063.17 ).

	the two keywords coincide on UniRef90 proteins: |PF00393| = 277 ,	 |PF03446| = 787 , 	|PF00393^PF03446| = 255	( 92.1% and 32.4% )

	both PF00393 and PF03446 have PDB structures
	PF00393 	   a.100.1.1
	
	SUPERFAM mapping significantly overlapping:
     1	PF00393 SSF48179 0.846 (average over 1097 mutual instances, PF00393 2092 appearances, SSF48179 20570 appearances)
     2	PF03446 SSF51735 0.944 (average over 2816 mutual instances, PF03446 5512 appearances, SSF51735 164772 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 359 ) 6634810_PF03088_PF08450            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF03088  is 6629232 with Jaccard = 0.9750	|PF03088|=80 [ 78	0	1100131	2 ]
 parent [ 6629232 ] : 6634810 0.273158 (=7201/(269*98)) 75.6875
  given [ 6629232 ] : 6629232 0.313978 (=146/(5*93)) 74.2173
	best keyword for cluster 6629232 is PF03088 with Jaccard = 0.9750	[ 78	0	1100131	2 ]	1.0000 0.9750
sibling [ 6629232 ] : 6626669 0.277154 (=148/(2*267)) 73.2134
	best keyword for cluster 6626669 is PF08450 with Jaccard = 0.8979	[ 211	16	1099976	8 ]	0.9295 0.9635

	SUGGESTING RELATEDNESS OF:   
	A> PF03088 ( PF03088   Strictosidine synthase )
	B> PF08450 ( PF08450   SMP-30/Gluconolaconase/LRE-like region )

	they come from the same clan: CL0186.8 : PF03088	PF08450	PF06739	PF07494	PF01011	PF02897	PF07676	PF08801	PF01436	PF06433	PF00058	PF01839	PF00930	PF02239	PF01731	PF00400
	the two keywords do not coincide on UniRef90 proteins

	only  PF03088 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 360 ) 6700753_PF05105_PF05895            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF05105  is 6680068 with Jaccard = 0.9750	|PF05105|=40 [ 39	0	1100171	1 ]
 parent [ 6680068 ] : 6700753 0.077381 (=39/(12*42)) 92.3104
  given [ 6680068 ] : 6680068 0.138158 (=21/(4*38)) 88.1723
	best keyword for cluster 6680068 is PF05105 with Jaccard = 0.9750	[ 39	0	1100171	1 ]	1.0000 0.9750
sibling [ 6680068 ] : 5822328 1 (=32/(8*4)) 9.37506e-52
	best keyword for cluster 5822328 is PF05895 with Jaccard = 0.9231	[ 12	0	1100198	1 ]	1.0000 0.9231

	SUGGESTING RELATEDNESS OF:   
	A> PF05105 ( PF05105   Holin family )
	B> PF05895 ( PF05895   Siphovirus protein of unknown function (DUF859) )

	Neither A nor B are assigned a clan.

	the two keywords coincide on UniRef90 proteins: |PF05105| = 40 ,	 |PF05895| = 13 , 	|PF05105^PF05895| = 1	( 2.5% and 7.7% )

	Neither PF05105 nor PF05895 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 361 ) 6732749_PF00581_PF00899            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00899  is 6728865 with Jaccard = 0.9749	|PF00899|=901 [ 893	15	1099295	8 ]
 parent [ 6728865 ] : 6732749 0.0326903 (=54432/(1529*1089)) 97.0243
  given [ 6728865 ] : 6728865 0.043517 (=976/(21*1068)) 96.594
	best keyword for cluster 6728865 is PF00899 with Jaccard = 0.9749	[ 893	15	1099295	8 ]	0.9835 0.9911
sibling [ 6728865 ] : 6721893 0.0539344 (=329/(4*1525)) 95.6893
	best keyword for cluster 6721893 is PF00581 with Jaccard = 0.8021	[ 1050	8	1098902	251 ]	0.9924 0.8071

	SUGGESTING RELATEDNESS OF:   
	A> PF00899 ( PF00899   ThiF family )
	B> PF00581 ( PF00581   Rhodanese-like domain )

	Only A has a clan ( CL0063.17 ).

	the two keywords coincide on UniRef90 proteins: |PF00581| = 1301 ,	 |PF00899| = 901 , 	|PF00581^PF00899| = 72	( 5.5% and 8.0% )

	both PF00899 and PF00581 have PDB structures
		
	SUPERFAM mapping significantly overlapping:
     1	PF00581 SSF52821 0.763 (average over 3964 mutual instances, PF00581 4463 appearances, SSF52821 6143 appearances)
     2	PF00899 SSF69572 0.518 (average over 2370 mutual instances, PF00899 2642 appearances, SSF69572 3931 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 362 ) 6737404_PF00485_PF01121            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01121  is 6731962 with Jaccard = 0.9746	|PF01121|=313 [ 307	2	1099896	6 ]
 parent [ 6731962 ] : 6737404 0.0391929 (=8508/(536*405)) 97.5261
  given [ 6731962 ] : 6731962 0.0310602 (=317/(27*378)) 96.9348
	best keyword for cluster 6731962 is PF01121 with Jaccard = 0.9746	[ 307	2	1099896	6 ]	0.9935 0.9808
sibling [ 6731962 ] : 6720335 0.0650943 (=207/(6*530)) 95.464
	best keyword for cluster 6720335 is PF00485 with Jaccard = 0.8920	[ 347	4	1099822	38 ]	0.9886 0.9013

	SUGGESTING RELATEDNESS OF:   
	A> PF01121 ( PF01121   Dephospho-CoA kinase )
	B> PF00485 ( PF00485   Phosphoribulokinase / Uridine kinase family )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	both PF01121 and PF00485 have PDB structures
	PF01121 	   c.37.1.1
	
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 363 ) 6740876_PF03410_PF05193            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF05193  is 6728858 with Jaccard = 0.9741	|PF05193|=945 [ 939	19	1099247	6 ]
 parent [ 6728858 ] : 6740876 0.0290778 (=548/(1047*18)) 97.8647
  given [ 6728858 ] : 6728858 0.0353167 (=184/(1042*5)) 96.5927
	best keyword for cluster 6728858 is PF05193 with Jaccard = 0.9741	[ 939	19	1099247	6 ]	0.9802 0.9937
sibling [ 6728858 ] : 6733557 0.0588235 (=1/(1*17)) 97.1176
	best keyword for cluster 6733557 is PF03410 with Jaccard = 0.9231	[ 12	1	1100198	0 ]	0.9231 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF05193 ( PF05193   Peptidase M16 inactive domain )
	B> PF03410 ( PF03410   Protein G1 )

	they come from the same clan: CL0094.7 : PF02664	PF00675	PF05193	PF03410
	the two keywords do not coincide on UniRef90 proteins

	only  PF05193 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 364 ) 6710584_PF01417_PF07651            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01417  is 6577412 with Jaccard = 0.9735	|PF01417|=113 [ 110	0	1100098	3 ]
 parent [ 6577412 ] : 6710584 0.0803684 (=890/(113*98)) 94.0104
  given [ 6577412 ] : 6577412 0.477477 (=106/(2*111)) 52.9269
	best keyword for cluster 6577412 is PF01417 with Jaccard = 0.9735	[ 110	0	1100098	3 ]	1.0000 0.9735
sibling [ 6577412 ] : 6694982 0.0930851 (=35/(4*94)) 91.3228
	best keyword for cluster 6694982 is PF07651 with Jaccard = 0.6960	[ 87	1	1100086	37 ]	0.9886 0.7016

	SUGGESTING RELATEDNESS OF:   
	A> PF01417 ( PF01417   ENTH domain )
	B> PF07651 ( PF07651   ANTH domain )

	they come from the same clan: CL0009.14 : PF01417	PF07651	PF00790
	the two keywords do not coincide on UniRef90 proteins

	both PF01417 and PF07651 have PDB structures
	PF01417 	   a.118.9.1
	
	SUPERFAM mapping significantly overlapping:
     1	PF01417 SSF48464 0.847 (average over 240 mutual instances, PF01417 240 appearances, SSF48464 1729 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 365 ) 6758123_PF03672_PF04729            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF03672  is 6439359 with Jaccard = 0.9730	|PF03672|=37 [ 36	0	1100174	1 ]
 parent [ 6439359 ] : 6758123 0.0160595 (=41/(37*69)) 99.1604
  given [ 6439359 ] : 6439359 0.995238 (=209/(7*30)) 0.62387
	best keyword for cluster 6439359 is PF03672 with Jaccard = 0.9730	[ 36	0	1100174	1 ]	1.0000 0.9730
sibling [ 6439359 ] : 6742166 0.0274725 (=20/(56*13)) 97.9857
	best keyword for cluster 6742166 is PF04729 with Jaccard = 1.0000	[ 47	0	1100164	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF03672 ( PF03672   Uncharacterised protein family (UPF0154) )
	B> PF04729 ( PF04729   Anti-silencing protein, ASF1-like )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF03672 has a PDB structure (may not be up to date)
		PF04729 	   b.1.22.1

	SUPERFAM mapping significantly overlapping:
     1	PF04729 SSF101546 0.968 (average over 114 mutual instances, PF04729 114 appearances, SSF101546 115 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 366 ) 6711543_PF00004_PF06309            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF06309  is 6689666 with Jaccard = 0.9730	|PF06309|=36 [ 36	1	1100174	0 ]
 parent [ 6689666 ] : 6711543 0.0669631 (=24623/(51*7210)) 94.1712
  given [ 6689666 ] : 6689666 0.111111 (=30/(45*6)) 90.2085
	best keyword for cluster 6689666 is PF06309 with Jaccard = 0.9730	[ 36	1	1100174	0 ]	0.9730 1.0000
sibling [ 6689666 ] : 6708571 0.0663275 (=10964/(23*7187)) 93.739
	best keyword for cluster 6708571 is PF00004 with Jaccard = 0.6403	[ 3979	2070	1093997	165 ]	0.6578 0.9602

	SUGGESTING RELATEDNESS OF:   
	A> PF06309 ( PF06309   Torsin )
	B> PF00004 ( PF00004   ATPase family associated with various cellular activities (AAA) )

	Only B has a clan ( CL0023.26 ).
	the two keywords do not coincide on UniRef90 proteins

	only  PF06309 has a PDB structure (may not be up to date)
		PF00004 	   c.37.1.1  c.37.1.20

	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 367 ) 6740652_PF01758_PF03547            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF03547  is 6693831 with Jaccard = 0.9729	|PF03547|=405 [ 395	1	1099805	10 ]
 parent [ 6693831 ] : 6740652 0.0282873 (=6970/(448*550)) 97.8419
  given [ 6693831 ] : 6693831 0.112273 (=2866/(67*381)) 91.0553
	best keyword for cluster 6693831 is PF03547 with Jaccard = 0.9729	[ 395	1	1099805	10 ]	0.9975 0.9753
sibling [ 6693831 ] : 6740319 0.0255009 (=14/(1*549)) 97.8069
	best keyword for cluster 6740319 is PF01758 with Jaccard = 0.8565	[ 376	62	1099772	1 ]	0.8584 0.9973

	SUGGESTING RELATEDNESS OF:   
	A> PF03547 ( PF03547   Membrane transport protein )
	B> PF01758 ( PF01758   Sodium Bile acid symporter family )

	they come from the same clan: CL0064.7 : PF06826	PF03547	PF03601	PF05684	PF05982	PF03616	PF06965	PF00999	PF03977	PF01758
	the two keywords do not coincide on UniRef90 proteins

	Neither PF03547 nor PF01758 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 368 ) 6751086_PF01274_PF03328            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01274  is 6537339 with Jaccard = 0.9713	|PF01274|=174 [ 169	0	1100037	5 ]
 parent [ 6537339 ] : 6751086 0.0176249 (=1499/(189*450)) 98.7029
  given [ 6537339 ] : 6537339 0.713358 (=6259/(82*107)) 30.4034
	best keyword for cluster 6537339 is PF01274 with Jaccard = 0.9713	[ 169	0	1100037	5 ]	1.0000 0.9713
sibling [ 6537339 ] : 6712190 0.0746986 (=3668/(264*186)) 94.2795
	best keyword for cluster 6712190 is PF03328 with Jaccard = 0.9634	[ 368	12	1099829	2 ]	0.9684 0.9946

	SUGGESTING RELATEDNESS OF:   
	A> PF01274 ( PF01274   Malate synthase )
	B> PF03328 ( PF03328   HpcH/HpaI aldolase/citrate lyase family )

	they come from the same clan: CL0151.7 : PF03328	PF01274	PF02896	PF00224
	the two keywords do not coincide on UniRef90 proteins

	both PF01274 and PF03328 have PDB structures
		PF03328 	   c.1.12.5

	SUPERFAM mapping significantly overlapping:
     1	PF03328 SSF51621 0.868 (average over 1134 mutual instances, PF03328 1215 appearances, SSF51621 12495 appearances)
     2	PF01274 SSF51645 0.908 (average over 679 mutual instances, PF01274 686 appearances, SSF51645 767 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 369 ) 6779200_PF05910_PF07893            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF07893  is 6771772 with Jaccard = 0.9710	|PF07893|=69 [ 67	0	1100142	2 ]
 parent [ 6771772 ] : 6779200 0.000873536 (=22/(115*219)) 99.9408
  given [ 6771772 ] : 6771772 0.00328407 (=8/(87*28)) 99.7681
	best keyword for cluster 6771772 is PF07893 with Jaccard = 0.9710	[ 67	0	1100142	2 ]	1.0000 0.9710
sibling [ 6771772 ] : 6774540 0.00195713 (=21/(145*74)) 99.8444
	best keyword for cluster 6774540 is PF05910 with Jaccard = 0.6571	[ 23	12	1100176	0 ]	0.6571 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF07893 ( PF07893   Protein of unknown function (DUF1668) )
	B> PF05910 ( PF05910   Plant protein of unknown function (DUF868) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF07893 nor PF05910 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 370 ) 6767275_PF01208_PF04217            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01208  is 6735556 with Jaccard = 0.9706	|PF01208|=272 [ 264	0	1099939	8 ]
 parent [ 6735556 ] : 6767275 0.00695667 (=162/(319*73)) 99.6081
  given [ 6735556 ] : 6735556 0.0305466 (=76/(311*8)) 97.3307
	best keyword for cluster 6735556 is PF01208 with Jaccard = 0.9706	[ 264	0	1099939	8 ]	1.0000 0.9706
sibling [ 6735556 ] : 6764765 0.0138889 (=1/(1*72)) 99.5
	best keyword for cluster 6764765 is PF04217 with Jaccard = 1.0000	[ 22	0	1100189	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF01208 ( PF01208   Uroporphyrinogen decarboxylase (URO-D) )
	B> PF04217 ( PF04217   Protein of unknown function, DUF412 )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF01208 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 371 ) 6650231_PF01148_PF01864            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01864  is 6497798 with Jaccard = 0.9706	|PF01864|=34 [ 33	0	1100177	1 ]
 parent [ 6497798 ] : 6650231 0.246841 (=3067/(355*35)) 80.061
  given [ 6497798 ] : 6497798 0.907407 (=196/(27*8)) 10.8695
	best keyword for cluster 6497798 is PF01864 with Jaccard = 0.9706	[ 33	0	1100177	1 ]	1.0000 0.9706
sibling [ 6497798 ] : 6448283 0.991523 (=13568/(44*311)) 1.19472
	best keyword for cluster 6448283 is PF01148 with Jaccard = 0.7222	[ 325	0	1099761	125 ]	1.0000 0.7222

	SUGGESTING RELATEDNESS OF:   
	A> PF01864 ( PF01864   Putative integral membrane protein DUF46 )
	B> PF01148 ( PF01148   Cytidylyltransferase family )

	they come from the same clan: CL0234.3 : PF01148	PF01864
	the two keywords do not coincide on UniRef90 proteins

	Neither PF01864 nor PF01148 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 372 ) 6740778_PF02001_PF04198            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02001  is 6731336 with Jaccard = 0.9706	|PF02001|=66 [ 66	2	1100143	0 ]
 parent [ 6731336 ] : 6740778 0.03243 (=432/(173*77)) 97.8562
  given [ 6731336 ] : 6731336 0.0361111 (=13/(72*5)) 96.8664
	best keyword for cluster 6731336 is PF02001 with Jaccard = 0.9706	[ 66	2	1100143	0 ]	0.9706 1.0000
sibling [ 6731336 ] : 6667839 0.2 (=264/(8*165)) 84.9514
	best keyword for cluster 6667839 is PF04198 with Jaccard = 0.9272	[ 140	9	1100060	2 ]	0.9396 0.9859

	SUGGESTING RELATEDNESS OF:   
	A> PF02001 ( PF02001   Protein of unknown function DUF134 )
	B> PF04198 ( PF04198   Putative sugar-binding domain )

	A and B come from different clans ( CL0123.12 , CL0246.3 ).
	the two keywords do not coincide on UniRef90 proteins

	Neither PF02001 nor PF04198 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:
     1	PF02001 SSF88659 0.548 (average over 16 mutual instances, PF02001 23 appearances, SSF88659 22430 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 373 ) 6616760_PF00014_PF02177            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02177  is 6605242 with Jaccard = 0.9706	|PF02177|=33 [ 33	1	1100177	0 ]
 parent [ 6605242 ] : 6616760 0.31396 (=4235/(329*41)) 69.1394
  given [ 6605242 ] : 6605242 0.358974 (=28/(2*39)) 64.1046
	best keyword for cluster 6605242 is PF02177 with Jaccard = 0.9706	[ 33	1	1100177	0 ]	0.9706 1.0000
sibling [ 6605242 ] : 6600405 0.464832 (=304/(2*327)) 61.9802
	best keyword for cluster 6600405 is PF00014 with Jaccard = 0.7529	[ 320	0	1099786	105 ]	1.0000 0.7529

	SUGGESTING RELATEDNESS OF:   
	A> PF02177 ( PF02177   Amyloid A4 extracellular domain )
	B> PF00014 ( PF00014   Kunitz/Bovine pancreatic trypsin inhibitor domain )

	Neither A nor B are assigned a clan.

	the two keywords coincide on UniRef90 proteins: |PF00014| = 426 ,	 |PF02177| = 33 , 	|PF00014^PF02177| = 13	( 3.1% and 39.4% )

	both PF02177 and PF00014 have PDB structures
	PF02177 	   d.170.2.1  d.230.3.1
	PF00014 	   g.8.1.1  g.8.1.2  k.35.1.1

	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 374 ) 6745343_PF06295_PF06511            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF06295  is 6539149 with Jaccard = 0.9706	|PF06295|=34 [ 33	0	1100177	1 ]
 parent [ 6539149 ] : 6745343 0.0223684 (=17/(38*20)) 98.2595
  given [ 6539149 ] : 6539149 0.695238 (=73/(3*35)) 31.858
	best keyword for cluster 6539149 is PF06295 with Jaccard = 0.9706	[ 33	0	1100177	1 ]	1.0000 0.9706
sibling [ 6539149 ] : 6727382 0.04 (=3/(5*15)) 96.4
	best keyword for cluster 6727382 is PF06511 with Jaccard = 0.8889	[ 8	1	1100202	0 ]	0.8889 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF06295 ( PF06295   Protein of unknown function (DUF1043) )
	B> PF06511 ( PF06511   Invasion plasmid antigen IpaD )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF06295 nor PF06511 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 375 ) 6758309_PF03663_PF07470            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF07470  is 6725770 with Jaccard = 0.9706	|PF07470|=101 [ 99	1	1100109	2 ]
 parent [ 6725770 ] : 6758309 0.0124713 (=348/(128*218)) 99.172
  given [ 6725770 ] : 6725770 0.0515516 (=206/(54*74)) 96.2043
	best keyword for cluster 6725770 is PF07470 with Jaccard = 0.9706	[ 99	1	1100109	2 ]	0.9900 0.9802
sibling [ 6725770 ] : 6748843 0.0219372 (=248/(85*133)) 98.5323
	best keyword for cluster 6748843 is PF03663 with Jaccard = 0.6067	[ 108	70	1100033	0 ]	0.6067 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF07470 ( PF07470   Glycosyl Hydrolase Family 88 )
	B> PF03663 ( PF03663   Glycosyl hydrolase family 76 )

	Only A has a clan ( CL0059.10 ).
	the two keywords do not coincide on UniRef90 proteins

	only  PF07470 has a PDB structure (may not be up to date)
	PF07470 	   a.102.1.6  a.102.1.7
	
	SUPERFAM mapping significantly overlapping:
     1	PF03663 SSF48208 0.809 (average over 242 mutual instances, PF03663 251 appearances, SSF48208 6032 appearances)
     2	PF07470 SSF48208 0.854 (average over 295 mutual instances, PF07470 297 appearances, SSF48208 6032 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 376 ) 6646486_PF00928_PF01217            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01217  is 6600961 with Jaccard = 0.9702	|PF01217|=168 [ 163	0	1100043	5 ]
 parent [ 6600961 ] : 6646486 0.238624 (=8474/(184*193)) 78.9734
  given [ 6600961 ] : 6600961 0.412251 (=2894/(130*54)) 62.0741
	best keyword for cluster 6600961 is PF01217 with Jaccard = 0.9702	[ 163	0	1100043	5 ]	1.0000 0.9702
sibling [ 6600961 ] : 6624085 0.311862 (=2140/(47*146)) 72.0361
	best keyword for cluster 6624085 is PF00928 with Jaccard = 0.9176	[ 167	0	1100029	15 ]	1.0000 0.9176

	SUGGESTING RELATEDNESS OF:   
	A> PF01217 ( PF01217   Clathrin adaptor complex small chain )
	B> PF00928 ( PF00928   Adaptor complexes medium subunit family )

	Only A has a clan ( CL0212.4 ).
	the two keywords do not coincide on UniRef90 proteins

	both PF01217 and PF00928 have PDB structures
	PF01217 	   d.110.4.2  i.23.1.1
	PF00928 	   b.2.7.1  i.23.1.1

	SUPERFAM mapping significantly overlapping:
     1	PF00928 SSF49447 0.984 (average over 510 mutual instances, PF00928 940 appearances, SSF49447 619 appearances)
     2	PF01217 SSF64356 0.981 (average over 601 mutual instances, PF01217 700 appearances, SSF64356 1711 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 377 ) 6577955_PF00025_PF00503            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00025  is 6559928 with Jaccard = 0.9700	|PF00025|=429 [ 420	4	1099778	9 ]
 parent [ 6559928 ] : 6577955 0.522522 (=80110/(466*329)) 53.0535
  given [ 6559928 ] : 6559928 0.551422 (=2268/(9*457)) 46.776
	best keyword for cluster 6559928 is PF00025 with Jaccard = 0.9700	[ 420	4	1099778	9 ]	0.9906 0.9790
sibling [ 6559928 ] : 6561247 0.585366 (=192/(1*328)) 47.9601
	best keyword for cluster 6561247 is PF00503 with Jaccard = 0.9517	[ 315	0	1099880	16 ]	1.0000 0.9517

	SUGGESTING RELATEDNESS OF:   
	A> PF00025 ( PF00025   ADP-ribosylation factor family )
	B> PF00503 ( PF00503   G-protein alpha subunit )

	they come from the same clan: CL0017.14 : PF00735	PF00071	PF06858	PF01926	PF08477	PF05049	PF00009	PF00503	PF00350	PF09439	PF03193	PF03029	PF00025	PF04548
	the two keywords do not coincide on UniRef90 proteins

	both PF00025 and PF00503 have PDB structures
	PF00025 	   c.37.1.8
	PF00503 	   j.56.1.1

	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 378 ) 6722352_PF03083_PF04193            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04193  is 6695625 with Jaccard = 0.9698	|PF04193|=199 [ 193	0	1100012	6 ]
 parent [ 6695625 ] : 6722352 0.0591151 (=2314/(233*168)) 95.7616
  given [ 6695625 ] : 6695625 0.119113 (=1209/(58*175)) 91.4789
	best keyword for cluster 6695625 is PF04193 with Jaccard = 0.9698	[ 193	0	1100012	6 ]	1.0000 0.9698
sibling [ 6695625 ] : 6713456 0.0640625 (=82/(160*8)) 94.4822
	best keyword for cluster 6713456 is PF03083 with Jaccard = 0.9717	[ 103	3	1100105	0 ]	0.9717 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF04193 ( PF04193   PQ loop repeat )
	B> PF03083 ( PF03083   MtN3/saliva family )

	they come from the same clan: CL0141.7 : PF04193	PF03083	PF07578	PF03650

	the two keywords coincide on UniRef90 proteins: |PF03083| = 103 ,	 |PF04193| = 199 , 	|PF03083^PF04193| = 1	( 1.0% and 0.5% )

	Neither PF04193 nor PF03083 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 379 ) 6701536_PF00977_PF04309            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04309  is 6313306 with Jaccard = 0.9697	|PF04309|=32 [ 32	1	1100178	0 ]
 parent [ 6313306 ] : 6701536 0.107232 (=2239/(36*580)) 92.446
  given [ 6313306 ] : 6313306 1 (=68/(2*34)) 1.25177e-08
	best keyword for cluster 6313306 is PF04309 with Jaccard = 0.9697	[ 32	1	1100178	0 ]	0.9697 1.0000
sibling [ 6313306 ] : 6685620 0.134715 (=78/(1*579)) 89.3861
	best keyword for cluster 6685620 is PF00977 with Jaccard = 0.9030	[ 484	50	1099675	2 ]	0.9064 0.9959

	SUGGESTING RELATEDNESS OF:   
	A> PF04309 ( PF04309   Glycerol-3-phosphate responsive antiterminator )
	B> PF00977 ( PF00977   Histidine biosynthesis protein )

	they come from the same clan: CL0036.17 : PF05690	PF01680	PF00834	PF01729	PF00697	PF03740	PF01884	PF00724	PF00215	PF03060	PF04095	PF04131	PF00478	PF00218	PF00977	PF01645	PF04309	PF01070	PF01207	PF04481	PF04476	PF01180	PF00701	PF01791	PF03932	PF03437	PF01081	PF00121	PF09370	PF02581	PF00290
	the two keywords do not coincide on UniRef90 proteins

	both PF04309 and PF00977 have PDB structures
	PF04309 	   c.1.29.1
	PF00977 	   c.1.2.1

	SUPERFAM mapping significantly overlapping:
     1	PF04309 SSF110391 0.979 (average over 128 mutual instances, PF04309 128 appearances, SSF110391 128 appearances)
     2	PF00977 SSF51366 0.938 (average over 1629 mutual instances, PF00977 1632 appearances, SSF51366 8168 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 380 ) 6737615_PF00462_PF03479            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF03479  is 6690625 with Jaccard = 0.9695	|PF03479|=130 [ 127	1	1100080	3 ]
 parent [ 6690625 ] : 6737615 0.0264726 (=4185/(168*941)) 97.5474
  given [ 6690625 ] : 6690625 0.107533 (=748/(94*74)) 90.3884
	best keyword for cluster 6690625 is PF03479 with Jaccard = 0.9695	[ 127	1	1100080	3 ]	0.9922 0.9769
sibling [ 6690625 ] : 6731891 0.0352304 (=2148/(70*871)) 96.9284
	best keyword for cluster 6731891 is PF00462 with Jaccard = 0.7431	[ 729	79	1099230	173 ]	0.9022 0.8082

	SUGGESTING RELATEDNESS OF:   
	A> PF03479 ( PF03479   Domain of unknown function (DUF296) )
	B> PF00462 ( PF00462   Glutaredoxin )

	Only B has a clan ( CL0172.11 ).

	the two keywords coincide on UniRef90 proteins: |PF00462| = 902 ,	 |PF03479| = 130 , 	|PF00462^PF03479| = 8	( 0.9% and 6.2% )

	only PF00462 has a PDB structure (may not be up to date)
		PF00462 	   c.47.1.1

	SUPERFAM mapping significantly overlapping:
     1	PF00462 SSF52833 0.710 (average over 2554 mutual instances, PF00462 2661 appearances, SSF52833 34965 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 381 ) 6752262_PF00463_PF02548            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00463  is 6709607 with Jaccard = 0.9693	|PF00463|=255 [ 253	6	1099950	2 ]
 parent [ 6709607 ] : 6752262 0.0167139 (=1768/(258*410)) 98.7864
  given [ 6709607 ] : 6709607 0.0622222 (=126/(405*5)) 93.8751
	best keyword for cluster 6709607 is PF00463 with Jaccard = 0.9693	[ 253	6	1099950	2 ]	0.9768 0.9922
sibling [ 6709607 ] : 6630499 0.303502 (=78/(1*257)) 74.8747
	best keyword for cluster 6630499 is PF02548 with Jaccard = 1.0000	[ 231	0	1099980	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF00463 ( PF00463   Isocitrate lyase family )
	B> PF02548 ( PF02548   Ketopantoate hydroxymethyltransferase )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	both PF00463 and PF02548 have PDB structures
	PF00463 	   c.1.12.7
	
	SUPERFAM mapping significantly overlapping:
     1	PF00463 SSF51621 0.657 (average over 906 mutual instances, PF00463 913 appearances, SSF51621 12495 appearances)
     2	PF02548 SSF51621 0.983 (average over 760 mutual instances, PF02548 769 appearances, SSF51621 12495 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 382 ) 6712077_PF05995_PF07847            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF05995  is 6597067 with Jaccard = 0.9692	|PF05995|=64 [ 63	1	1100146	1 ]
 parent [ 6597067 ] : 6712077 0.081474 (=241/(34*87)) 94.2655
  given [ 6597067 ] : 6597067 0.448443 (=835/(38*49)) 60.285
	best keyword for cluster 6597067 is PF05995 with Jaccard = 0.9692	[ 63	1	1100146	1 ]	0.9844 0.9844
sibling [ 6597067 ] : 6515298 0.875 (=56/(2*32)) 17.918
	best keyword for cluster 6515298 is PF07847 with Jaccard = 0.9412	[ 32	0	1100177	2 ]	1.0000 0.9412

	SUGGESTING RELATEDNESS OF:   
	A> PF05995 ( PF05995   Cysteine dioxygenase type I )
	B> PF07847 ( PF07847   Protein of unknown function (DUF1637) )

	Only A has a clan ( CL0029.13 ).
	the two keywords do not coincide on UniRef90 proteins

	only  PF05995 has a PDB structure (may not be up to date)
	PF05995 	   b.82.1.19
	
	SUPERFAM mapping significantly overlapping:
     1	PF05995 SSF51182 0.731 (average over 135 mutual instances, PF05995 135 appearances, SSF51182 14255 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 383 ) 6744929_PF00799_PF01492            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01492  is 6683146 with Jaccard = 0.9691	|PF01492|=193 [ 188	1	1100017	5 ]
 parent [ 6683146 ] : 6744929 0.0190774 (=1024/(189*284)) 98.2288
  given [ 6683146 ] : 6683146 0.143646 (=208/(8*181)) 88.977
	best keyword for cluster 6683146 is PF01492 with Jaccard = 0.9691	[ 188	1	1100017	5 ]	0.9947 0.9741
sibling [ 6683146 ] : 6737996 0.0397112 (=77/(7*277)) 97.5867
	best keyword for cluster 6737996 is PF00799 with Jaccard = 0.9538	[ 227	10	1099973	1 ]	0.9578 0.9956

	SUGGESTING RELATEDNESS OF:   
	A> PF01492 ( PF01492   Geminivirus C4 protein )
	B> PF00799 ( PF00799   Geminivirus Rep catalytic domain )

	Only B has a clan ( CL0169.6 ).

	the two keywords coincide on UniRef90 proteins: |PF00799| = 228 ,	 |PF01492| = 193 , 	|PF00799^PF01492| = 3	( 1.3% and 1.6% )

	only PF00799 has a PDB structure (may not be up to date)
		PF00799 	   d.89.1.4

	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 384 ) 6750118_PF00505_PF04769            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04769  is 6624153 with Jaccard = 0.9683	|PF04769|=63 [ 61	0	1100148	2 ]
 parent [ 6624153 ] : 6750118 0.0170587 (=1378/(70*1154)) 98.626
  given [ 6624153 ] : 6624153 0.289855 (=20/(1*69)) 72.0942
	best keyword for cluster 6624153 is PF04769 with Jaccard = 0.9683	[ 61	0	1100148	2 ]	1.0000 0.9683
sibling [ 6624153 ] : 6746762 0.0225881 (=284/(11*1143)) 98.3707
	best keyword for cluster 6746762 is PF00505 with Jaccard = 0.8042	[ 805	137	1099210	59 ]	0.8546 0.9317

	SUGGESTING RELATEDNESS OF:   
	A> PF04769 ( PF04769   Mating-type protein MAT alpha 1 )
	B> PF00505 ( PF00505   HMG (high mobility group) box )

	Only B has a clan ( CL0114.6 ).

	the two keywords coincide on UniRef90 proteins: |PF00505| = 864 ,	 |PF04769| = 63 , 	|PF00505^PF04769| = 3	( 0.3% and 4.8% )

	only PF00505 has a PDB structure (may not be up to date)
		PF00505 	   a.21.1.1

	SUPERFAM mapping significantly overlapping:
     1	PF00505 SSF47095 0.800 (average over 2604 mutual instances, PF00505 2716 appearances, SSF47095 3113 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 385 ) 6747159_PF01566_PF05525            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF05525  is 6700316 with Jaccard = 0.9683	|PF05525|=126 [ 122	0	1100085	4 ]
 parent [ 6700316 ] : 6747159 0.0203124 (=1455/(189*379)) 98.4008
  given [ 6700316 ] : 6700316 0.0931824 (=708/(131*58)) 92.2349
	best keyword for cluster 6700316 is PF05525 with Jaccard = 0.9683	[ 122	0	1100085	4 ]	1.0000 0.9683
sibling [ 6700316 ] : 6729842 0.0380184 (=99/(372*7)) 96.708
	best keyword for cluster 6729842 is PF01566 with Jaccard = 0.9893	[ 278	3	1099930	0 ]	0.9893 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF05525 ( PF05525   Branched-chain amino acid transport protein )
	B> PF01566 ( PF01566   Natural resistance-associated macrophage protein )

	Only A has a clan ( CL0062.8 ).
	the two keywords do not coincide on UniRef90 proteins

	Neither PF05525 nor PF01566 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 386 ) 6754973_PF03081_PF03106            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF03106  is 6715920 with Jaccard = 0.9677	|PF03106|=278 [ 270	1	1099932	8 ]
 parent [ 6715920 ] : 6754973 0.0115311 (=532/(316*146)) 98.9681
  given [ 6715920 ] : 6715920 0.0513911 (=290/(19*297)) 94.8673
	best keyword for cluster 6715920 is PF03106 with Jaccard = 0.9677	[ 270	1	1099932	8 ]	0.9963 0.9712
sibling [ 6715920 ] : 6746456 0.0215827 (=21/(139*7)) 98.3464
	best keyword for cluster 6746456 is PF03081 with Jaccard = 0.9802	[ 99	1	1100110	1 ]	0.9900 0.9900

	SUGGESTING RELATEDNESS OF:   
	A> PF03106 ( PF03106   WRKY DNA -binding domain )
	B> PF03081 ( PF03081   Exo70 exocyst complex subunit )

	Only A has a clan ( CL0274.2 ).
	the two keywords do not coincide on UniRef90 proteins

	both PF03106 and PF03081 have PDB structures
		PF03081 	   a.118.17.2

	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 387 ) 6632019_PF02991_PF04110            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04110  is 6608773 with Jaccard = 0.9677	|PF04110|=31 [ 30	0	1100180	1 ]
 parent [ 6608773 ] : 6632019 0.287175 (=730/(31*82)) 75.1426
  given [ 6608773 ] : 6608773 0.4 (=12/(1*30)) 66.4
	best keyword for cluster 6608773 is PF04110 with Jaccard = 0.9677	[ 30	0	1100180	1 ]	1.0000 0.9677
sibling [ 6608773 ] : 6611291 0.35 (=56/(2*80)) 67.1418
	best keyword for cluster 6611291 is PF02991 with Jaccard = 0.9740	[ 75	0	1100134	2 ]	1.0000 0.9740

	SUGGESTING RELATEDNESS OF:   
	A> PF04110 ( PF04110   Ubiquitin-like autophagy protein Apg12 )
	B> PF02991 ( PF02991   Microtubule associated protein 1A/1B, light chain 3 )

	they come from the same clan: CL0072.14 : PF09138	PF03671	PF03658	PF00789	PF00240	PF02597	PF02824	PF02196	PF00788	PF00794	PF00564	PF02991	PF09379	PF08783	PF06071	PF07023	PF02017	PF04110	PF08817
	the two keywords do not coincide on UniRef90 proteins

	both PF04110 and PF02991 have PDB structures
	PF04110 	   d.15.1.7
	
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 388 ) 6675846_PF04740_PF06860            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04740  is 6640075 with Jaccard = 0.9677	|PF04740|=31 [ 30	0	1100180	1 ]
 parent [ 6640075 ] : 6675846 0.147147 (=196/(36*37)) 87.1267
  given [ 6640075 ] : 6640075 0.231183 (=43/(6*31)) 77.002
	best keyword for cluster 6640075 is PF04740 with Jaccard = 0.9677	[ 30	0	1100180	1 ]	1.0000 0.9677
sibling [ 6640075 ] : 6592383 0.428125 (=137/(20*16)) 58.2245
	best keyword for cluster 6592383 is PF06860 with Jaccard = 1.0000	[ 14	0	1100197	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF04740 ( PF04740   Bacillus transposase protein )
	B> PF06860 ( PF06860   Protein of unknown function (DUF1252) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF04740 nor PF06860 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 389 ) 6761119_PF06610_PF07895            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF07895  is 6703274 with Jaccard = 0.9677	|PF07895|=31 [ 30	0	1100180	1 ]
 parent [ 6703274 ] : 6761119 0.00735294 (=5/(34*20)) 99.3243
  given [ 6703274 ] : 6703274 0.107143 (=18/(28*6)) 92.7548
	best keyword for cluster 6703274 is PF07895 with Jaccard = 0.9677	[ 30	0	1100180	1 ]	1.0000 0.9677
sibling [ 6703274 ] : 6743335 0.0238095 (=2/(6*14)) 98.0833
	best keyword for cluster 6743335 is PF06610 with Jaccard = 1.0000	[ 12	0	1100199	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF07895 ( PF07895   Protein of unknown function (DUF1673) )
	B> PF06610 ( PF06610   Protein of unknown function (DUF1144) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF07895 nor PF06610 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 390 ) 6758191_PF02269_PF04719            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02269  is 6732009 with Jaccard = 0.9672	|PF02269|=60 [ 59	1	1100150	1 ]
 parent [ 6732009 ] : 6758191 0.0151796 (=41/(37*73)) 99.1644
  given [ 6732009 ] : 6732009 0.0348259 (=14/(6*67)) 96.9421
	best keyword for cluster 6732009 is PF02269 with Jaccard = 0.9672	[ 59	1	1100150	1 ]	0.9833 0.9833
sibling [ 6732009 ] : 6725481 0.0555556 (=2/(1*36)) 96.1667
	best keyword for cluster 6725481 is PF04719 with Jaccard = 0.9706	[ 33	0	1100177	1 ]	1.0000 0.9706

	SUGGESTING RELATEDNESS OF:   
	A> PF02269 ( PF02269   Transcription initiation factor IID, 18kD subunit )
	B> PF04719 ( PF04719   hTAFII28-like protein conserved region )

	they come from the same clan: CL0012.11 : PF02969	PF00125	PF00808	PF07524	PF04719	PF02269	PF02291	PF03847
	the two keywords do not coincide on UniRef90 proteins

	both PF02269 and PF04719 have PDB structures
		
	SUPERFAM mapping significantly overlapping:
     1	PF04719 SSF47113 0.834 (average over 77 mutual instances, PF04719 77 appearances, SSF47113 7440 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 391 ) 6707884_PF03962_PF07106            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF07106  is 6597200 with Jaccard = 0.9667	|PF07106|=30 [ 29	0	1100181	1 ]
 parent [ 6597200 ] : 6707884 0.0764007 (=90/(31*38)) 93.6075
  given [ 6597200 ] : 6597200 0.4 (=52/(26*5)) 60.4103
	best keyword for cluster 6597200 is PF07106 with Jaccard = 0.9667	[ 29	0	1100181	1 ]	1.0000 0.9667
sibling [ 6597200 ] : 6641149 0.228571 (=24/(3*35)) 77.347
	best keyword for cluster 6641149 is PF03962 with Jaccard = 1.0000	[ 28	0	1100183	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF07106 ( PF07106   Tat binding protein 1(TBP-1)-interacting protein (TBPIP) )
	B> PF03962 ( PF03962   Mnd1 family )

	Only A has a clan ( CL0123.12 ).
	the two keywords do not coincide on UniRef90 proteins

	Neither PF07106 nor PF03962 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 392 ) 6706286_PF07587_PF07627            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF07627  is 6382483 with Jaccard = 0.9667	|PF07627|=30 [ 29	0	1100181	1 ]
 parent [ 6382483 ] : 6706286 0.0829538 (=255/(29*106)) 93.3371
  given [ 6382483 ] : 6382483 1 (=28/(1*28)) 0.000698447
	best keyword for cluster 6382483 is PF07627 with Jaccard = 0.9667	[ 29	0	1100181	1 ]	1.0000 0.9667
sibling [ 6382483 ] : 6690681 0.127273 (=133/(11*95)) 90.4088
	best keyword for cluster 6690681 is PF07587 with Jaccard = 0.7108	[ 59	24	1100128	0 ]	0.7108 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF07627 ( PF07627   Protein of unknown function (DUF1588) )
	B> PF07587 ( PF07587   Protein of unknown function (DUF1553) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF07627 nor PF07587 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 393 ) 6747048_PF06725_PF06737            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF06725  is 6735474 with Jaccard = 0.9663	|PF06725|=175 [ 172	3	1100033	3 ]
 parent [ 6735474 ] : 6747048 0.0165497 (=566/(225*152)) 98.3939
  given [ 6735474 ] : 6735474 0.0279811 (=130/(202*23)) 97.3184
	best keyword for cluster 6735474 is PF06725 with Jaccard = 0.9663	[ 172	3	1100033	3 ]	0.9829 0.9829
sibling [ 6735474 ] : 6741857 0.0295921 (=111/(31*121)) 97.9582
	best keyword for cluster 6741857 is PF06737 with Jaccard = 0.6190	[ 78	33	1100085	15 ]	0.7027 0.8387

	SUGGESTING RELATEDNESS OF:   
	A> PF06725 ( PF06725   3D domain )
	B> PF06737 ( PF06737   Transglycosylase-like domain )

	A and B come from different clans ( CL0199.7 , CL0037.9 ).
	the two keywords do not coincide on UniRef90 proteins

	both PF06725 and PF06737 have PDB structures
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 394 ) 6745596_PF05163_PF07609            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF05163  is 6722677 with Jaccard = 0.9659	|PF05163|=88 [ 85	0	1100123	3 ]
 parent [ 6722677 ] : 6745596 0.0254689 (=239/(136*69)) 98.2807
  given [ 6722677 ] : 6722677 0.0620783 (=138/(117*19)) 95.8099
	best keyword for cluster 6722677 is PF05163 with Jaccard = 0.9659	[ 85	0	1100123	3 ]	1.0000 0.9659
sibling [ 6722677 ] : 6735499 0.0336842 (=32/(19*50)) 97.3218
	best keyword for cluster 6735499 is PF07609 with Jaccard = 0.8571	[ 12	2	1100197	0 ]	0.8571 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF05163 ( PF05163   DinB family )
	B> PF07609 ( PF07609   Protein of unknown function (DUF1572) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF05163 nor PF07609 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 395 ) 6769854_PF01193_PF03971            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01193  is 6757657 with Jaccard = 0.9655	|PF01193|=477 [ 476	16	1099718	1 ]
 parent [ 6757657 ] : 6769854 0.00438292 (=455/(633*164)) 99.7061
  given [ 6757657 ] : 6757657 0.0136474 (=975/(486*147)) 99.1322
	best keyword for cluster 6757657 is PF01193 with Jaccard = 0.9655	[ 476	16	1099718	1 ]	0.9675 0.9979
sibling [ 6757657 ] : 6767454 0.00513652 (=19/(27*137)) 99.6161
	best keyword for cluster 6767454 is PF03971 with Jaccard = 0.9737	[ 74	2	1100135	0 ]	0.9737 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF01193 ( PF01193   RNA polymerase Rpb3/Rpb11 dimerisation domain )
	B> PF03971 ( PF03971   Monomeric isocitrate dehydrogenase )

	Only B has a clan ( CL0270.2 ).
	the two keywords do not coincide on UniRef90 proteins

	both PF01193 and PF03971 have PDB structures
		PF03971 	   c.77.1.1  c.77.1.2

	SUPERFAM mapping significantly overlapping:
     1	PF01193 SSF55257 0.889 (average over 2125 mutual instances, PF01193 5241 appearances, SSF55257 5319 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 396 ) 6764061_PF00723_PF02446            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02446  is 6555483 with Jaccard = 0.9655	|PF02446|=174 [ 168	0	1100037	6 ]
 parent [ 6555483 ] : 6764061 0.00593233 (=317/(183*292)) 99.4687
  given [ 6555483 ] : 6555483 0.588398 (=213/(2*181)) 43.0567
	best keyword for cluster 6555483 is PF02446 with Jaccard = 0.9655	[ 168	0	1100037	6 ]	1.0000 0.9655
sibling [ 6555483 ] : 6754817 0.0111959 (=88/(262*30)) 98.9578
	best keyword for cluster 6754817 is PF00723 with Jaccard = 0.8739	[ 201	27	1099981	2 ]	0.8816 0.9901

	SUGGESTING RELATEDNESS OF:   
	A> PF02446 ( PF02446   4-alpha-glucanotransferase )
	B> PF00723 ( PF00723   Glycosyl hydrolases family 15 )

	A and B come from different clans ( CL0058.10 , CL0059.10 ).
	the two keywords do not coincide on UniRef90 proteins

	both PF02446 and PF00723 have PDB structures
	PF02446 	   c.1.8.1
	PF00723 	   a.102.1.1  a.102.1.5

	SUPERFAM mapping significantly overlapping:
     1	PF00723 SSF48208 0.911 (average over 498 mutual instances, PF00723 605 appearances, SSF48208 6032 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 397 ) 6763259_PF00589_PF07512            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00589  is 6760844 with Jaccard = 0.9653	|PF00589|=2815 [ 2757	41	1097355	58 ]
 parent [ 6760844 ] : 6763259 0.00656587 (=1381/(3895*54)) 99.4296
  given [ 6760844 ] : 6760844 0.0096304 (=1961/(3842*53)) 99.3098
	best keyword for cluster 6760844 is PF00589 with Jaccard = 0.9653	[ 2757	41	1097355	58 ]	0.9853 0.9794
sibling [ 6760844 ] : 6761400 0.0188679 (=1/(1*53)) 99.3396
	best keyword for cluster 6761400 is PF07512 with Jaccard = 0.9231	[ 36	1	1100172	2 ]	0.9730 0.9474

	SUGGESTING RELATEDNESS OF:   
	A> PF00589 ( PF00589   Phage integrase family )
	B> PF07512 ( PF07512   Protein of unknown function (DUF1526) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only PF00589 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:
     1	PF00589 SSF56349 0.828 (average over 8330 mutual instances, PF00589 11867 appearances, SSF56349 10914 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 398 ) 6713233_PF06850_PF07167            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF06850  is 6481929 with Jaccard = 0.9649	|PF06850|=57 [ 55	0	1100154	2 ]
 parent [ 6481929 ] : 6713233 0.0623937 (=1174/(64*294)) 94.439
  given [ 6481929 ] : 6481929 0.952381 (=60/(1*63)) 6.12756
	best keyword for cluster 6481929 is PF06850 with Jaccard = 0.9649	[ 55	0	1100154	2 ]	1.0000 0.9649
sibling [ 6481929 ] : 6699063 0.0950722 (=191/(7*287)) 92.0099
	best keyword for cluster 6699063 is PF07167 with Jaccard = 0.8445	[ 201	37	1099973	0 ]	0.8445 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF06850 ( PF06850   PHB de-polymerase C-terminus )
	B> PF07167 ( PF07167   Poly-beta-hydroxybutyrate polymerase (PhaC) N-terminus )

	Only A has a clan ( CL0028.14 ).
	the two keywords do not coincide on UniRef90 proteins

	Neither PF06850 nor PF07167 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 399 ) 6646401_PF04808_PF05515            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF05515  is 6600116 with Jaccard = 0.9643	|PF05515|=28 [ 27	0	1100183	1 ]
 parent [ 6600116 ] : 6646401 0.276498 (=60/(31*7)) 78.9058
  given [ 6600116 ] : 6600116 0.453704 (=49/(4*27)) 61.6409
	best keyword for cluster 6600116 is PF05515 with Jaccard = 0.9643	[ 27	0	1100183	1 ]	1.0000 0.9643
sibling [ 6600116 ] : 6603207 0.5 (=5/(5*2)) 63.2
	best keyword for cluster 6603207 is PF04808 with Jaccard = 1.0000	[ 4	0	1100207	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF05515 ( PF05515   Viral nucleic acid binding )
	B> PF04808 ( PF04808   Citrus tristeza virus (CTV) P23 protein )

	they come from the same clan: CL0140.6 : PF01623	PF04808	PF05515
	the two keywords do not coincide on UniRef90 proteins

	Neither PF05515 nor PF04808 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 400 ) 6675740_PF05171_PF06228            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF06228  is 6246500 with Jaccard = 0.9643	|PF06228|=28 [ 27	0	1100183	1 ]
 parent [ 6246500 ] : 6675740 0.162162 (=162/(27*37)) 87.0664
  given [ 6246500 ] : 6246500 1 (=180/(15*12)) 1.3411e-13
	best keyword for cluster 6246500 is PF06228 with Jaccard = 0.9643	[ 27	0	1100183	1 ]	1.0000 0.9643
sibling [ 6246500 ] : 6650456 0.222222 (=8/(1*36)) 80.2005
	best keyword for cluster 6650456 is PF05171 with Jaccard = 1.0000	[ 29	0	1100182	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF06228 ( PF06228   Protein of unknown function (DUF1008) )
	B> PF05171 ( PF05171   Haemin-degrading family )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only PF06228 has a PDB structure (may not be up to date)
		PF05171 	   e.62.1.1

	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 401 ) 6725827_PF04357_PF05170            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04357  is 6700171 with Jaccard = 0.9640	|PF04357|=137 [ 134	2	1100072	3 ]
 parent [ 6700171 ] : 6725827 0.0495423 (=4005/(215*376)) 96.2123
  given [ 6700171 ] : 6700171 0.0930864 (=789/(52*163)) 92.202
	best keyword for cluster 6700171 is PF04357 with Jaccard = 0.9640	[ 134	2	1100072	3 ]	0.9853 0.9781
sibling [ 6700171 ] : 6714825 0.0662837 (=2054/(122*254)) 94.6996
	best keyword for cluster 6714825 is PF05170 with Jaccard = 0.6635	[ 140	49	1100000	22 ]	0.7407 0.8642

	SUGGESTING RELATEDNESS OF:   
	A> PF04357 ( PF04357   Family of unknown function (DUF490) )
	B> PF05170 ( PF05170   AsmA family )

	Neither A nor B are assigned a clan.

	the two keywords coincide on UniRef90 proteins: |PF04357| = 137 ,	 |PF05170| = 162 , 	|PF04357^PF05170| = 5	( 3.6% and 3.1% )

	Neither PF04357 nor PF05170 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 402 ) 6764413_PF00994_PF01507            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01507  is 6703055 with Jaccard = 0.9637	|PF01507|=429 [ 425	12	1099770	4 ]
 parent [ 6703055 ] : 6764413 0.00563274 (=3609/(494*1297)) 99.4849
  given [ 6703055 ] : 6703055 0.0901515 (=1785/(44*450)) 92.7302
	best keyword for cluster 6703055 is PF01507 with Jaccard = 0.9637	[ 425	12	1099770	4 ]	0.9725 0.9907
sibling [ 6703055 ] : 6737864 0.0259578 (=6239/(224*1073)) 97.5717
	best keyword for cluster 6737864 is PF00994 with Jaccard = 0.6889	[ 815	356	1099028	12 ]	0.6960 0.9855

	SUGGESTING RELATEDNESS OF:   
	A> PF01507 ( PF01507   Phosphoadenosine phosphosulfate reductase family )
	B> PF00994 ( PF00994   Probable molybdopterin binding domain )

	Only A has a clan ( CL0039.7 ).

	the two keywords coincide on UniRef90 proteins: |PF00994| = 827 ,	 |PF01507| = 429 , 	|PF00994^PF01507| = 9	( 1.1% and 2.1% )

	both PF01507 and PF00994 have PDB structures
		
	SUPERFAM mapping significantly overlapping:
     1	PF00994 SSF53218 0.869 (average over 2574 mutual instances, PF00994 4737 appearances, SSF53218 5120 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 403 ) 6712763_PF02581_PF08543            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02581  is 6681958 with Jaccard = 0.9637	|PF02581|=386 [ 372	0	1099825	14 ]
 parent [ 6681958 ] : 6712763 0.0572249 (=10345/(442*409)) 94.3662
  given [ 6681958 ] : 6681958 0.130247 (=211/(4*405)) 88.6761
	best keyword for cluster 6681958 is PF02581 with Jaccard = 0.9637	[ 372	0	1099825	14 ]	1.0000 0.9637
sibling [ 6681958 ] : 6619897 0.3 (=264/(2*440)) 70.4714
	best keyword for cluster 6619897 is PF08543 with Jaccard = 0.7875	[ 315	67	1099811	18 ]	0.8246 0.9459

	SUGGESTING RELATEDNESS OF:   
	A> PF02581 ( PF02581   Thiamine monophosphate synthase/TENI )
	B> PF08543 ( PF08543   Phosphomethylpyrimidine kinase )

	A and B come from different clans ( CL0036.17 , CL0118.7 ).

	the two keywords coincide on UniRef90 proteins: |PF02581| = 386 ,	 |PF08543| = 333 , 	|PF02581^PF08543| = 22	( 5.7% and 6.6% )

	both PF02581 and PF08543 have PDB structures
		
	SUPERFAM mapping significantly overlapping:
     1	PF02581 SSF51391 0.858 (average over 1103 mutual instances, PF02581 1176 appearances, SSF51391 1335 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 404 ) 6692708_PF01180_PF01207            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01180  is 6682050 with Jaccard = 0.9631	|PF01180|=404 [ 392	3	1099804	12 ]
 parent [ 6682050 ] : 6692708 0.10585 (=27875/(604*436)) 90.8217
  given [ 6682050 ] : 6682050 0.116319 (=201/(4*432)) 88.7083
	best keyword for cluster 6682050 is PF01180 with Jaccard = 0.9631	[ 392	3	1099804	12 ]	0.9924 0.9703
sibling [ 6682050 ] : 6651219 0.210372 (=2219/(18*586)) 80.4374
	best keyword for cluster 6651219 is PF01207 with Jaccard = 0.9873	[ 545	3	1099659	4 ]	0.9945 0.9927

	SUGGESTING RELATEDNESS OF:   
	A> PF01180 ( PF01180   Dihydroorotate dehydrogenase )
	B> PF01207 ( PF01207   Dihydrouridine synthase (Dus) )

	they come from the same clan: CL0036.17 : PF05690	PF01680	PF00834	PF01729	PF00697	PF03740	PF01884	PF00724	PF00215	PF03060	PF04095	PF04131	PF00478	PF00218	PF00977	PF01645	PF04309	PF01070	PF01207	PF04481	PF04476	PF01180	PF00701	PF01791	PF03932	PF03437	PF01081	PF00121	PF09370	PF02581	PF00290
	the two keywords do not coincide on UniRef90 proteins

	both PF01180 and PF01207 have PDB structures
	PF01180 	   c.1.4.1
	PF01207 	   c.1.4.1

	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 405 ) 6769456_PF01263_PF06799            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01263  is 6762597 with Jaccard = 0.9631	|PF01263|=406 [ 392	1	1099804	14 ]
 parent [ 6762597 ] : 6769456 0.00375536 (=77/(44*466)) 99.692
  given [ 6762597 ] : 6762597 0.0062004 (=50/(18*448)) 99.3995
	best keyword for cluster 6762597 is PF01263 with Jaccard = 0.9631	[ 392	1	1099804	14 ]	0.9975 0.9655
sibling [ 6762597 ] : 6756687 0.0130719 (=6/(17*27)) 99.0719
	best keyword for cluster 6756687 is PF06799 with Jaccard = 0.9583	[ 23	1	1100187	0 ]	0.9583 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF01263 ( PF01263   Aldose 1-epimerase )
	B> PF06799 ( PF06799   Protein of unknown function (DUF1230) )

	Only A has a clan ( CL0103.7 ).
	the two keywords do not coincide on UniRef90 proteins

	only PF01263 has a PDB structure (may not be up to date)
	PF01263 	   b.30.5.4  b.30.5.7
	
	SUPERFAM mapping significantly overlapping:
     1	PF01263 SSF74650 0.881 (average over 1335 mutual instances, PF01263 1358 appearances, SSF74650 5571 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 406 ) 6661132_PF01863_PF08325            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01863  is 6637941 with Jaccard = 0.9630	|PF01863|=240 [ 234	3	1099968	6 ]
 parent [ 6637941 ] : 6661132 0.226735 (=2417/(41*260)) 83.5231
  given [ 6637941 ] : 6637941 0.28418 (=291/(4*256)) 76.474
	best keyword for cluster 6637941 is PF01863 with Jaccard = 0.9630	[ 234	3	1099968	6 ]	0.9873 0.9750
sibling [ 6637941 ] : 6559434 0.551282 (=43/(2*39)) 46.2715
	best keyword for cluster 6559434 is PF08325 with Jaccard = 0.8605	[ 37	4	1100168	2 ]	0.9024 0.9487

	SUGGESTING RELATEDNESS OF:   
	A> PF01863 ( PF01863   Protein of unknown function DUF45 )
	B> PF08325 ( PF08325   WLM domain )

	they come from the same clan: CL0126.12 : PF08325	PF01421	PF01752	PF01457	PF02031	PF09471	PF05299	PF05547	PF05572	PF01434	PF01447	PF02128	PF02102	PF02074	PF01432	PF01742	PF01401	PF01431	PF05548	PF00413	PF01433	PF01863	PF07998	PF01400
	the two keywords do not coincide on UniRef90 proteins

	Neither PF01863 nor PF08325 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 407 ) 6759296_PF00164_PF01176            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01176  is 6705413 with Jaccard = 0.9628	|PF01176|=260 [ 259	9	1099942	1 ]
 parent [ 6705413 ] : 6759296 0.0125169 (=1186/(288*329)) 99.227
  given [ 6705413 ] : 6705413 0.0899031 (=1698/(187*101)) 93.1773
	best keyword for cluster 6705413 is PF01176 with Jaccard = 0.9628	[ 259	9	1099942	1 ]	0.9664 0.9962
sibling [ 6705413 ] : 6754276 0.0165533 (=73/(14*315)) 98.9237
	best keyword for cluster 6754276 is PF00164 with Jaccard = 0.9929	[ 278	1	1099931	1 ]	0.9964 0.9964

	SUGGESTING RELATEDNESS OF:   
	A> PF01176 ( PF01176   Translation initiation factor 1A / IF-1 )
	B> PF00164 ( PF00164   Ribosomal protein S12 )

	they come from the same clan: CL0021.12 : PF08402	PF03459	PF02765	PF00436	PF00575	PF01330	PF03870	PF00366	PF00164	PF00181	PF07497	PF04057	PF02303	PF08206	PF03919	PF01287	PF01176	PF01132	PF04076	PF03120	PF00313	PF01336	PF01588
	the two keywords do not coincide on UniRef90 proteins

	both PF01176 and PF00164 have PDB structures
	PF01176 	   b.40.4.5
	
	SUPERFAM mapping significantly overlapping:
     1	PF01176 SSF50249 0.835 (average over 1235 mutual instances, PF01176 1253 appearances, SSF50249 52669 appearances)
     2	PF00164 SSF50249 0.965 (average over 1574 mutual instances, PF00164 1579 appearances, SSF50249 52669 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 408 ) 6715860_PF00696_PF00742            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00696  is 6715372 with Jaccard = 0.9625	|PF00696|=1324 [ 1282	8	1098879	42 ]
 parent [ 6715372 ] : 6715860 0.0540314 (=24147/(311*1437)) 94.8587
  given [ 6715372 ] : 6715372 0.0668703 (=1708/(18*1419)) 94.7786
	best keyword for cluster 6715372 is PF00696 with Jaccard = 0.9625	[ 1282	8	1098879	42 ]	0.9938 0.9683
sibling [ 6715372 ] : 6653251 0.251613 (=78/(1*310)) 81.0781
	best keyword for cluster 6653251 is PF00742 with Jaccard = 0.7600	[ 247	26	1099886	52 ]	0.9048 0.8261

	SUGGESTING RELATEDNESS OF:   
	A> PF00696 ( PF00696   Amino acid kinase family )
	B> PF00742 ( PF00742   Homoserine dehydrogenase )

	Neither A nor B are assigned a clan.

	the two keywords coincide on UniRef90 proteins: |PF00696| = 1324 ,	 |PF00742| = 299 , 	|PF00696^PF00742| = 66	( 5.0% and 22.1% )

	both PF00696 and PF00742 have PDB structures
	PF00696 	   c.73.1.1  c.73.1.2  c.73.1.3
	PF00742 	   d.81.1.2

	SUPERFAM mapping significantly overlapping:
     1	PF00696 SSF53633 0.922 (average over 4687 mutual instances, PF00696 5933 appearances, SSF53633 7277 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 409 ) 6694770_PF01490_PF03222            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01490  is 6687942 with Jaccard = 0.9623	|PF01490|=664 [ 639	0	1099547	25 ]
 parent [ 6687942 ] : 6694770 0.108887 (=13129/(175*689)) 91.2519
  given [ 6687942 ] : 6687942 0.112044 (=307/(4*685)) 89.8465
	best keyword for cluster 6687942 is PF01490 with Jaccard = 0.9623	[ 639	0	1099547	25 ]	1.0000 0.9623
sibling [ 6687942 ] : 6677733 0.135659 (=70/(3*172)) 87.6466
	best keyword for cluster 6677733 is PF03222 with Jaccard = 0.8455	[ 104	19	1100088	0 ]	0.8455 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF01490 ( PF01490   Transmembrane amino acid transporter protein )
	B> PF03222 ( PF03222   Tryptophan/tyrosine permease family )

	they come from the same clan: CL0062.8 : PF00860	PF03222	PF02133	PF00916	PF00474	PF03845	PF01235	PF00955	PF07331	PF02361	PF05525	PF03594	PF01490	PF00324
	the two keywords do not coincide on UniRef90 proteins

	Neither PF01490 nor PF03222 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 410 ) 6757909_PF04107_PF04169            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04107  is 6733210 with Jaccard = 0.9615	|PF04107|=155 [ 150	1	1100055	5 ]
 parent [ 6733210 ] : 6757909 0.00892443 (=404/(203*223)) 99.1493
  given [ 6733210 ] : 6733210 0.0402697 (=209/(30*173)) 97.0758
	best keyword for cluster 6733210 is PF04107 with Jaccard = 0.9615	[ 150	1	1100055	5 ]	0.9934 0.9677
sibling [ 6733210 ] : 6754704 0.0135135 (=3/(1*222)) 98.9504
	best keyword for cluster 6754704 is PF04169 with Jaccard = 0.6082	[ 104	67	1100040	0 ]	0.6082 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF04107 ( PF04107   Glutamate-cysteine ligase family 2(GCS2) )
	B> PF04169 ( PF04169   Domain of unknown function (DUF404) )

	Neither A nor B are assigned a clan.

	the two keywords coincide on UniRef90 proteins: |PF04107| = 155 ,	 |PF04169| = 104 , 	|PF04107^PF04169| = 3	( 1.9% and 2.9% )

	only PF04107 has a PDB structure (may not be up to date)
	PF04107 	   d.128.1.3
	
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 411 ) 6758455_PF01032_PF02653            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02653  is 6748941 with Jaccard = 0.9612	|PF02653|=2086 [ 2006	1	1098124	80 ]
 parent [ 6748941 ] : 6758455 0.0121088 (=36405/(1338*2247)) 99.1798
  given [ 6748941 ] : 6748941 0.0165879 (=885/(24*2223)) 98.5394
	best keyword for cluster 6748941 is PF02653 with Jaccard = 0.9612	[ 2006	1	1098124	80 ]	0.9995 0.9616
sibling [ 6748941 ] : 6705925 0.0807105 (=32731/(874*464)) 93.2869
	best keyword for cluster 6705925 is PF01032 with Jaccard = 0.6647	[ 807	405	1098997	2 ]	0.6658 0.9975

	SUGGESTING RELATEDNESS OF:   
	A> PF02653 ( PF02653   Branched-chain amino acid transport system / permease component )
	B> PF01032 ( PF01032   FecCD transport family )

	they come from the same clan: CL0142.6 : PF00950	PF05145	PF02653	PF01032	PF01098
	the two keywords do not coincide on UniRef90 proteins

	only PF02653 has a PDB structure (may not be up to date)
		PF01032 	   f.22.1.1

	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 412 ) 6761875_PF00127_PF02298            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02298  is 6724386 with Jaccard = 0.9610	|PF02298|=153 [ 148	1	1100057	5 ]
 parent [ 6724386 ] : 6761875 0.00982134 (=785/(194*412)) 99.3639
  given [ 6724386 ] : 6724386 0.0561275 (=229/(24*170)) 96.0357
	best keyword for cluster 6724386 is PF02298 with Jaccard = 0.9610	[ 148	1	1100057	5 ]	0.9933 0.9673
sibling [ 6724386 ] : 6759281 0.00963948 (=50/(13*399)) 99.226
	best keyword for cluster 6759281 is PF00127 with Jaccard = 0.8987	[ 213	11	1099974	13 ]	0.9509 0.9425

	SUGGESTING RELATEDNESS OF:   
	A> PF02298 ( PF02298   Plastocyanin-like domain )
	B> PF00127 ( PF00127   Copper binding proteins, plastocyanin/azurin family )

	they come from the same clan: CL0026.14 : PF00394	PF00116	PF00127	PF07731	PF07732	PF02298	PF00812
	the two keywords do not coincide on UniRef90 proteins

	both PF02298 and PF00127 have PDB structures
	PF02298 	   b.6.1.1
	PF00127 	   b.6.1.1  i.4.1.1

	SUPERFAM mapping significantly overlapping:
     1	PF00127 SSF49503 0.766 (average over 579 mutual instances, PF00127 604 appearances, SSF49503 36729 appearances)
     2	PF02298 SSF49503 0.778 (average over 379 mutual instances, PF02298 385 appearances, SSF49503 36729 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 413 ) 6738168_PF02645_PF02734            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02645  is 6615390 with Jaccard = 0.9608	|PF02645|=204 [ 196	0	1100007	8 ]
 parent [ 6615390 ] : 6738168 0.0244044 (=1721/(215*328)) 97.6051
  given [ 6615390 ] : 6615390 0.341195 (=217/(212*3)) 68.597
	best keyword for cluster 6615390 is PF02645 with Jaccard = 0.9608	[ 196	0	1100007	8 ]	1.0000 0.9608
sibling [ 6615390 ] : 6690060 0.111692 (=2307/(243*85)) 90.2543
	best keyword for cluster 6690060 is PF02734 with Jaccard = 0.7448	[ 216	73	1099921	1 ]	0.7474 0.9954

	SUGGESTING RELATEDNESS OF:   
	A> PF02645 ( PF02645   Uncharacterised protein, DegV family COG1307 )
	B> PF02734 ( PF02734   DAK2 domain )

	Only A has a clan ( CL0245.3 ).

	the two keywords coincide on UniRef90 proteins: |PF02645| = 204 ,	 |PF02734| = 217 , 	|PF02645^PF02734| = 8	( 3.9% and 3.7% )

	both PF02645 and PF02734 have PDB structures
	PF02645 	   c.119.1.1
	PF02734 	   a.208.1.1

	SUPERFAM mapping significantly overlapping:
     1	PF02734 SSF101473 0.827 (average over 691 mutual instances, PF02734 692 appearances, SSF101473 907 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 414 ) 6649345_PF00085_PF06201            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF06201  is 6573663 with Jaccard = 0.9605	|PF06201|=74 [ 73	2	1100135	1 ]
 parent [ 6573663 ] : 6649345 0.212868 (=39086/(76*2416)) 79.8577
  given [ 6573663 ] : 6573663 0.493243 (=73/(2*74)) 51.9025
	best keyword for cluster 6573663 is PF06201 with Jaccard = 0.9605	[ 73	2	1100135	1 ]	0.9733 0.9865
sibling [ 6573663 ] : 6647387 0.233106 (=30270/(55*2361)) 79.1709
	best keyword for cluster 6647387 is PF00085 with Jaccard = 0.6089	[ 1470	603	1097797	341 ]	0.7091 0.8117

	SUGGESTING RELATEDNESS OF:   
	A> PF06201 ( PF06201   Domain of Unknown Function (DUF1000) )
	B> PF00085 ( PF00085   Thioredoxin )

	Only B has a clan ( CL0172.11 ).

	the two keywords coincide on Uniref90 proteins: |PF00085| = 1811 ,	 |PF06201| = 74 , 	|PF00085^PF06201| = 22	( 1.2% and 29.7% )

	both PF06201 and PF00085 have PDB structures
	PF06201 	   b.18.1.26
	
	SUPERFAM mapping significantly overlapping:
     1	PF00085 SSF52833 0.811 (average over 4892 mutual instances, PF00085 5078 appearances, SSF52833 34965 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 415 ) 6732968_PF03561_PF04115            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04115  is 6711570 with Jaccard = 0.9600	|PF04115|=50 [ 48	0	1100161	2 ]
 parent [ 6711570 ] : 6732968 0.0297719 (=124/(49*85)) 97.0479
  given [ 6711570 ] : 6711570 0.0583501 (=58/(14*71)) 94.1754
	best keyword for cluster 6711570 is PF04115 with Jaccard = 0.9600	[ 48	0	1100161	2 ]	1.0000 0.9600
sibling [ 6711570 ] : 6448104 1 (=94/(2*47)) 1.16911
	best keyword for cluster 6448104 is PF03561 with Jaccard = 0.9200	[ 46	0	1100161	4 ]	1.0000 0.9200

	SUGGESTING RELATEDNESS OF:   
	A> PF04115 ( PF04115   Ureidoglycolate hydrolase )
	B> PF03561 ( PF03561   Allantoicase repeat )

	Only B has a clan ( CL0202.5 ).

	the two keywords coincide on Uniref90 proteins: |PF03561| = 50 ,	 |PF04115| = 50 , 	|PF03561^PF04115| = 2	( 4.0% and 4.0% )

	both PF04115 and PF03561 have PDB structures
		PF03561 	   b.18.1.22

	SUPERFAM mapping significantly overlapping:
     1	PF03561 SSF49785 0.887 (average over 165 mutual instances, PF03561 166 appearances, SSF49785 13919 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 416 ) 6630082_PF04991_PF06828            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04991  is 6608805 with Jaccard = 0.9592	|PF04991|=48 [ 47	1	1100162	1 ]
 parent [ 6608805 ] : 6630082 0.29826 (=360/(17*71)) 74.6295
  given [ 6608805 ] : 6608805 0.346154 (=135/(6*65)) 66.4323
	best keyword for cluster 6608805 is PF04991 with Jaccard = 0.9592	[ 47	1	1100162	1 ]	0.9792 0.9792
sibling [ 6608805 ] : 6589624 0.428571 (=30/(7*10)) 57.1429
	best keyword for cluster 6589624 is PF06828 with Jaccard = 0.9231	[ 12	0	1100198	1 ]	1.0000 0.9231

	SUGGESTING RELATEDNESS OF:   
	A> PF04991 ( PF04991   LICD Protein Family )
	B> PF06828 ( PF06828   Fukutin-related )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF04991 nor PF06828 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 417 ) 6769494_PF03611_PF05437            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF05437  is 6761469 with Jaccard = 0.9592	|PF05437|=147 [ 141	0	1100064	6 ]
 parent [ 6761469 ] : 6769494 0.00499182 (=61/(188*65)) 99.6936
  given [ 6761469 ] : 6761469 0.00803571 (=27/(20*168)) 99.3431
	best keyword for cluster 6761469 is PF05437 with Jaccard = 0.9592	[ 141	0	1100064	6 ]	1.0000 0.9592
sibling [ 6761469 ] : 6768874 0.015625 (=1/(1*64)) 99.6719
	best keyword for cluster 6768874 is PF03611 with Jaccard = 1.0000	[ 34	0	1100177	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF05437 ( PF05437   Branched-chain amino acid transport protein (AzlD) )
	B> PF03611 ( PF03611   PTS system Galactitol-specific IIC component )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF05437 nor PF03611 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 418 ) 6384589_PF06071_PF08438            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF08438  is 6203727 with Jaccard = 0.9592	|PF08438|=47 [ 47	2	1100162	0 ]
 parent [ 6203727 ] : 6384589 1 (=13328/(272*49)) 0.000979999
  given [ 6203727 ] : 6203727 1 (=558/(31*18)) 5.78851e-17
	best keyword for cluster 6203727 is PF08438 with Jaccard = 0.9592	[ 47	2	1100162	0 ]	0.9592 1.0000
sibling [ 6203727 ] : 6072285 1 (=2620/(10*262)) 7.63809e-28
	best keyword for cluster 6072285 is PF06071 with Jaccard = 0.9288	[ 248	3	1099944	16 ]	0.9880 0.9394

	SUGGESTING RELATEDNESS OF:   
	A> PF08438 ( PF08438   GTPase of unknown function C-terminal )
	B> PF06071 ( PF06071   Protein of unknown function (DUF933) )

	Only B has a clan ( CL0072.14 ).
	the two keywords do not coincide on UniRef90 proteins

	only  PF08438 has a PDB structure (may not be up to date)
		PF06071 	   d.15.10.2

	SUPERFAM mapping significantly overlapping:
     1	PF06071 SSF81271 0.987 (average over 918 mutual instances, PF06071 920 appearances, SSF81271 8501 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 419 ) 6608034_PF00456_PF02780            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00456  is 6598008 with Jaccard = 0.9585	|PF00456|=515 [ 508	15	1099681	7 ]
 parent [ 6598008 ] : 6608034 0.355364 (=179106/(809*623)) 65.9252
  given [ 6598008 ] : 6598008 0.435691 (=271/(1*622)) 60.5762
	best keyword for cluster 6598008 is PF00456 with Jaccard = 0.9585	[ 508	15	1099681	7 ]	0.9713 0.9864
sibling [ 6598008 ] : 6527603 0.762687 (=3066/(5*804)) 24.8444
	best keyword for cluster 6527603 is PF02780 with Jaccard = 0.6511	[ 724	20	1099099	368 ]	0.9731 0.6630

	SUGGESTING RELATEDNESS OF:   
	A> PF00456 ( PF00456   Transketolase, thiamine diphosphate binding domain )
	B> PF02780 ( PF02780   Transketolase, C-terminal domain )

	Only A has a clan ( CL0254.3 ).

	the two keywords coincide on Uniref90 proteins: |PF00456| = 515 ,	 |PF02780| = 1092 , 	|PF00456^PF02780| = 312	( 60.6% and 28.6% )

	both PF00456 and PF02780 have PDB structures
		
	SUPERFAM mapping significantly overlapping:
     1	PF02780 SSF52922 0.877 (average over 3553 mutual instances, PF02780 3663 appearances, SSF52922 11092 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 420 ) 6524758_PF01561_PF07948            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01561  is 6466261 with Jaccard = 0.9583	|PF01561|=24 [ 23	0	1100187	1 ]
 parent [ 6466261 ] : 6524758 0.796875 (=153/(8*24)) 22.9738
  given [ 6466261 ] : 6466261 0.968254 (=61/(21*3)) 3.17461
	best keyword for cluster 6466261 is PF01561 with Jaccard = 0.9583	[ 23	0	1100187	1 ]	1.0000 0.9583
sibling [ 6466261 ] : 6417709 1 (=7/(1*7)) 0.08
	best keyword for cluster 6417709 is PF07948 with Jaccard = 1.0000	[ 7	0	1100204	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF01561 ( PF01561   Hantavirus glycoprotein G2 )
	B> PF07948 ( PF07948   Nairovirus M polyprotein-like )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF01561 nor PF07948 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 421 ) 6766089_PF01042_PF04013            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04013  is 6705175 with Jaccard = 0.9583	|PF04013|=23 [ 23	1	1100187	0 ]
 parent [ 6705175 ] : 6766089 0.00520186 (=159/(31*986)) 99.559
  given [ 6705175 ] : 6705175 0.0692308 (=9/(5*26)) 93.1417
	best keyword for cluster 6705175 is PF04013 with Jaccard = 0.9583	[ 23	1	1100187	0 ]	0.9583 1.0000
sibling [ 6705175 ] : 6744650 0.0224042 (=197/(9*977)) 98.2046
	best keyword for cluster 6744650 is PF01042 with Jaccard = 0.8986	[ 780	77	1099343	11 ]	0.9102 0.9861

	SUGGESTING RELATEDNESS OF:   
	A> PF04013 ( PF04013   Protein of unknown function (DUF358) )
	B> PF01042 ( PF01042   Endoribonuclease L-PSP )

	Only A has a clan ( CL0098.7 ).
	the two keywords do not coincide on UniRef90 proteins

	only  PF04013 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:
     1	PF01042 SSF55298 0.919 (average over 2576 mutual instances, PF01042 2591 appearances, SSF55298 2787 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 422 ) 6573169_PF06800_PF07857            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF07857  is 6364109 with Jaccard = 0.9583	|PF07857|=24 [ 23	0	1100187	1 ]
 parent [ 6364109 ] : 6573169 0.551087 (=507/(23*40)) 51.7544
  given [ 6364109 ] : 6364109 1 (=120/(8*15)) 4.417e-05
	best keyword for cluster 6364109 is PF07857 with Jaccard = 0.9583	[ 23	0	1100187	1 ]	1.0000 0.9583
sibling [ 6364109 ] : 6370256 1 (=39/(1*39)) 0.000102566
	best keyword for cluster 6370256 is PF06800 with Jaccard = 0.9730	[ 36	0	1100174	1 ]	1.0000 0.9730

	SUGGESTING RELATEDNESS OF:   
	A> PF07857 ( PF07857   CEO family (DUF1632) )
	B> PF06800 ( PF06800   Sugar transport protein )

	they come from the same clan: CL0184.5 : PF07857	PF04342	PF00892	PF05653	PF06027	PF00893	PF04142	PF06379	PF06800	PF03151	PF08449	PF02694
	the two keywords do not coincide on UniRef90 proteins

	Neither PF07857 nor PF06800 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:
     1	PF06800 SSF103473 0.646 (average over 1 mutual instances, PF06800 1 appearances, SSF103473 39293 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 423 ) 6767726_PF04521_PF04909            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04909  is 6736195 with Jaccard = 0.9578	|PF04909|=497 [ 477	1	1099713	20 ]
 parent [ 6736195 ] : 6767726 0.00426991 (=66/(533*29)) 99.6273
  given [ 6736195 ] : 6736195 0.0359814 (=2280/(354*179)) 97.3961
	best keyword for cluster 6736195 is PF04909 with Jaccard = 0.9578	[ 477	1	1099713	20 ]	0.9979 0.9598
sibling [ 6736195 ] : 6744628 0.0315789 (=6/(10*19)) 98.2037
	best keyword for cluster 6744628 is PF04521 with Jaccard = 0.9167	[ 11	1	1100199	0 ]	0.9167 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF04909 ( PF04909   Amidohydrolase )
	B> PF04521 ( PF04521   ssRNA positive strand viral 18kD cysteine rich protein )

	Only A has a clan ( CL0034.9 ).
	the two keywords do not coincide on UniRef90 proteins

	only  PF04909 has a PDB structure (may not be up to date)
	PF04909 	   c.1.9.15
	
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 424 ) 6716601_PF03681_PF05534            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF03681  is 6714979 with Jaccard = 0.9576	|PF03681|=235 [ 226	1	1099975	9 ]
 parent [ 6714979 ] : 6716601 0.0586912 (=1070/(59*309)) 94.9928
  given [ 6714979 ] : 6714979 0.0649065 (=118/(6*303)) 94.7177
	best keyword for cluster 6714979 is PF03681 with Jaccard = 0.9576	[ 226	1	1099975	9 ]	0.9956 0.9617
sibling [ 6714979 ] : 6556240 0.588663 (=405/(43*16)) 43.8636
	best keyword for cluster 6556240 is PF05534 with Jaccard = 0.8163	[ 40	0	1100162	9 ]	1.0000 0.8163

	SUGGESTING RELATEDNESS OF:   
	A> PF03681 ( PF03681   Uncharacterised protein family (UPF0150) )
	B> PF05534 ( PF05534   HicB family )

	Only B has a clan ( CL0057.9 ).

	the two keywords coincide on Uniref90 proteins: |PF03681| = 235 ,	 |PF05534| = 49 , 	|PF03681^PF05534| = 9	( 3.8% and 18.4% )

	Neither PF03681 nor PF05534 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:
     1	PF05534 SSF47598 0.702 (average over 13 mutual instances, PF05534 13 appearances, SSF47598 883 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 425 ) 6606929_PF00400_PF08145            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF08145  is 6590869 with Jaccard = 0.9574	|PF08145|=46 [ 45	1	1100164	1 ]
 parent [ 6590869 ] : 6606929 0.402307 (=79858/(50*3970)) 65.0467
  given [ 6590869 ] : 6590869 0.427083 (=41/(2*48)) 57.546
	best keyword for cluster 6590869 is PF08145 with Jaccard = 0.9574	[ 45	1	1100164	1 ]	0.9783 0.9783
sibling [ 6590869 ] : 6605419 0.38941 (=41457/(27*3943)) 64.2818
	best keyword for cluster 6605419 is PF00400 with Jaccard = 0.6629	[ 3780	20	1094509	1902 ]	0.9947 0.6653

	SUGGESTING RELATEDNESS OF:   
	A> PF08145 ( PF08145   BOP1NT (NUC169) domain )
	B> PF00400 ( PF00400   WD domain, G-beta repeat )

	Only B has a clan ( CL0186.8 ).

	the two keywords coincide on Uniref90 proteins: |PF00400| = 5682 ,	 |PF08145| = 46 , 	|PF00400^PF08145| = 37	( 0.7% and 80.4% )

	only  PF08145 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 426 ) 6722011_PF01037_PF06018            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01037  is 6699533 with Jaccard = 0.9569	|PF01037|=830 [ 821	28	1099353	9 ]
 parent [ 6699533 ] : 6722011 0.0687566 (=4119/(1051*57)) 95.7055
  given [ 6699533 ] : 6699533 0.0942857 (=99/(1*1050)) 92.1017
	best keyword for cluster 6699533 is PF01037 with Jaccard = 0.9569	[ 821	28	1099353	9 ]	0.9670 0.9892
sibling [ 6699533 ] : 6704992 0.0714286 (=28/(8*49)) 93.0987
	best keyword for cluster 6704992 is PF06018 with Jaccard = 0.9070	[ 39	4	1100168	0 ]	0.9070 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF01037 ( PF01037   AsnC family )
	B> PF06018 ( PF06018   CodY GAF-like domain )

	A and B come from different clans ( CL0032.9 , CL0161.7 ).
	the two keywords do not coincide on UniRef90 proteins

	only  PF01037 has a PDB structure (may not be up to date)
	PF01037 	   d.58.4.2
	
	SUPERFAM mapping significantly overlapping:
     1	PF01037 SSF54909 0.871 (average over 3219 mutual instances, PF01037 3221 appearances, SSF54909 7040 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 427 ) 6751998_PF02610_PF02952            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02952  is 6741924 with Jaccard = 0.9565	|PF02952|=22 [ 22	1	1100188	0 ]
 parent [ 6741924 ] : 6751998 0.0145349 (=25/(40*43)) 98.7688
  given [ 6741924 ] : 6741924 0.020362 (=9/(17*26)) 97.9638
	best keyword for cluster 6741924 is PF02952 with Jaccard = 0.9565	[ 22	1	1100188	0 ]	0.9565 1.0000
sibling [ 6741924 ] : 6721528 0.0769231 (=3/(1*39)) 95.641
	best keyword for cluster 6721528 is PF02610 with Jaccard = 1.0000	[ 34	0	1100177	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF02952 ( PF02952   L-fucose isomerase, C-terminal domain )
	B> PF02610 ( PF02610   L-arabinose isomerase )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	both PF02952 and PF02610 have PDB structures
	PF02952 	   b.43.2.1
	
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 428 ) 6700492_PF03344_PF05361            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF05361  is 6593454 with Jaccard = 0.9565	|PF05361|=23 [ 22	0	1100188	1 ]
 parent [ 6593454 ] : 6700492 0.0933333 (=42/(25*18)) 92.2648
  given [ 6593454 ] : 6593454 0.434783 (=20/(2*23)) 58.6454
	best keyword for cluster 6593454 is PF05361 with Jaccard = 0.9565	[ 22	0	1100188	1 ]	1.0000 0.9565
sibling [ 6593454 ] : 6678883 0.133333 (=6/(3*15)) 87.9044
	best keyword for cluster 6678883 is PF03344 with Jaccard = 0.8571	[ 12	1	1100197	1 ]	0.9231 0.9231

	SUGGESTING RELATEDNESS OF:   
	A> PF05361 ( PF05361   PKC-activated protein phosphatase-1 inhibitor )
	B> PF03344 ( PF03344   Daxx Family )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF05361 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:
     1	PF05361 SSF81790 0.754 (average over 44 mutual instances, PF05361 46 appearances, SSF81790 47 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 429 ) 6739204_PF01488_PF02423            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02423  is 6690153 with Jaccard = 0.9563	|PF02423|=201 [ 197	5	1100005	4 ]
 parent [ 6690153 ] : 6739204 0.0290937 (=4466/(656*234)) 97.703
  given [ 6690153 ] : 6690153 0.103261 (=95/(4*230)) 90.2904
	best keyword for cluster 6690153 is PF02423 with Jaccard = 0.9563	[ 197	5	1100005	4 ]	0.9752 0.9801
sibling [ 6690153 ] : 6648777 0.240202 (=23535/(426*230)) 79.6281
	best keyword for cluster 6648777 is PF01488 with Jaccard = 0.9092	[ 571	16	1099583	41 ]	0.9727 0.9330

	SUGGESTING RELATEDNESS OF:   
	A> PF02423 ( PF02423   Ornithine cyclodeaminase/mu-crystallin family )
	B> PF01488 ( PF01488   Shikimate / quinate 5-dehydrogenase )

	they come from the same clan: CL0063.17 : PF03721	PF04820	PF02254	PF00899	PF01946	PF02882	PF01488	PF01118	PF08491	PF03435	PF04321	PF07992	PF00070	PF02719	PF02153	PF02423	PF05368	PF01210	PF07994	PF07993	PF03447	PF03446	PF01225	PF06039	PF01232	PF03949	PF05834	PF00056	PF08659	PF07991	PF03486	PF00044	PF00732	PF01134	PF01408	PF00996	PF00479	PF00743	PF01494	PF00890	PF03807	PF01370	PF00208	PF02670	PF01113	PF01266	PF02629	PF02558	PF01593	PF01262	PF00670	PF00107	PF00106	PF02737	PF01073	PF02826
	the two keywords do not coincide on UniRef90 proteins

	both PF02423 and PF01488 have PDB structures
		
	SUPERFAM mapping significantly overlapping:
     1	PF02423 SSF51735 0.965 (average over 621 mutual instances, PF02423 624 appearances, SSF51735 164772 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 430 ) 6748772_PF04740_PF06013            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF06013  is 6742882 with Jaccard = 0.9560	|PF06013|=91 [ 87	0	1100120	4 ]
 parent [ 6742882 ] : 6748772 0.0160004 (=341/(144*148)) 98.5252
  given [ 6742882 ] : 6742882 0.0239808 (=30/(139*9)) 98.0418
	best keyword for cluster 6742882 is PF06013 with Jaccard = 0.9560	[ 87	0	1100120	4 ]	1.0000 0.9560
sibling [ 6742882 ] : 6737739 0.0250665 (=113/(46*98)) 97.5598
	best keyword for cluster 6737739 is PF04740 with Jaccard = 0.6383	[ 30	16	1100164	1 ]	0.6522 0.9677

	SUGGESTING RELATEDNESS OF:   
	A> PF06013 ( PF06013   Proteins of 100 residues with WXG )
	B> PF04740 ( PF04740   Bacillus transposase protein )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF06013 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 431 ) 6760411_PF05347_PF05882            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF05347  is 6756063 with Jaccard = 0.9549	|PF05347|=132 [ 127	1	1100078	5 ]
 parent [ 6756063 ] : 6760411 0.0103571 (=87/(35*240)) 99.2856
  given [ 6756063 ] : 6756063 0.0149173 (=83/(26*214)) 99.0359
	best keyword for cluster 6756063 is PF05347 with Jaccard = 0.9549	[ 127	1	1100078	5 ]	0.9922 0.9621
sibling [ 6756063 ] : 6746912 0.0294118 (=1/(1*34)) 98.3824
	best keyword for cluster 6746912 is PF05882 with Jaccard = 0.9310	[ 27	1	1100182	1 ]	0.9643 0.9643

	SUGGESTING RELATEDNESS OF:   
	A> PF05347 ( PF05347   Complex 1 protein (LYR family) )
	B> PF05882 ( PF05882   ACN9 family )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF05347 nor PF05882 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:
     1	PF05882 SSF46458 0.685 (average over 1 mutual instances, PF05882 1 appearances, SSF46458 5480 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 432 ) 6697014_PF01042_PF01902            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01042  is 6678666 with Jaccard = 0.9545	|PF01042|=791 [ 755	0	1099420	36 ]
 parent [ 6678666 ] : 6697014 0.0879872 (=8106/(107*861)) 91.7391
  given [ 6678666 ] : 6678666 0.13696 (=2528/(22*839)) 87.8361
	best keyword for cluster 6678666 is PF01042 with Jaccard = 0.9545	[ 755	0	1099420	36 ]	1.0000 0.9545
sibling [ 6678666 ] : 6637247 0.253205 (=79/(104*3)) 76.3247
	best keyword for cluster 6637247 is PF01902 with Jaccard = 0.9083	[ 99	0	1100102	10 ]	1.0000 0.9083

	SUGGESTING RELATEDNESS OF:   
	A> PF01042 ( PF01042   Endoribonuclease L-PSP )
	B> PF01902 ( PF01902   ATP-binding region )

	Only B has a clan ( CL0039.7 ).

	the two keywords coincide on Uniref90 proteins: |PF01042| = 791 ,	 |PF01902| = 109 , 	|PF01042^PF01902| = 23	( 2.9% and 21.1% )

	both PF01042 and PF01902 have PDB structures
		PF01902 	   c.26.2.1

	SUPERFAM mapping significantly overlapping:
     1	PF01042 SSF55298 0.919 (average over 2576 mutual instances, PF01042 2591 appearances, SSF55298 2787 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 433 ) 6776393_PF03414_PF04487            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF03414  is 6762969 with Jaccard = 0.9545	|PF03414|=44 [ 42	0	1100167	2 ]
 parent [ 6762969 ] : 6776393 0.00128425 (=6/(73*64)) 99.8882
  given [ 6762969 ] : 6762969 0.00584795 (=6/(54*19)) 99.4152
	best keyword for cluster 6762969 is PF03414 with Jaccard = 0.9545	[ 42	0	1100167	2 ]	1.0000 0.9545
sibling [ 6762969 ] : 6764583 0.00651042 (=5/(16*48)) 99.4927
	best keyword for cluster 6764583 is PF04487 with Jaccard = 0.9474	[ 18	1	1100192	0 ]	0.9474 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF03414 ( PF03414   Glycosyltransferase family 6 )
	B> PF04487 ( PF04487   CITED )

	Only A has a clan ( CL0110.6 ).
	the two keywords do not coincide on UniRef90 proteins

	both PF03414 and PF04487 have PDB structures
	PF03414 	   c.68.1.9
	PF04487 	   j.96.1.1

	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 434 ) 6731040_PF06542_PF06879            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF06879  is 6616647 with Jaccard = 0.9524	|PF06879|=21 [ 20	0	1100190	1 ]
 parent [ 6616647 ] : 6731040 0.0396975 (=21/(23*23)) 96.8383
  given [ 6616647 ] : 6616647 0.333333 (=30/(18*5)) 69.067
	best keyword for cluster 6616647 is PF06879 with Jaccard = 0.9524	[ 20	0	1100190	1 ]	1.0000 0.9524
sibling [ 6616647 ] : 6692792 0.0916667 (=11/(8*15)) 90.8337
	best keyword for cluster 6692792 is PF06542 with Jaccard = 0.8571	[ 12	0	1100197	2 ]	1.0000 0.8571

	SUGGESTING RELATEDNESS OF:   
	A> PF06879 ( PF06879   Protein of unknown function (DUF1261) )
	B> PF06542 ( PF06542   Protein of unknown function (DUF1114) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF06879 nor PF06542 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 435 ) 6718526_PF07297_PF08510            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF07297  is 6615336 with Jaccard = 0.9524	|PF07297|=21 [ 20	0	1100190	1 ]
 parent [ 6615336 ] : 6718526 0.06375 (=51/(32*25)) 95.2182
  given [ 6615336 ] : 6615336 0.326087 (=15/(23*2)) 68.5608
	best keyword for cluster 6615336 is PF07297 with Jaccard = 0.9524	[ 20	0	1100190	1 ]	1.0000 0.9524
sibling [ 6615336 ] : 6605222 0.366667 (=22/(2*30)) 64.0784
	best keyword for cluster 6605222 is PF08510 with Jaccard = 0.8000	[ 28	0	1100176	7 ]	1.0000 0.8000

	SUGGESTING RELATEDNESS OF:   
	A> PF07297 ( PF07297   Dolichol phosphate-mannose biosynthesis regulatory protein (DPM2) )
	B> PF08510 ( PF08510   PIG-P )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF07297 nor PF08510 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 436 ) 6762703_PF01862_PF07357            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF07357  is 6043261 with Jaccard = 0.9524	|PF07357|=21 [ 20	0	1100190	1 ]
 parent [ 6043261 ] : 6762703 0.00765306 (=9/(21*56)) 99.404
  given [ 6043261 ] : 6043261 1 (=104/(8*13)) 2.30973e-30
	best keyword for cluster 6043261 is PF07357 with Jaccard = 0.9524	[ 20	0	1100190	1 ]	1.0000 0.9524
sibling [ 6043261 ] : 6756740 0.0132576 (=7/(44*12)) 99.0752
	best keyword for cluster 6756740 is PF01862 with Jaccard = 0.9737	[ 37	1	1100173	0 ]	0.9737 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF07357 ( PF07357   Dinitrogenase reductase ADP-ribosyltransferase (DRAT) )
	B> PF01862 ( PF01862   Pyruvoyl-dependent arginine decarboxylase (PvlArgDC) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF07357 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:
     1	PF01862 SSF56271 0.943 (average over 81 mutual instances, PF01862 81 appearances, SSF56271 104 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 437 ) 6722479_PF02453_PF07234            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02453  is 6622261 with Jaccard = 0.9521	|PF02453|=166 [ 159	1	1100044	7 ]
 parent [ 6622261 ] : 6722479 0.0706494 (=136/(175*11)) 95.7839
  given [ 6622261 ] : 6622261 0.310909 (=513/(10*165)) 71.4791
	best keyword for cluster 6622261 is PF02453 with Jaccard = 0.9521	[ 159	1	1100044	7 ]	0.9938 0.9578
sibling [ 6622261 ] : 6669678 0.25 (=6/(8*3)) 85.4167
	best keyword for cluster 6669678 is PF07234 with Jaccard = 1.0000	[ 7	0	1100204	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF02453 ( PF02453   Reticulon )
	B> PF07234 ( PF07234   Protein of unknown function (DUF1426) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF02453 nor PF07234 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 438 ) 6772698_PF01974_PF06315            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01974  is 6748618 with Jaccard = 0.9515	|PF01974|=103 [ 98	0	1100108	5 ]
 parent [ 6748618 ] : 6772698 0.00242234 (=17/(121*58)) 99.7962
  given [ 6748618 ] : 6748618 0.0200501 (=16/(114*7)) 98.5119
	best keyword for cluster 6748618 is PF01974 with Jaccard = 0.9515	[ 98	0	1100108	5 ]	1.0000 0.9515
sibling [ 6748618 ] : 6768061 0.00555556 (=4/(40*18)) 99.6414
	best keyword for cluster 6768061 is PF06315 with Jaccard = 0.9667	[ 29	1	1100181	0 ]	0.9667 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF01974 ( PF01974   tRNA intron endonuclease, catalytic C-terminal domain )
	B> PF06315 ( PF06315   Isocitrate dehydrogenase kinase/phosphatase (AceK) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF01974 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:
     1	PF01974 SSF53032 0.905 (average over 199 mutual instances, PF01974 259 appearances, SSF53032 237 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 439 ) 6746649_PF01920_PF02996            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02996  is 6695281 with Jaccard = 0.9515	|PF02996|=164 [ 157	1	1100046	7 ]
 parent [ 6695281 ] : 6746649 0.0216861 (=729/(176*191)) 98.3616
  given [ 6695281 ] : 6695281 0.0997721 (=613/(48*128)) 91.3885
	best keyword for cluster 6695281 is PF02996 with Jaccard = 0.9515	[ 157	1	1100046	7 ]	0.9937 0.9573
sibling [ 6695281 ] : 6714974 0.065107 (=487/(136*55)) 94.716
	best keyword for cluster 6714974 is PF01920 with Jaccard = 0.9382	[ 167	0	1100033	11 ]	1.0000 0.9382

	SUGGESTING RELATEDNESS OF:   
	A> PF02996 ( PF02996   Prefoldin subunit )
	B> PF01920 ( PF01920   Prefoldin subunit )

	they come from the same clan: CL0200.5 : PF01920	PF02996
	the two keywords do not coincide on UniRef90 proteins

	both PF02996 and PF01920 have PDB structures
		
	SUPERFAM mapping significantly overlapping:
     1	PF02996 SSF46579 0.892 (average over 327 mutual instances, PF02996 330 appearances, SSF46579 930 appearances)
     2	PF01920 SSF46579 0.944 (average over 357 mutual instances, PF01920 366 appearances, SSF46579 930 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 440 ) 6749744_PF03332_PF07851            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF03332  is 6529894 with Jaccard = 0.9508	|PF03332|=58 [ 58	3	1100150	0 ]
 parent [ 6529894 ] : 6749744 0.021585 (=70/(69*47)) 98.6006
  given [ 6529894 ] : 6529894 0.753731 (=101/(2*67)) 25.8978
	best keyword for cluster 6529894 is PF03332 with Jaccard = 0.9508	[ 58	3	1100150	0 ]	0.9508 1.0000
sibling [ 6529894 ] : 6744182 0.0189189 (=7/(10*37)) 98.1625
	best keyword for cluster 6744182 is PF07851 with Jaccard = 1.0000	[ 20	0	1100191	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF03332 ( PF03332   Eukaryotic phosphomannomutase )
	B> PF07851 ( PF07851   TMPIT-like protein )

	Only A has a clan ( CL0137.9 ).
	the two keywords do not coincide on UniRef90 proteins

	only  PF03332 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 441 ) 6546863_PF02958_PF07914            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02958  is 6532919 with Jaccard = 0.9504	|PF02958|=121 [ 115	0	1100090	6 ]
 parent [ 6532919 ] : 6546863 0.671008 (=3194/(40*119)) 36.5246
  given [ 6532919 ] : 6532919 0.771186 (=91/(1*118)) 27.6027
	best keyword for cluster 6532919 is PF02958 with Jaccard = 0.9504	[ 115	0	1100090	6 ]	1.0000 0.9504
sibling [ 6532919 ] : 6509948 0.897436 (=35/(1*39)) 15.3955
	best keyword for cluster 6509948 is PF07914 with Jaccard = 0.9310	[ 27	1	1100182	1 ]	0.9643 0.9643

	SUGGESTING RELATEDNESS OF:   
	A> PF02958 ( PF02958   Domain of unknown function (DUF227) )
	B> PF07914 ( PF07914   Protein of unknown function (DUF1679) )

	they come from the same clan: CL0016.14 : PF07714	PF00069	PF06293	PF03881	PF02958	PF07914	PF01633	PF04655	PF01636	PF03109	PF05445	PF01163	PF06176
	the two keywords do not coincide on UniRef90 proteins

	Neither PF02958 nor PF07914 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:
     1	PF07914 SSF56112 0.516 (average over 46 mutual instances, PF07914 47 appearances, SSF56112 66637 appearances)
     2	PF02958 SSF56112 0.668 (average over 283 mutual instances, PF02958 288 appearances, SSF56112 66637 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 442 ) 6448201_PF01018_PF06071            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01018  is 6439767 with Jaccard = 0.9502	|PF01018|=299 [ 286	2	1099910	13 ]
 parent [ 6439767 ] : 6448201 0.989609 (=100382/(321*316)) 1.18297
  given [ 6439767 ] : 6439767 0.993631 (=624/(2*314)) 0.64166
	best keyword for cluster 6439767 is PF01018 with Jaccard = 0.9502	[ 286	2	1099910	13 ]	0.9931 0.9565
sibling [ 6439767 ] : 6384589 1 (=13328/(272*49)) 0.000979999
	best keyword for cluster 6384589 is PF06071 with Jaccard = 0.7848	[ 248	52	1099895	16 ]	0.8267 0.9394

	SUGGESTING RELATEDNESS OF:   
	A> PF01018 ( PF01018   GTP1/OBG )
	B> PF06071 ( PF06071   Protein of unknown function (DUF933) )

	Only B has a clan ( CL0072.14 ).
	the two keywords do not coincide on UniRef90 proteins

	both PF01018 and PF06071 have PDB structures
	PF01018 	   b.117.1.1
	PF06071 	   d.15.10.2

	SUPERFAM mapping significantly overlapping:
     1	PF06071 SSF81271 0.987 (average over 918 mutual instances, PF06071 920 appearances, SSF81271 8501 appearances)
     2	PF01018 SSF82051 0.983 (average over 948 mutual instances, PF01018 1220 appearances, SSF82051 2144 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 443 ) 6603163_PF00894_PF01690            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01690  is 6170440 with Jaccard = 0.9500	|PF01690|=20 [ 19	0	1100191	1 ]
 parent [ 6170440 ] : 6603163 0.368421 (=91/(19*13)) 63.1579
  given [ 6170440 ] : 6170440 1 (=48/(3*16)) 1.03942e-19
	best keyword for cluster 6170440 is PF01690 with Jaccard = 0.9500	[ 19	0	1100191	1 ]	1.0000 0.9500
sibling [ 6170440 ] : 6209349 1 (=30/(3*10)) 1.6738e-16
	best keyword for cluster 6209349 is PF00894 with Jaccard = 0.6500	[ 13	0	1100191	7 ]	1.0000 0.6500

	SUGGESTING RELATEDNESS OF:   
	A> PF01690 ( PF01690   Potato leaf roll virus readthrough protein )
	B> PF00894 ( PF00894   Luteovirus coat protein )

	Neither A nor B are assigned a clan.

	the two keywords coincide on UniRef90 proteins: |PF00894| = 20 ,	 |PF01690| = 20 , 	|PF00894^PF01690| = 7	( 35.0% and 35.0% )

	Neither PF01690 nor PF00894 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 444 ) 5389938_PF05379_PF05413            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF05379  is 5165011 with Jaccard = 0.9500	|PF05379|=20 [ 19	0	1100191	1 ]
 parent [ 5165011 ] : 5389938 1 (=140/(20*7)) 8.01509e-118
  given [ 5165011 ] : 5165011 1 (=19/(1*19)) 0
	best keyword for cluster 5165011 is PF05379 with Jaccard = 0.9500	[ 19	0	1100191	1 ]	1.0000 0.9500
sibling [ 5165011 ] : 5311096 1 (=6/(1*6)) 1.66672e-137
	best keyword for cluster 5311096 is PF05413 with Jaccard = 0.8571	[ 6	1	1100204	0 ]	0.8571 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF05379 ( PF05379   Carlavirus endopeptidase )
	B> PF05413 ( PF05413   Putative closterovirus papain-like endopeptidase )

	Only A has a clan ( CL0125.9 ).
	the two keywords do not coincide on UniRef90 proteins

	Neither PF05379 nor PF05413 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 445 ) 6715876_PF00430_PF01991            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01991  is 6654698 with Jaccard = 0.9492	|PF01991|=118 [ 112	0	1100093	6 ]
 parent [ 6654698 ] : 6715876 0.0651359 (=6397/(122*805)) 94.8622
  given [ 6654698 ] : 6654698 0.208547 (=122/(5*117)) 81.624
	best keyword for cluster 6654698 is PF01991 with Jaccard = 0.9492	[ 112	0	1100093	6 ]	1.0000 0.9492
sibling [ 6654698 ] : 6713393 0.0744906 (=3583/(65*740)) 94.4672
	best keyword for cluster 6713393 is PF00430 with Jaccard = 0.6472	[ 387	204	1099613	7 ]	0.6548 0.9822

	SUGGESTING RELATEDNESS OF:   
	A> PF01991 ( PF01991   ATP synthase (E/31 kDa) subunit )
	B> PF00430 ( PF00430   ATP synthase B/B' CF(0) )

	Only B has a clan ( CL0255.4 ).
	the two keywords do not coincide on UniRef90 proteins

	only  PF01991 has a PDB structure (may not be up to date)
		PF00430 	   f.23.21.1  j.35.1.1

	SUPERFAM mapping significantly overlapping:
     1	PF00430 SSF82607 0.684 (average over 1 mutual instances, PF00430 10 appearances, SSF82607 761 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 446 ) 6689485_PF04032_PF08296            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04032  is 6550824 with Jaccard = 0.9492	|PF04032|=59 [ 56	0	1100152	3 ]
 parent [ 6550824 ] : 6689485 0.132254 (=98/(57*13)) 90.1608
  given [ 6550824 ] : 6550824 0.648883 (=523/(26*31)) 39.5374
	best keyword for cluster 6550824 is PF04032 with Jaccard = 0.9492	[ 56	0	1100152	3 ]	1.0000 0.9492
sibling [ 6550824 ] : 6658361 0.190476 (=8/(7*6)) 82.8293
	best keyword for cluster 6658361 is PF08296 with Jaccard = 0.6364	[ 7	0	1100200	4 ]	1.0000 0.6364

	SUGGESTING RELATEDNESS OF:   
	A> PF04032 ( PF04032   RNAse P Rpr2/Rpp21/SNM1 subunit domain )
	B> PF08296 (  )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF04032 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 447 ) 6733306_PF01888_PF02571            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02571  is 6497012 with Jaccard = 0.9490	|PF02571|=98 [ 93	0	1100113	5 ]
 parent [ 6497012 ] : 6733306 0.029596 (=293/(100*99)) 97.0857
  given [ 6497012 ] : 6497012 0.923913 (=680/(8*92)) 10.3702
	best keyword for cluster 6497012 is PF02571 with Jaccard = 0.9490	[ 93	0	1100113	5 ]	1.0000 0.9490
sibling [ 6497012 ] : 6731849 0.0408163 (=4/(1*98)) 96.9235
	best keyword for cluster 6731849 is PF01888 with Jaccard = 0.9894	[ 93	0	1100117	1 ]	1.0000 0.9894

	SUGGESTING RELATEDNESS OF:   
	A> PF02571 ( PF02571   Precorrin-6x reductase CbiJ/CobK )
	B> PF01888 ( PF01888   CbiD )

	Neither A nor B are assigned a clan.

	the two keywords coincide on UniRef90 proteins: |PF01888| = 94 ,	 |PF02571| = 98 , 	|PF01888^PF02571| = 3	( 3.2% and 3.1% )

	only  PF02571 has a PDB structure (may not be up to date)
		PF01888 	   e.54.1.1

	SUPERFAM mapping significantly overlapping:
     1	PF01888 SSF111342 0.885 (average over 277 mutual instances, PF01888 277 appearances, SSF111342 288 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 448 ) 6736552_PF01052_PF04509            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01052  is 6700713 with Jaccard = 0.9489	|PF01052|=354 [ 353	18	1099839	1 ]
 parent [ 6700713 ] : 6736552 0.0275438 (=2237/(432*188)) 97.4309
  given [ 6700713 ] : 6700713 0.0790297 (=202/(6*426)) 92.3035
	best keyword for cluster 6700713 is PF01052 with Jaccard = 0.9489	[ 353	18	1099839	1 ]	0.9515 0.9972
sibling [ 6700713 ] : 6711572 0.0717665 (=622/(81*107)) 94.1761
	best keyword for cluster 6711572 is PF04509 with Jaccard = 0.7667	[ 115	0	1100061	35 ]	1.0000 0.7667

	SUGGESTING RELATEDNESS OF:   
	A> PF01052 ( PF01052   Surface presentation of antigens (SPOA) protein )
	B> PF04509 ( PF04509   CheC-like family )

	Neither A nor B are assigned a clan.

	the two keywords coincide on UniRef90 proteins: |PF01052| = 354 ,	 |PF04509| = 150 , 	|PF01052^PF04509| = 27	( 7.6% and 18.0% )

	both PF01052 and PF04509 have PDB structures
		PF04509 	   d.252.1.1

	SUPERFAM mapping significantly overlapping:
     1	PF01052 SSF101801 0.925 (average over 1196 mutual instances, PF01052 1196 appearances, SSF101801 1677 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 449 ) 6553137_PF00154_PF08423            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00154  is 6525584 with Jaccard = 0.9488	|PF00154|=332 [ 315	0	1099879	17 ]
 parent [ 6525584 ] : 6553137 0.618222 (=49616/(352*228)) 41.2553
  given [ 6525584 ] : 6525584 0.776627 (=3675/(14*338)) 23.2444
	best keyword for cluster 6525584 is PF00154 with Jaccard = 0.9488	[ 315	0	1099879	17 ]	1.0000 0.9488
sibling [ 6525584 ] : 6539891 0.692222 (=2483/(17*211)) 32.3724
	best keyword for cluster 6539891 is PF08423 with Jaccard = 0.9529	[ 182	8	1100020	1 ]	0.9579 0.9945

	SUGGESTING RELATEDNESS OF:   
	A> PF00154 ( PF00154   recA bacterial DNA recombination protein )
	B> PF08423 ( PF08423   Rad51 )

	they come from the same clan: CL0216.4 : PF08423	PF00154

	the two keywords coincide on UniRef90 proteins: |PF00154| = 332 ,	 |PF08423| = 183 , 	|PF00154^PF08423| = 9	( 2.7% and 4.9% )

	both PF00154 and PF08423 have PDB structures
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 450 ) 6781262_PF02452_PF02495            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02495  is 6778505 with Jaccard = 0.9487	|PF02495|=77 [ 74	1	1100133	3 ]
 parent [ 6778505 ] : 6781262 0.000358262 (=28/(203*385)) 99.9707
  given [ 6778505 ] : 6778505 0.00136293 (=14/(96*107)) 99.9292
	best keyword for cluster 6778505 is PF02495 with Jaccard = 0.9487	[ 74	1	1100133	3 ]	0.9867 0.9610
sibling [ 6778505 ] : 6780292 0.00260417 (=1/(1*384)) 99.9583
	best keyword for cluster 6780292 is PF02452 with Jaccard = 0.9803	[ 149	3	1100059	0 ]	0.9803 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF02495 ( PF02495   7kD viral coat protein )
	B> PF02452 ( PF02452   PemK-like protein )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF02495 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:
     1	PF02452 SSF50118 0.943 (average over 466 mutual instances, PF02452 466 appearances, SSF50118 511 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 451 ) 6769850_PF03498_PF06680            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF03498  is 6746534 with Jaccard = 0.9474	|PF03498|=19 [ 18	0	1100192	1 ]
 parent [ 6746534 ] : 6769850 0.00357143 (=3/(40*21)) 99.706
  given [ 6746534 ] : 6746534 0.0204604 (=8/(17*23)) 98.3524
	best keyword for cluster 6746534 is PF03498 with Jaccard = 0.9474	[ 18	0	1100192	1 ]	1.0000 0.9474
sibling [ 6746534 ] : 6758061 0.00909091 (=1/(10*11)) 99.1573
	best keyword for cluster 6758061 is PF06680 with Jaccard = 1.0000	[ 2	0	1100209	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF03498 ( PF03498   Cytolethal distending toxin A/C family )
	B> PF06680 ( PF06680   Protein of unknown function (DUF1181) )

	Only A has a clan ( CL0066.9 ).
	the two keywords do not coincide on UniRef90 proteins

	only  PF03498 has a PDB structure (may not be up to date)
	PF03498 	   b.42.2.1
	
	SUPERFAM mapping significantly overlapping:
     1	PF03498 SSF50370 0.866 (average over 96 mutual instances, PF03498 97 appearances, SSF50370 1691 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 452 ) 6685473_PF01947_PF04482            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04482  is 6392012 with Jaccard = 0.9474	|PF04482|=18 [ 18	1	1100192	0 ]
 parent [ 6392012 ] : 6685473 0.12963 (=49/(21*18)) 89.3686
  given [ 6392012 ] : 6392012 1 (=38/(2*19)) 0.00283734
	best keyword for cluster 6392012 is PF04482 with Jaccard = 0.9474	[ 18	1	1100192	0 ]	0.9474 1.0000
sibling [ 6392012 ] : 6643837 0.222222 (=16/(12*6)) 78.1124
	best keyword for cluster 6643837 is PF01947 with Jaccard = 0.7500	[ 12	0	1100195	4 ]	1.0000 0.7500

	SUGGESTING RELATEDNESS OF:   
	A> PF04482 ( PF04482   Protein of unknown function (DUF564) )
	B> PF01947 ( PF01947   Protein of unknown function DUF98 )

	Only B has a clan ( CL0122.6 ).

	the two keywords coincide on UniRef90 proteins: |PF01947| = 16 ,	 |PF04482| = 18 , 	|PF01947^PF04482| = 1	( 6.2% and 5.6% )

	Neither PF04482 nor PF01947 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 453 ) 6732698_PF06937_PF07763            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF07763  is 6710207 with Jaccard = 0.9474	|PF07763|=19 [ 18	0	1100192	1 ]
 parent [ 6710207 ] : 6732698 0.0350877 (=16/(12*38)) 97.0156
  given [ 6710207 ] : 6710207 0.0727273 (=12/(33*5)) 93.9706
	best keyword for cluster 6710207 is PF07763 with Jaccard = 0.9474	[ 18	0	1100192	1 ]	1.0000 0.9474
sibling [ 6710207 ] : 6706399 0.0857143 (=3/(7*5)) 93.3657
	best keyword for cluster 6706399 is PF06937 with Jaccard = 1.0000	[ 7	0	1100204	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF07763 ( PF07763   FEZ-like protein )
	B> PF06937 ( PF06937   EURL protein )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF07763 nor PF06937 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 454 ) 6709457_PF02525_PF03358            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02525  is 6697772 with Jaccard = 0.9470	|PF02525|=395 [ 375	1	1099815	20 ]
 parent [ 6697772 ] : 6709457 0.0851348 (=14733/(417*415)) 93.8555
  given [ 6697772 ] : 6697772 0.0871671 (=72/(2*413)) 91.8466
	best keyword for cluster 6697772 is PF02525 with Jaccard = 0.9470	[ 375	1	1099815	20 ]	0.9973 0.9494
sibling [ 6697772 ] : 6621507 0.33311 (=8455/(74*343)) 71.0028
	best keyword for cluster 6621507 is PF03358 with Jaccard = 0.6543	[ 371	0	1099644	196 ]	1.0000 0.6543

	SUGGESTING RELATEDNESS OF:   
	A> PF02525 ( PF02525   Flavodoxin-like fold )
	B> PF03358 ( PF03358   NADPH-dependent FMN reductase )

	they come from the same clan: CL0042.7 : PF00258	PF02525	PF07972	PF03358
	the two keywords do not coincide on UniRef90 proteins

	both PF02525 and PF03358 have PDB structures
	PF02525 	   c.23.5.3
	PF03358 	   c.23.5.4  c.23.5.6

	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 455 ) 6757287_PF00293_PF06381            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00293  is 6755581 with Jaccard = 0.9465	|PF00293|=2744 [ 2602	5	1097462	142 ]
 parent [ 6755581 ] : 6757287 0.00979154 (=1628/(54*3079)) 99.1089
  given [ 6755581 ] : 6755581 0.0150833 (=1743/(38*3041)) 99.0025
	best keyword for cluster 6755581 is PF00293 with Jaccard = 0.9465	[ 2602	5	1097462	142 ]	0.9981 0.9483
sibling [ 6755581 ] : 6751497 0.0204082 (=5/(5*49)) 98.733
	best keyword for cluster 6751497 is PF06381 with Jaccard = 0.7500	[ 6	2	1100203	0 ]	0.7500 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF00293 ( PF00293   NUDIX domain )
	B> PF06381 ( PF06381   Protein of unknown function (DUF1073) )

	Only A has a clan ( CL0261.2 ).
	the two keywords do not coincide on UniRef90 proteins

	only  PF00293 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:
     1	PF00293 SSF55811 0.818 (average over 8148 mutual instances, PF00293 8350 appearances, SSF55811 10363 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 456 ) 6715444_PF00005_PF06792            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF06792  is 6532908 with Jaccard = 0.9459	|PF06792|=37 [ 35	0	1100174	2 ]
 parent [ 6532908 ] : 6715444 0.0524778 (=39020/(37*20096)) 94.7958
  given [ 6532908 ] : 6532908 0.777778 (=28/(1*36)) 27.5943
	best keyword for cluster 6532908 is PF06792 with Jaccard = 0.9459	[ 35	0	1100174	2 ]	1.0000 0.9459
sibling [ 6532908 ] : 6713034 0.0696297 (=5596/(4*20092)) 94.4116
	best keyword for cluster 6713034 is PF00005 with Jaccard = 0.9910	[ 18190	107	1081855	59 ]	0.9942 0.9968

	SUGGESTING RELATEDNESS OF:   
	A> PF06792 ( PF06792   Uncharacterised protein family (UPF0261) )
	B> PF00005 ( PF00005   ABC transporter )

	Only B has a clan ( CL0023.26 ).

	the two keywords coincide on UniRef90 proteins: |PF00005| = 18249 ,	 |PF06792| = 37 , 	|PF00005^PF06792| = 2	( 0.0% and 5.4% )

	only  PF06792 has a PDB structure (may not be up to date)
		PF00005 	   c.37.1.12  j.35.1.1

	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 457 ) 6683032_PF00326_PF07676            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00326  is 6658985 with Jaccard = 0.9455	|PF00326|=696 [ 676	19	1099496	20 ]
 parent [ 6658985 ] : 6683032 0.127578 (=40103/(780*403)) 88.9462
  given [ 6658985 ] : 6658985 0.179587 (=834/(774*6)) 83.0057
	best keyword for cluster 6658985 is PF00326 with Jaccard = 0.9455	[ 676	19	1099496	20 ]	0.9727 0.9713
sibling [ 6658985 ] : 6679084 0.148423 (=640/(11*392)) 87.9687
	best keyword for cluster 6679084 is PF07676 with Jaccard = 0.6368	[ 249	20	1099820	122 ]	0.9257 0.6712

	SUGGESTING RELATEDNESS OF:   
	A> PF00326 ( PF00326   Prolyl oligopeptidase family )
	B> PF07676 ( PF07676   WD40-like Beta Propeller Repeat )

	A and B come from different clans ( CL0028.14 , CL0186.8 ).

	the two keywords coincide on UniRef90 proteins: |PF00326| = 696 ,	 |PF07676| = 371 , 	|PF00326^PF07676| = 60	( 8.6% and 16.2% )

	both PF00326 and PF07676 have PDB structures
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 458 ) 6751762_PF01564_PF02675            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01564  is 6745131 with Jaccard = 0.9449	|PF01564|=263 [ 257	9	1099939	6 ]
 parent [ 6745131 ] : 6751762 0.0126661 (=734/(122*475)) 98.7514
  given [ 6745131 ] : 6745131 0.0194805 (=117/(13*462)) 98.2445
	best keyword for cluster 6745131 is PF01564 with Jaccard = 0.9449	[ 257	9	1099939	6 ]	0.9662 0.9772
sibling [ 6745131 ] : 6644380 0.279167 (=67/(2*120)) 78.2652
	best keyword for cluster 6644380 is PF02675 with Jaccard = 0.9652	[ 111	0	1100096	4 ]	1.0000 0.9652

	SUGGESTING RELATEDNESS OF:   
	A> PF01564 ( PF01564   Spermine/spermidine synthase )
	B> PF02675 ( PF02675   S-adenosylmethionine decarboxylase )

	Only A has a clan ( CL0102.14 ).

	the two keywords coincide on UniRef90 proteins: |PF01564| = 263 ,	 |PF02675| = 115 , 	|PF01564^PF02675| = 4	( 1.5% and 3.5% )

	both PF01564 and PF02675 have PDB structures
		PF02675 	   d.156.1.2

	SUPERFAM mapping significantly overlapping:
     1	PF02675 SSF56276 0.924 (average over 324 mutual instances, PF02675 324 appearances, SSF56276 602 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 459 ) 6538599_PF02471_PF06780            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02471  is 5969585 with Jaccard = 0.9444	|PF02471|=18 [ 17	0	1100193	1 ]
 parent [ 5969585 ] : 6538599 0.71267 (=315/(17*26)) 31.2588
  given [ 5969585 ] : 5969585 1 (=70/(10*7)) 5.15297e-37
	best keyword for cluster 5969585 is PF02471 with Jaccard = 0.9444	[ 17	0	1100193	1 ]	1.0000 0.9444
sibling [ 5969585 ] : 6523733 0.791667 (=133/(14*12)) 22.0105
	best keyword for cluster 6523733 is PF06780 with Jaccard = 1.0000	[ 14	0	1100197	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF02471 ( PF02471   Borrelia outer surface protein E )
	B> PF06780 ( PF06780   Erp protein C-terminus )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF02471 nor PF06780 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 460 ) 6767234_PF05339_PF07865            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF07865  is 6627778 with Jaccard = 0.9444	|PF07865|=17 [ 17	1	1100193	0 ]
 parent [ 6627778 ] : 6767234 0.00396825 (=2/(21*24)) 99.6064
  given [ 6627778 ] : 6627778 0.288889 (=26/(15*6)) 73.651
	best keyword for cluster 6627778 is PF07865 with Jaccard = 0.9444	[ 17	1	1100193	0 ]	0.9444 1.0000
sibling [ 6627778 ] : 6749976 0.0234375 (=3/(16*8)) 98.6172
	best keyword for cluster 6749976 is PF05339 with Jaccard = 0.9000	[ 9	1	1100201	0 ]	0.9000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF07865 ( PF07865   Protein of unknown function (DUF1652) )
	B> PF05339 ( PF05339   Protein of unknown function (DUF739) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF07865 nor PF05339 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 461 ) 6725933_PF02876_PF03642            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02876  is 6686928 with Jaccard = 0.9429	|PF02876|=69 [ 66	1	1100141	3 ]
 parent [ 6686928 ] : 6725933 0.0483721 (=104/(86*25)) 96.225
  given [ 6686928 ] : 6686928 0.118902 (=39/(82*4)) 89.6319
	best keyword for cluster 6686928 is PF02876 with Jaccard = 0.9429	[ 66	1	1100141	3 ]	0.9851 0.9565
sibling [ 6686928 ] : 6713745 0.0701754 (=8/(6*19)) 94.514
	best keyword for cluster 6713745 is PF03642 with Jaccard = 0.8571	[ 6	1	1100204	0 ]	0.8571 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF02876 ( PF02876   Staphylococcal/Streptococcal toxin, beta-grasp domain )
	B> PF03642 ( PF03642   MAP domain )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	both PF02876 and PF03642 have PDB structures
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 462 ) 6718967_PF00957_PF08366            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00957  is 6714576 with Jaccard = 0.9419	|PF00957|=257 [ 243	1	1099953	14 ]
 parent [ 6714576 ] : 6718967 0.0633997 (=1308/(69*299)) 95.2742
  given [ 6714576 ] : 6714576 0.0604027 (=18/(1*298)) 94.6645
	best keyword for cluster 6714576 is PF00957 with Jaccard = 0.9419	[ 243	1	1099953	14 ]	0.9959 0.9455
sibling [ 6714576 ] : 6666944 0.166667 (=90/(60*9)) 84.7003
	best keyword for cluster 6666944 is PF08366 with Jaccard = 0.6047	[ 26	17	1100168	0 ]	0.6047 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF00957 ( PF00957   Synaptobrevin )
	B> PF08366 ( PF08366   LLGL2 )

	Neither A nor B are assigned a clan.

	the two keywords coincide on Uniref90 proteins: |PF00957| = 257 ,	 |PF08366| = 26 , 	|PF00957^PF08366| = 4	( 1.6% and 15.4% )

	only  PF00957 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 463 ) 6771350_PF01231_PF01648            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01648  is 6742879 with Jaccard = 0.9413	|PF01648|=493 [ 465	1	1099717	28 ]
 parent [ 6742879 ] : 6771350 0.00261378 (=136/(542*96)) 99.7551
  given [ 6742879 ] : 6742879 0.0265441 (=823/(65*477)) 98.0414
	best keyword for cluster 6742879 is PF01648 with Jaccard = 0.9413	[ 465	1	1099717	28 ]	0.9979 0.9432
sibling [ 6742879 ] : 6768552 0.00460526 (=7/(20*76)) 99.6595
	best keyword for cluster 6768552 is PF01231 with Jaccard = 1.0000	[ 40	0	1100171	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF01648 ( PF01648   4'-phosphopantetheinyl transferase superfamily )
	B> PF01231 ( PF01231   Indoleamine 2,3-dioxygenase )

	Neither A nor B are assigned a clan.

	the two keywords coincide on Uniref90 proteins: |PF01231| = 40 ,	 |PF01648| = 493 , 	|PF01231^PF01648| = 1	( 2.5% and 0.2% )

	both PF01648 and PF01231 have PDB structures
	PF01648 	   d.150.1.2
	PF01231 	   a.266.1.2

	SUPERFAM mapping significantly overlapping:
     1	PF01648 SSF56214 0.575 (average over 1410 mutual instances, PF01648 1579 appearances, SSF56214 1581 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 464 ) 6687377_PF04410_PF05492            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF05492  is 6515157 with Jaccard = 0.9412	|PF05492|=34 [ 32	0	1100177	2 ]
 parent [ 6515157 ] : 6687377 0.144828 (=294/(35*58)) 89.7339
  given [ 6515157 ] : 6515157 0.833333 (=55/(2*33)) 17.8184
	best keyword for cluster 6515157 is PF05492 with Jaccard = 0.9412	[ 32	0	1100177	2 ]	1.0000 0.9412
sibling [ 6515157 ] : 6678470 0.169697 (=28/(3*55)) 87.825
	best keyword for cluster 6678470 is PF04410 with Jaccard = 0.9487	[ 37	0	1100172	2 ]	1.0000 0.9487

	SUGGESTING RELATEDNESS OF:   
	A> PF05492 ( PF05492   NAF1 domain )
	B> PF04410 ( PF04410   Gar1 protein RNA binding region )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF05492 nor PF04410 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 465 ) 6713707_PF02274_PF04455            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02274  is 6686623 with Jaccard = 0.9409	|PF02274|=186 [ 175	0	1100025	11 ]
 parent [ 6686623 ] : 6713707 0.0582915 (=406/(35*199)) 94.5032
  given [ 6686623 ] : 6686623 0.129416 (=674/(31*168)) 89.577
	best keyword for cluster 6686623 is PF02274 with Jaccard = 0.9409	[ 175	0	1100025	11 ]	1.0000 0.9409
sibling [ 6686623 ] : 6647182 0.260684 (=61/(9*26)) 79.0054
	best keyword for cluster 6647182 is PF04455 with Jaccard = 0.8000	[ 20	0	1100186	5 ]	1.0000 0.8000

	SUGGESTING RELATEDNESS OF:   
	A> PF02274 ( PF02274   Amidinotransferase )
	B> PF04455 ( PF04455   LOR/SDH bifunctional enzyme conserved region )

	Only A has a clan ( CL0197.5 ).

	the two keywords coincide on Uniref90 proteins: |PF02274| = 186 ,	 |PF04455| = 25 , 	|PF02274^PF04455| = 6	( 3.2% and 24.0% )

	only  PF02274 has a PDB structure (may not be up to date)
	PF02274 	   d.126.1.2  d.126.1.3  d.126.1.4
	
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 466 ) 6756471_PF02606_PF03966            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF03966  is 6670234 with Jaccard = 0.9409	|PF03966|=186 [ 175	0	1100025	11 ]
 parent [ 6670234 ] : 6756471 0.00945526 (=310/(194*169)) 99.0591
  given [ 6670234 ] : 6670234 0.174542 (=1363/(137*57)) 85.5941
	best keyword for cluster 6670234 is PF03966 with Jaccard = 0.9409	[ 175	0	1100025	11 ]	1.0000 0.9409
sibling [ 6670234 ] : 6714182 0.0633947 (=62/(6*163)) 94.5945
	best keyword for cluster 6714182 is PF02606 with Jaccard = 0.9790	[ 140	0	1100068	3 ]	1.0000 0.9790

	SUGGESTING RELATEDNESS OF:   
	A> PF03966 ( PF03966   Trm112p-like protein )
	B> PF02606 ( PF02606   Tetraacyldisaccharide-1-P 4'-kinase )

	Neither A nor B are assigned a clan.

	the two keywords coincide on Uniref90 proteins: |PF02606| = 143 ,	 |PF03966| = 186 , 	|PF02606^PF03966| = 2	( 1.4% and 1.1% )

	Neither PF03966 nor PF02606 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 467 ) 6722022_PF01975_PF03133            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01975  is 6651096 with Jaccard = 0.9406	|PF01975|=219 [ 206	0	1099992	13 ]
 parent [ 6651096 ] : 6722022 0.0432618 (=2772/(233*275)) 95.7095
  given [ 6651096 ] : 6651096 0.227536 (=157/(230*3)) 80.3596
	best keyword for cluster 6651096 is PF01975 with Jaccard = 0.9406	[ 206	0	1099992	13 ]	1.0000 0.9406
sibling [ 6651096 ] : 6716160 0.0748489 (=644/(36*239)) 94.9095
	best keyword for cluster 6716160 is PF03133 with Jaccard = 0.9865	[ 219	2	1099989	1 ]	0.9910 0.9955

	SUGGESTING RELATEDNESS OF:   
	A> PF01975 ( PF01975   Survival protein SurE )
	B> PF03133 ( PF03133   Tubulin-tyrosine ligase family )

	Only B has a clan ( CL0179.8 ).

	the two keywords coincide on Uniref90 proteins: |PF01975| = 219 ,	 |PF03133| = 220 , 	|PF01975^PF03133| = 13	( 5.9% and 5.9% )

	only  PF01975 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:
     1	PF01975 SSF64167 0.747 (average over 681 mutual instances, PF01975 682 appearances, SSF64167 709 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 468 ) 6749295_PF02334_PF03551            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF03551  is 6736329 with Jaccard = 0.9404	|PF03551|=519 [ 489	1	1099691	30 ]
 parent [ 6736329 ] : 6749295 0.0179924 (=114/(11*576)) 98.568
  given [ 6736329 ] : 6736329 0.0315789 (=108/(6*570)) 97.41
	best keyword for cluster 6736329 is PF03551 with Jaccard = 0.9404	[ 489	1	1099691	30 ]	0.9980 0.9422
sibling [ 6736329 ] : 6730232 0.0333333 (=1/(5*6)) 96.75
	best keyword for cluster 6730232 is PF02334 with Jaccard = 0.7500	[ 3	1	1100207	0 ]	0.7500 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF03551 ( PF03551   Transcriptional regulator PadR-like family )
	B> PF02334 ( PF02334   Replication terminator protein )

	Only A has a clan ( CL0123.12 ).
	the two keywords do not coincide on UniRef90 proteins

	both PF03551 and PF02334 have PDB structures
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 469 ) 6747098_PF02469_PF02676            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02469  is 6740345 with Jaccard = 0.9403	|PF02469|=266 [ 252	2	1099943	14 ]
 parent [ 6740345 ] : 6747098 0.0166176 (=226/(40*340)) 98.3971
  given [ 6740345 ] : 6740345 0.0308834 (=179/(18*322)) 97.81
	best keyword for cluster 6740345 is PF02469 with Jaccard = 0.9403	[ 252	2	1099943	14 ]	0.9921 0.9474
sibling [ 6740345 ] : 6516567 0.854396 (=311/(14*26)) 18.4009
	best keyword for cluster 6516567 is PF02676 with Jaccard = 0.8605	[ 37	0	1100168	6 ]	1.0000 0.8605

	SUGGESTING RELATEDNESS OF:   
	A> PF02469 ( PF02469   Fasciclin domain )
	B> PF02676 ( PF02676   TYW3 like )

	Neither A nor B are assigned a clan.

	the two keywords coincide on Uniref90 proteins: |PF02469| = 266 ,	 |PF02676| = 43 , 	|PF02469^PF02676| = 1	( 0.4% and 2.3% )

	both PF02469 and PF02676 have PDB structures
	PF02469 	   b.118.1.1
	
	SUPERFAM mapping significantly overlapping:
     1	PF02676 SSF111278 0.843 (average over 90 mutual instances, PF02676 99 appearances, SSF111278 112 appearances)
     2	PF02469 SSF82153 0.813 (average over 790 mutual instances, PF02469 818 appearances, SSF82153 907 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 470 ) 6744540_PF00866_PF02982            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00866  is 6661533 with Jaccard = 0.9387	|PF00866|=154 [ 153	9	1100048	1 ]
 parent [ 6661533 ] : 6744540 0.0254237 (=819/(182*177)) 98.1969
  given [ 6661533 ] : 6661533 0.201317 (=1345/(131*51)) 83.6086
	best keyword for cluster 6661533 is PF00866 with Jaccard = 0.9387	[ 153	9	1100048	1 ]	0.9444 0.9935
sibling [ 6661533 ] : 6731451 0.0372093 (=32/(5*172)) 96.8798
	best keyword for cluster 6731451 is PF02982 with Jaccard = 0.6250	[ 10	5	1100195	1 ]	0.6667 0.9091

	SUGGESTING RELATEDNESS OF:   
	A> PF00866 ( PF00866   Ring hydroxylating beta subunit )
	B> PF02982 ( PF02982   Scytalone dehydratase )

	they come from the same clan: CL0051.9 : PF02982	PF00866	PF02136	PF05223	PF07858	PF07080	PF08332	PF07366
	the two keywords do not coincide on UniRef90 proteins

	both PF00866 and PF02982 have PDB structures
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 471 ) 6747076_PF01782_PF04139            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01782  is 6656469 with Jaccard = 0.9381	|PF01782|=213 [ 212	13	1099985	1 ]
 parent [ 6656469 ] : 6747076 0.0168115 (=209/(259*48)) 98.3957
  given [ 6656469 ] : 6656469 0.197709 (=397/(251*8)) 82.1879
	best keyword for cluster 6656469 is PF01782 with Jaccard = 0.9381	[ 212	13	1099985	1 ]	0.9422 0.9953
sibling [ 6656469 ] : 6675885 0.149826 (=43/(7*41)) 87.1489
	best keyword for cluster 6675885 is PF04139 with Jaccard = 0.9667	[ 29	1	1100181	0 ]	0.9667 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF01782 ( PF01782   RimM N-terminal domain )
	B> PF04139 ( PF04139   Rad9 )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF01782 nor PF04139 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 472 ) 6628465_PF01647_PF06641            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01647  is 6436642 with Jaccard = 0.9375	|PF01647|=16 [ 15	0	1100195	1 ]
 parent [ 6436642 ] : 6628465 0.274306 (=79/(16*18)) 73.9709
  given [ 6436642 ] : 6436642 1 (=55/(11*5)) 0.504113
	best keyword for cluster 6436642 is PF01647 with Jaccard = 0.9375	[ 15	0	1100195	1 ]	1.0000 0.9375
sibling [ 6436642 ] : 6595671 0.402597 (=31/(11*7)) 59.8519
	best keyword for cluster 6595671 is PF06641 with Jaccard = 0.7143	[ 10	4	1100197	0 ]	0.7143 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF01647 ( PF01647   Morbillivirus RNA polymerase alpha subunit )
	B> PF06641 ( PF06641   Paramyxovirus structural protein V )

	Neither A nor B are assigned a clan.

	the two keywords coincide on Uniref90 proteins: |PF01647| = 16 ,	 |PF06641| = 10 , 	|PF01647^PF06641| = 1	( 6.2% and 10.0% )

	Neither PF01647 nor PF06641 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 473 ) 6639142_PF05040_PF06765            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF05040  is 6473928 with Jaccard = 0.9375	|PF05040|=16 [ 15	0	1100195	1 ]
 parent [ 6473928 ] : 6639142 0.293333 (=198/(15*45)) 76.7733
  given [ 6473928 ] : 6473928 0.96 (=48/(10*5)) 4.42584
	best keyword for cluster 6473928 is PF05040 with Jaccard = 0.9375	[ 15	0	1100195	1 ]	1.0000 0.9375
sibling [ 6473928 ] : 6570610 0.536 (=268/(20*25)) 51.1411
	best keyword for cluster 6570610 is PF06765 with Jaccard = 0.9091	[ 20	2	1100189	0 ]	0.9091 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF05040 ( PF05040   Heparan sulfate 2-O-sulfotransferase (HS2ST) )
	B> PF06765 ( PF06765   Heparan sulfate 6-sulfotransferase (HS6ST) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF05040 nor PF06765 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 474 ) 6735547_PF04041_PF04616            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04616  is 6705404 with Jaccard = 0.9360	|PF04616|=292 [ 278	5	1099914	14 ]
 parent [ 6705404 ] : 6735547 0.0348639 (=1230/(336*105)) 97.33
  given [ 6705404 ] : 6705404 0.0782828 (=155/(6*330)) 93.1751
	best keyword for cluster 6705404 is PF04616 with Jaccard = 0.9360	[ 278	5	1099914	14 ]	0.9823 0.9521
sibling [ 6705404 ] : 6725529 0.0387205 (=23/(6*99)) 96.1743
	best keyword for cluster 6725529 is PF04041 with Jaccard = 0.9104	[ 61	6	1100144	0 ]	0.9104 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF04616 ( PF04616   Glycosyl hydrolases family 43 )
	B> PF04041 ( PF04041   Domain of unknown function (DUF377) )

	they come from the same clan: CL0143.8 : PF03664	PF04616	PF00251	PF04041	PF02435
	the two keywords do not coincide on UniRef90 proteins

	both PF04616 and PF04041 have PDB structures
	PF04616 	   b.67.2.1
	PF04041 	   b.67.2.4

	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 475 ) 6740120_PF00892_PF05653            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00892  is 6738900 with Jaccard = 0.9359	|PF00892|=2414 [ 2308	52	1097745	106 ]
 parent [ 6738900 ] : 6740120 0.030438 (=15310/(179*2810)) 97.7876
  given [ 6738900 ] : 6738900 0.0287992 (=726/(9*2801)) 97.6725
	best keyword for cluster 6738900 is PF00892 with Jaccard = 0.9359	[ 2308	52	1097745	106 ]	0.9780 0.9561
sibling [ 6738900 ] : 6711424 0.0645472 (=67/(6*173)) 94.1592
	best keyword for cluster 6711424 is PF05653 with Jaccard = 0.8814	[ 104	14	1100093	0 ]	0.8814 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF00892 ( PF00892   Integral membrane protein DUF6 )
	B> PF05653 ( PF05653   Protein of unknown function (DUF803) )

	they come from the same clan: CL0184.5 : PF07857	PF04342	PF00892	PF05653	PF06027	PF00893	PF04142	PF06379	PF06800	PF03151	PF08449	PF02694
	the two keywords do not coincide on UniRef90 proteins

	Neither PF00892 nor PF05653 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:
     1	PF05653 SSF103473 0.757 (average over 2 mutual instances, PF05653 3 appearances, SSF103473 39293 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 476 ) 6750215_PF00534_PF05693            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00534  is 6749863 with Jaccard = 0.9342	|PF00534|=3857 [ 3692	95	1096259	165 ]
 parent [ 6749863 ] : 6750215 0.0195931 (=5703/(64*4548)) 98.6336
  given [ 6749863 ] : 6749863 0.0171615 (=4317/(56*4492)) 98.6096
	best keyword for cluster 6749863 is PF00534 with Jaccard = 0.9342	[ 3692	95	1096259	165 ]	0.9749 0.9572
sibling [ 6749863 ] : 6739900 0.0258621 (=9/(6*58)) 97.7667
	best keyword for cluster 6739900 is PF05693 with Jaccard = 0.9123	[ 52	3	1100154	2 ]	0.9455 0.9630

	SUGGESTING RELATEDNESS OF:   
	A> PF00534 ( PF00534   Glycosyl transferases group 1 )
	B> PF05693 ( PF05693   Glycogen synthase )

	they come from the same clan: CL0113.8 : PF06925	PF02684	PF04464	PF04101	PF01075	PF03033	PF00982	PF00534	PF05693	PF02350	PF04007	PF06722	PF05159	PF08660	PF00343	PF00201

	the two keywords coincide on Uniref90 proteins: |PF00534| = 3857 ,	 |PF05693| = 54 , 	|PF00534^PF05693| = 1	( 0.0% and 1.9% )

	only  PF00534 has a PDB structure (may not be up to date)
	PF00534 	   c.87.1.8
	
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 477 ) 6705032_PF03025_PF05776            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF03025  is 6624819 with Jaccard = 0.9333	|PF03025|=15 [ 14	0	1100196	1 ]
 parent [ 6624819 ] : 6705032 0.0928571 (=13/(14*10)) 93.1071
  given [ 6624819 ] : 6624819 0.307692 (=4/(1*13)) 72.4616
	best keyword for cluster 6624819 is PF03025 with Jaccard = 0.9333	[ 14	0	1100196	1 ]	1.0000 0.9333
sibling [ 6624819 ] : 6695546 0.111111 (=1/(1*9)) 91.4445
	best keyword for cluster 6695546 is PF05776 with Jaccard = 1.0000	[ 6	0	1100205	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF03025 ( PF03025   Papillomavirus E5 )
	B> PF05776 ( PF05776   Papillomavirus E5A protein )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF03025 nor PF05776 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 478 ) 6620928_PF06807_PF08160            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF08160  is 6549379 with Jaccard = 0.9333	|PF08160|=14 [ 14	1	1100196	0 ]
 parent [ 6549379 ] : 6620928 0.317692 (=826/(52*50)) 70.9227
  given [ 6549379 ] : 6549379 0.655856 (=364/(15*37)) 38.3883
	best keyword for cluster 6549379 is PF08160 with Jaccard = 0.9333	[ 14	1	1100196	0 ]	0.9333 1.0000
sibling [ 6549379 ] : 6603400 0.397163 (=56/(3*47)) 63.3679
	best keyword for cluster 6603400 is PF06807 with Jaccard = 0.8857	[ 31	0	1100176	4 ]	1.0000 0.8857

	SUGGESTING RELATEDNESS OF:   
	A> PF08160 ( PF08160   NUC156 domain )
	B> PF06807 ( PF06807   Pre-mRNA cleavage complex II protein Clp1 )

	Neither A nor B are assigned a clan.

	the two keywords coincide on Uniref90 proteins: |PF06807| = 35 ,	 |PF08160| = 14 , 	|PF06807^PF08160| = 1	( 2.9% and 7.1% )

	Neither PF08160 nor PF06807 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 479 ) 6470551_PF00693_PF08465            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF08465  is 6072180 with Jaccard = 0.9333	|PF08465|=14 [ 14	1	1100196	0 ]
 parent [ 6072180 ] : 6470551 0.966667 (=406/(15*28)) 3.86678
  given [ 6072180 ] : 6072180 1 (=14/(1*14)) 7.43001e-28
	best keyword for cluster 6072180 is PF08465 with Jaccard = 0.9333	[ 14	1	1100196	0 ]	0.9333 1.0000
sibling [ 6072180 ] : 6308051 1 (=115/(23*5)) 5.21742e-09
	best keyword for cluster 6308051 is PF00693 with Jaccard = 0.6512	[ 28	0	1100168	15 ]	1.0000 0.6512

	SUGGESTING RELATEDNESS OF:   
	A> PF08465 ( PF08465   Thymidine kinase from Herpesvirus C-terminal )
	B> PF00693 ( PF00693   Thymidine kinase from herpesvirus )

	Neither A nor B are assigned a clan.

	the two keywords coincide on Uniref90 proteins: |PF00693| = 43 ,	 |PF08465| = 14 , 	|PF00693^PF08465| = 7	( 16.3% and 50.0% )

	only  PF08465 has a PDB structure (may not be up to date)
		PF00693 	   c.37.1.1

	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 480 ) 6509578_PF02491_PF06723            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02491  is 6470596 with Jaccard = 0.9330	|PF02491|=194 [ 181	0	1100017	13 ]
 parent [ 6470596 ] : 6509578 0.869734 (=41081/(209*226)) 15.108
  given [ 6470596 ] : 6470596 0.963933 (=3314/(18*191)) 3.88218
	best keyword for cluster 6470596 is PF02491 with Jaccard = 0.9330	[ 181	0	1100017	13 ]	1.0000 0.9330
sibling [ 6470596 ] : 6502948 0.899064 (=4035/(22*204)) 12.5566
	best keyword for cluster 6502948 is PF06723 with Jaccard = 0.9786	[ 183	4	1100024	0 ]	0.9786 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF02491 ( PF02491   Cell division protein FtsA )
	B> PF06723 ( PF06723   MreB/Mbl protein )

	they come from the same clan: CL0108.10 : PF06406	PF00480	PF02541	PF00814	PF06723	PF05378	PF01968	PF00012	PF03727	PF00349	PF02685	PF01150	PF02491	PF00370	PF02782	PF02543	PF01869	PF00022	PF00871	PF03702
	the two keywords do not coincide on UniRef90 proteins

	both PF02491 and PF06723 have PDB structures
	PF02491 	   c.55.1.1
	
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 481 ) 6710627_PF01892_PF04608            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01892  is 6524055 with Jaccard = 0.9310	|PF01892|=29 [ 27	0	1100182	2 ]
 parent [ 6524055 ] : 6710627 0.0820747 (=413/(37*136)) 94.0248
  given [ 6524055 ] : 6524055 0.793706 (=227/(26*11)) 22.2785
	best keyword for cluster 6524055 is PF01892 with Jaccard = 0.9310	[ 27	0	1100182	2 ]	1.0000 0.9310
sibling [ 6524055 ] : 6594591 0.482422 (=494/(8*128)) 59.2157
	best keyword for cluster 6594591 is PF04608 with Jaccard = 1.0000	[ 113	0	1100098	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF01892 (  )
	B> PF04608 ( PF04608   Phosphatidylglycerophosphatase A )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF01892 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:
     1	PF04608 SSF101307 0.945 (average over 471 mutual instances, PF04608 471 appearances, SSF101307 476 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 482 ) 6690055_PF01039_PF06833            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF06833  is 6349412 with Jaccard = 0.9310	|PF06833|=29 [ 27	0	1100182	2 ]
 parent [ 6349412 ] : 6690055 0.116854 (=2808/(27*890)) 90.2525
  given [ 6349412 ] : 6349412 1 (=182/(13*14)) 4.51987e-06
	best keyword for cluster 6349412 is PF06833 with Jaccard = 0.9310	[ 27	0	1100182	2 ]	1.0000 0.9310
sibling [ 6349412 ] : 6665646 0.164977 (=293/(2*888)) 84.4146
	best keyword for cluster 6665646 is PF01039 with Jaccard = 0.7153	[ 603	173	1099368	67 ]	0.7771 0.9000

	SUGGESTING RELATEDNESS OF:   
	A> PF06833 ( PF06833   Malonate decarboxylase gamma subunit (MdcE) )
	B> PF01039 ( PF01039   Carboxyl transferase domain )

	they come from the same clan: CL0127.6 : PF03255	PF01039	PF00574	PF01972	PF00378	PF06833	PF03572	PF01343
	the two keywords do not coincide on UniRef90 proteins

	only  PF06833 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:
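A small consistency check on the linkage figures of this entry, as a sketch only. It follows the legend's formula, %LINKAGE = |existing_edges| / (|cluster1| * |cluster2|), and the legend's statement that the cluster size is |cluster1| + |cluster2|. The helper name linkage and the rows dictionary are hypothetical.

```python
def linkage(edges, size1, size2):
    """Fraction of the possible BLAST edges between two sub-clusters that exist."""
    return edges / (size1 * size2)


# (existing_edges, |cluster1|, |cluster2|) copied from entry 482.
rows = {
    "parent": (2808, 27, 890),
    "given": (182, 13, 14),
    "sibling": (293, 2, 888),
}

for name, (edges, size1, size2) in rows.items():
    print(f"{name:>7}: linkage={linkage(edges, size1, size2):.6f} size={size1 + size2}")
```

The sizes of the given (13 + 14 = 27) and sibling (2 + 888 = 890) clusters equal the two factors on the parent line, which is what one would expect if the parent is the merge of exactly these two clusters.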






-------------------====== ( 483 ) 6741784_PF05721_PF07350            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF05721  is 6738761 with Jaccard = 0.9307	|PF05721|=272 [ 255	2	1099937	17 ]
 parent [ 6738761 ] : 6741784 0.026807 (=708/(77*343)) 97.9501
  given [ 6738761 ] : 6738761 0.0302721 (=267/(28*315)) 97.6616
	best keyword for cluster 6738761 is PF05721 with Jaccard = 0.9307	[ 255	2	1099937	17 ]	0.9922 0.9375
sibling [ 6738761 ] : 6713416 0.0729167 (=105/(32*45)) 94.4737
	best keyword for cluster 6713416 is PF07350 with Jaccard = 0.6744	[ 29	14	1100168	0 ]	0.6744 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF05721 ( PF05721   Phytanoyl-CoA dioxygenase (PhyH) )
	B> PF07350 ( PF07350   Protein of unknown function (DUF1479) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF05721 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 484 ) 6742084_PF03288_PF05272            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF03288  is 6734488 with Jaccard = 0.9301	|PF03288|=141 [ 133	2	1100068	8 ]
 parent [ 6734488 ] : 6742084 0.0221663 (=740/(156*214)) 97.9776
  given [ 6734488 ] : 6734488 0.0303658 (=44/(7*207)) 97.2229
	best keyword for cluster 6734488 is PF03288 with Jaccard = 0.9301	[ 133	2	1100068	8 ]	0.9852 0.9433
sibling [ 6734488 ] : 6725041 0.0424691 (=258/(81*75)) 96.1114
	best keyword for cluster 6725041 is PF05272 with Jaccard = 0.7826	[ 54	15	1100142	0 ]	0.7826 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF03288 ( PF03288   Poxvirus D5 protein-like )
	B> PF05272 ( PF05272   Virulence-associated protein E )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF03288 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 485 ) 6704291_PF03734_PF06104            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF03734  is 6652683 with Jaccard = 0.9301	|PF03734|=499 [ 466	2	1099710	33 ]
 parent [ 6652683 ] : 6704291 0.0912986 (=6115/(122*549)) 92.985
  given [ 6652683 ] : 6652683 0.231779 (=16530/(338*211)) 80.9584
	best keyword for cluster 6652683 is PF03734 with Jaccard = 0.9301	[ 466	2	1099710	33 ]	0.9957 0.9339
sibling [ 6652683 ] : 6623048 0.294956 (=269/(114*8)) 71.6842
	best keyword for cluster 6623048 is PF06104 with Jaccard = 0.6452	[ 40	22	1100149	0 ]	0.6452 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF03734 ( PF03734   ErfK/YbiS/YcfS/YnhG )
	B> PF06104 ( PF06104   Bacterial protein of unknown function (DUF949) )

	Neither A nor B are assigned a clan.

	the two keywords coincide on Uniref90 proteins: |PF03734| = 499 ,	 |PF06104| = 40 , 	|PF03734^PF06104| = 4	( 0.8% and 10.0% )

	only  PF03734 has a PDB structure (may not be up to date)
	PF03734 	   b.160.1.1
	
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:
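How the two coincidence percentages of this entry appear to be derived, as a hedged sketch: the shared protein count divided by each keyword's own count, in the order the keywords are listed. This reading is inferred from the printed numbers; the function name coincidence is hypothetical.

```python
def coincidence(n_a, n_b, n_both):
    """Percentage of each keyword's proteins that also carry the other keyword."""
    return 100.0 * n_both / n_a, 100.0 * n_both / n_b


# |PF03734| = 499, |PF06104| = 40, |PF03734^PF06104| = 4 (entry 485).
pct_a, pct_b = coincidence(499, 40, 4)
print(f"{pct_a:.1f}% of PF03734 proteins, {pct_b:.1f}% of PF06104 proteins")
```

The asymmetry matters when judging relatedness: the 4 shared proteins are negligible for the larger family but one tenth of the smaller one.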






-------------------====== ( 486 ) 6762972_PF02402_PF06809            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02402  is 6457547 with Jaccard = 0.9286	|PF02402|=14 [ 13	0	1100197	1 ]
 parent [ 6457547 ] : 6762972 0.00769231 (=3/(13*30)) 99.4154
  given [ 6457547 ] : 6457547 1 (=30/(3*10)) 2.02548
	best keyword for cluster 6457547 is PF02402 with Jaccard = 0.9286	[ 13	0	1100197	1 ]	1.0000 0.9286
sibling [ 6457547 ] : 6746646 0.0170455 (=3/(8*22)) 98.3614
	best keyword for cluster 6746646 is PF06809 with Jaccard = 0.8333	[ 10	0	1100199	2 ]	1.0000 0.8333

	SUGGESTING RELATEDNESS OF:   
	A> PF02402 ( PF02402   Lysis protein )
	B> PF06809 ( PF06809   Neural proliferation differentiation control-1 protein (NPDC1) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF02402 nor PF06809 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 487 ) 6604310_PF04352_PF05286            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04352  is 6558791 with Jaccard = 0.9286	|PF04352|=42 [ 39	0	1100169	3 ]
 parent [ 6558791 ] : 6604310 0.382979 (=126/(7*47)) 63.7324
  given [ 6558791 ] : 6558791 0.581395 (=100/(4*43)) 45.8572
	best keyword for cluster 6558791 is PF04352 with Jaccard = 0.9286	[ 39	0	1100169	3 ]	1.0000 0.9286
sibling [ 6558791 ] : 6490635 0.916667 (=11/(4*3)) 8.42992
	best keyword for cluster 6490635 is PF05286 with Jaccard = 0.6667	[ 4	2	1100205	0 ]	0.6667 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF04352 ( PF04352   ProQ activator of osmoprotectant transporter ProP )
	B> PF05286 ( PF05286   Fertility inhibition protein (FINO) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF04352 has a PDB structure (may not be up to date)
		PF05286 	   a.136.1.1

	SUPERFAM mapping significantly overlapping:
     1	PF04352 SSF48657 0.708 (average over 183 mutual instances, PF04352 183 appearances, SSF48657 240 appearances)
     2	PF05286 SSF48657 0.922 (average over 46 mutual instances, PF05286 46 appearances, SSF48657 240 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 488 ) 6753644_PF07610_PF07955            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF07955  is 6717867 with Jaccard = 0.9286	|PF07955|=14 [ 13	0	1100197	1 ]
 parent [ 6717867 ] : 6753644 0.0123494 (=41/(20*166)) 98.88
  given [ 6717867 ] : 6717867 0.0505051 (=5/(9*11)) 95.1319
	best keyword for cluster 6717867 is PF07955 with Jaccard = 0.9286	[ 13	0	1100197	1 ]	1.0000 0.9286
sibling [ 6717867 ] : 6745563 0.023913 (=132/(46*120)) 98.2777
	best keyword for cluster 6745563 is PF07610 with Jaccard = 0.7931	[ 23	6	1100182	0 ]	0.7931 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF07955 ( PF07955   Protein of unknown function (DUF1687) )
	B> PF07610 ( PF07610   Protein of unknown function (DUF1573) )

	Only A has a clan ( CL0172.11 ).
	the two keywords do not coincide on UniRef90 proteins

	only  PF07955 has a PDB structure (may not be up to date)
	PF07955 	   c.47.1.18
	
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 489 ) 6588362_PF01412_PF08518            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF08518  is 6495466 with Jaccard = 0.9286	|PF08518|=26 [ 26	2	1100183	0 ]
 parent [ 6495466 ] : 6588362 0.440382 (=4986/(34*333)) 56.5453
  given [ 6495466 ] : 6495466 0.920415 (=266/(17*17)) 9.91523
	best keyword for cluster 6495466 is PF08518 with Jaccard = 0.9286	[ 26	2	1100183	0 ]	0.9286 1.0000
sibling [ 6495466 ] : 6563040 0.510574 (=338/(2*331)) 49.5698
	best keyword for cluster 6563040 is PF01412 with Jaccard = 0.8867	[ 313	2	1099858	38 ]	0.9937 0.8917

	SUGGESTING RELATEDNESS OF:   
	A> PF08518 ( PF08518   Spa2 homology domain (SHD) of GIT )
	B> PF01412 ( PF01412   Putative GTPase activating protein for Arf )

	Neither A nor B are assigned a clan.

	the two keywords coincide on Uniref90 proteins: |PF01412| = 351 ,	 |PF08518| = 26 , 	|PF01412^PF08518| = 13	( 3.7% and 50.0% )

	only  PF08518 has a PDB structure (may not be up to date)
		PF01412 	   g.45.1.1

	SUPERFAM mapping significantly overlapping:
     1	PF01412 SSF57863 0.943 (average over 744 mutual instances, PF01412 1021 appearances, SSF57863 1344 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 490 ) 6721128_PF05067_PF05974            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF05974  is 6665821 with Jaccard = 0.9275	|PF05974|=69 [ 64	0	1100142	5 ]
 parent [ 6665821 ] : 6721128 0.0445054 (=292/(81*81)) 95.5809
  given [ 6665821 ] : 6665821 0.183761 (=43/(78*3)) 84.4548
	best keyword for cluster 6665821 is PF05974 with Jaccard = 0.9275	[ 64	0	1100142	5 ]	1.0000 0.9275
sibling [ 6665821 ] : 6677704 0.125 (=10/(1*80)) 87.6362
	best keyword for cluster 6677704 is PF05067 with Jaccard = 0.9870	[ 76	0	1100134	1 ]	1.0000 0.9870

	SUGGESTING RELATEDNESS OF:   
	A> PF05974 ( PF05974   Protein of unknown function (DUF892) )
	B> PF05067 ( PF05067   Manganese containing catalase )

	Only B has a clan ( CL0044.8 ).

	the two keywords coincide on Uniref90 proteins: |PF05067| = 77 ,	 |PF05974| = 69 , 	|PF05067^PF05974| = 4	( 5.2% and 5.8% )

	only  PF05974 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:
     1	PF05974 SSF47240 0.882 (average over 92 mutual instances, PF05974 92 appearances, SSF47240 6970 appearances)
     2	PF05067 SSF47240 0.922 (average over 207 mutual instances, PF05067 207 appearances, SSF47240 6970 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 491 ) 6759074_PF00711_PF08131            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00711  is 6740379 with Jaccard = 0.9272	|PF00711|=146 [ 140	5	1100060	6 ]
 parent [ 6740379 ] : 6759074 0.0118758 (=44/(15*247)) 99.2137
  given [ 6740379 ] : 6740379 0.0303713 (=445/(148*99)) 97.8153
	best keyword for cluster 6740379 is PF00711 with Jaccard = 0.9272	[ 140	5	1100060	6 ]	0.9655 0.9589
sibling [ 6740379 ] : 6731564 0.0535714 (=3/(8*7)) 96.8929
	best keyword for cluster 6731564 is PF08131 with Jaccard = 0.8333	[ 5	0	1100205	1 ]	1.0000 0.8333

	SUGGESTING RELATEDNESS OF:   
	A> PF00711 ( PF00711   Beta defensin )
	B> PF08131 ( PF08131   Defensin-like peptide family )

	they come from the same clan: CL0075.8 : PF00711	PF08131	PF00323	PF07936	PF00706

	the two keywords coincide on Uniref90 proteins: |PF00711| = 146 ,	 |PF08131| = 6 , 	|PF00711^PF08131| = 1	( 0.7% and 16.7% )

	both PF00711 and PF08131 have PDB structures
	PF00711 	   g.9.1.1
	PF08131 	   g.9.1.1

	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 492 ) 6627242_PF01676_PF08342            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF08342  is 6497516 with Jaccard = 0.9271	|PF08342|=89 [ 89	7	1100115	0 ]
 parent [ 6497516 ] : 6627242 0.302146 (=9039/(108*277)) 73.4601
  given [ 6497516 ] : 6497516 0.897196 (=96/(1*107)) 10.6754
	best keyword for cluster 6497516 is PF08342 with Jaccard = 0.9271	[ 89	7	1100115	0 ]	0.9271 1.0000
sibling [ 6497516 ] : 6599411 0.443325 (=6602/(73*204)) 61.3576
	best keyword for cluster 6599411 is PF01676 with Jaccard = 0.6966	[ 248	6	1099855	102 ]	0.9764 0.7086

	SUGGESTING RELATEDNESS OF:   
	A> PF08342 ( PF08342   Phosphopentomutase N-terminal )
	B> PF01676 ( PF01676   Metalloenzyme superfamily )

	Only B has a clan ( CL0088.10 ).

	the two keywords coincide on Uniref90 proteins: |PF01676| = 350 ,	 |PF08342| = 89 , 	|PF01676^PF08342| = 86	( 24.6% and 96.6% )

	only  PF08342 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 493 ) 6766583_PF03364_PF08327            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF08327  is 6745057 with Jaccard = 0.9269	|PF08327|=259 [ 241	1	1099951	18 ]
 parent [ 6745057 ] : 6766583 0.00644022 (=1897/(365*807)) 99.5812
  given [ 6745057 ] : 6745057 0.022882 (=531/(82*283)) 98.239
	best keyword for cluster 6745057 is PF08327 with Jaccard = 0.9269	[ 241	1	1099951	18 ]	0.9959 0.9305
sibling [ 6745057 ] : 6761345 0.00995196 (=667/(94*713)) 99.3363
	best keyword for cluster 6761345 is PF03364 with Jaccard = 0.7143	[ 290	90	1099805	26 ]	0.7632 0.9177

	SUGGESTING RELATEDNESS OF:   
	A> PF08327 ( PF08327   Activator of Hsp90 ATPase homolog 1-like protein )
	B> PF03364 ( PF03364   Polyketide cyclase / dehydrase and lipid transport )

	they come from the same clan: CL0209.4 : PF08327	PF00407	PF06240	PF02121	PF03364	PF00848	PF01852
	the two keywords do not coincide on UniRef90 proteins

	both PF08327 and PF03364 have PDB structures
	PF08327 	   d.129.3.5
	
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 494 ) 6727985_PF04586_PF05065            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF05065  is 6704757 with Jaccard = 0.9268	|PF05065|=116 [ 114	7	1100088	2 ]
 parent [ 6704757 ] : 6727985 0.0363515 (=709/(106*184)) 96.4814
  given [ 6704757 ] : 6704757 0.0793951 (=294/(23*161)) 93.0433
	best keyword for cluster 6704757 is PF05065 with Jaccard = 0.9268	[ 114	7	1100088	2 ]	0.9421 0.9828
sibling [ 6704757 ] : 6697872 0.0926385 (=112/(93*13)) 91.8641
	best keyword for cluster 6697872 is PF04586 with Jaccard = 0.8142	[ 92	0	1100098	21 ]	1.0000 0.8142

	SUGGESTING RELATEDNESS OF:   
	A> PF05065 ( PF05065   Phage capsid family )
	B> PF04586 ( PF04586   Caudovirus prohead protease )

	Only B has a clan ( CL0201.5 ).

	the two keywords coincide on Uniref90 proteins: |PF04586| = 113 ,	 |PF05065| = 116 , 	|PF04586^PF05065| = 5	( 4.4% and 4.3% )

	Neither PF05065 nor PF04586 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:
     1	PF04586 SSF50789 0.760 (average over 36 mutual instances, PF04586 36 appearances, SSF50789 125 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 495 ) 6732074_PF01190_PF03251            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01190  is 6638470 with Jaccard = 0.9265	|PF01190|=67 [ 63	1	1100143	4 ]
 parent [ 6638470 ] : 6732074 0.0354651 (=61/(86*20)) 96.9513
  given [ 6638470 ] : 6638470 0.258889 (=466/(36*50)) 76.6064
	best keyword for cluster 6638470 is PF01190 with Jaccard = 0.9265	[ 63	1	1100143	4 ]	0.9844 0.9403
sibling [ 6638470 ] : 6704236 0.0879121 (=8/(13*7)) 92.9642
	best keyword for cluster 6704236 is PF03251 with Jaccard = 1.0000	[ 13	0	1100198	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF01190 ( PF01190   Pollen proteins Ole e I family )
	B> PF03251 ( PF03251   Tymovirus 45/70Kd protein )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF01190 nor PF03251 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 496 ) 6617800_PF02626_PF02682            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02626  is 6427134 with Jaccard = 0.9239	|PF02626|=184 [ 170	0	1100027	14 ]
 parent [ 6427134 ] : 6617800 0.312951 (=8813/(189*149)) 69.5383
  given [ 6427134 ] : 6427134 0.998913 (=919/(184*5)) 0.215921
	best keyword for cluster 6427134 is PF02626 with Jaccard = 0.9239	[ 170	0	1100027	14 ]	1.0000 0.9239
sibling [ 6427134 ] : 6463866 0.974101 (=1354/(10*139)) 2.80598
	best keyword for cluster 6463866 is PF02682 with Jaccard = 0.7500	[ 132	0	1100035	44 ]	1.0000 0.7500

	SUGGESTING RELATEDNESS OF:   
	A> PF02626 ( PF02626   Allophanate hydrolase subunit 2 )
	B> PF02682 ( PF02682   Allophanate hydrolase subunit 1 )

	Neither A nor B are assigned a clan.

	the two keywords coincide on Uniref90 proteins: |PF02626| = 184 ,	 |PF02682| = 176 , 	|PF02626^PF02682| = 57	( 31.0% and 32.4% )

	Neither PF02626 nor PF02682 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 497 ) 6744013_PF00650_PF04707            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00650  is 6709401 with Jaccard = 0.9231	|PF00650|=403 [ 384	13	1099795	19 ]
 parent [ 6709401 ] : 6744013 0.0190589 (=678/(77*462)) 98.147
  given [ 6709401 ] : 6709401 0.0777032 (=521/(15*447)) 93.8452
	best keyword for cluster 6709401 is PF00650 with Jaccard = 0.9231	[ 384	13	1099795	19 ]	0.9673 0.9529
sibling [ 6709401 ] : 6687641 0.116667 (=42/(5*72)) 89.7697
	best keyword for cluster 6687641 is PF04707 with Jaccard = 0.8118	[ 69	0	1100126	16 ]	1.0000 0.8118

	SUGGESTING RELATEDNESS OF:   
	A> PF00650 ( PF00650   CRAL/TRIO domain )
	B> PF04707 ( PF04707   PRELI-like family )

	Neither A nor B are assigned a clan.

	the two keywords coincide on Uniref90 proteins: |PF00650| = 403 ,	 |PF04707| = 85 , 	|PF00650^PF04707| = 11	( 2.7% and 12.9% )

	only  PF00650 has a PDB structure (may not be up to date)
	PF00650 	   c.13.1.1
	
	SUPERFAM mapping significantly overlapping:
     1	PF00650 SSF52087 0.769 (average over 887 mutual instances, PF00650 1638 appearances, SSF52087 1745 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 498 ) 6723643_PF01210_PF02558            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02558  is 6656580 with Jaccard = 0.9231	|PF02558|=321 [ 300	4	1099886	21 ]
 parent [ 6656580 ] : 6723643 0.0517331 (=7358/(330*431)) 95.9455
  given [ 6656580 ] : 6656580 0.198171 (=130/(2*328)) 82.2279
	best keyword for cluster 6656580 is PF02558 with Jaccard = 0.9231	[ 300	4	1099886	21 ]	0.9868 0.9346
sibling [ 6656580 ] : 6691574 0.114798 (=2591/(370*61)) 90.5913
	best keyword for cluster 6691574 is PF01210 with Jaccard = 0.8753	[ 358	41	1099802	10 ]	0.8972 0.9728

	SUGGESTING RELATEDNESS OF:   
	A> PF02558 ( PF02558   Ketopantoate reductase PanE/ApbA )
	B> PF01210 ( PF01210   NAD-dependent glycerol-3-phosphate dehydrogenase N-terminus )

	they come from the same clan: CL0063.17 : PF03721	PF04820	PF02254	PF00899	PF01946	PF02882	PF01488	PF01118	PF08491	PF03435	PF04321	PF07992	PF00070	PF02719	PF02153	PF02423	PF05368	PF01210	PF07994	PF07993	PF03447	PF03446	PF01225	PF06039	PF01232	PF03949	PF05834	PF00056	PF08659	PF07991	PF03486	PF00044	PF00732	PF01134	PF01408	PF00996	PF00479	PF00743	PF01494	PF00890	PF03807	PF01370	PF00208	PF02670	PF01113	PF01266	PF02629	PF02558	PF01593	PF01262	PF00670	PF00107	PF00106	PF02737	PF01073	PF02826
	the two keywords do not coincide on UniRef90 proteins

	both PF02558 and PF01210 have PDB structures
	PF02558 	   c.2.1.6
	
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 499 ) 6747247_PF02566_PF02624            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02566  is 6733389 with Jaccard = 0.9231	|PF02566|=494 [ 456	0	1099717	38 ]
 parent [ 6733389 ] : 6747247 0.0177635 (=1152/(523*124)) 98.4085
  given [ 6733389 ] : 6733389 0.0351606 (=127/(516*7)) 97.0983
	best keyword for cluster 6733389 is PF02566 with Jaccard = 0.9231	[ 456	0	1099717	38 ]	1.0000 0.9231
sibling [ 6733389 ] : 6734835 0.0286499 (=73/(26*98)) 97.2563
	best keyword for cluster 6734835 is PF02624 with Jaccard = 0.7632	[ 87	22	1100097	5 ]	0.7982 0.9457

	SUGGESTING RELATEDNESS OF:   
	A> PF02566 ( PF02566   OsmC-like protein )
	B> PF02624 ( PF02624   YcaO-like family )

	Neither A nor B are assigned a clan.

	the two keywords coincide on Uniref90 proteins: |PF02566| = 494 ,	 |PF02624| = 92 , 	|PF02566^PF02624| = 10	( 2.0% and 10.9% )

	only  PF02566 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:
     1	PF02566 SSF82784 0.853 (average over 1637 mutual instances, PF02566 1638 appearances, SSF82784 1704 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 500 ) 6578610_PF02087_PF03973            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF03973  is 6516787 with Jaccard = 0.9231	|PF03973|=26 [ 24	0	1100185	2 ]
 parent [ 6516787 ] : 6578610 0.515152 (=136/(11*24)) 53.3059
  given [ 6516787 ] : 6516787 0.828125 (=106/(16*8)) 18.5923
	best keyword for cluster 6516787 is PF03973 with Jaccard = 0.9231	[ 24	0	1100185	2 ]	1.0000 0.9231
sibling [ 6516787 ] : 6497715 0.9 (=27/(5*6)) 10.8127
	best keyword for cluster 6497715 is PF02087 with Jaccard = 0.9000	[ 9	1	1100201	0 ]	0.9000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF03973 ( PF03973   Triabin )
	B> PF02087 ( PF02087   Nitrophorin )

	they come from the same clan: CL0116.7 : PF03973	PF02087	PF08212	PF00061	PF02098	PF07137
	the two keywords do not coincide on UniRef90 proteins

	both PF03973 and PF02087 have PDB structures
		
	SUPERFAM mapping significantly overlapping:
     1	PF02087 SSF50814 0.793 (average over 15 mutual instances, PF02087 15 appearances, SSF50814 7354 appearances)
     2	PF03973 SSF50814 0.807 (average over 117 mutual instances, PF03973 117 appearances, SSF50814 7354 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 501 ) 6720727_PF03879_PF08524            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF08524  is 6666603 with Jaccard = 0.9231	|PF08524|=13 [ 12	0	1100198	1 ]
 parent [ 6666603 ] : 6720727 0.0612083 (=154/(37*68)) 95.5088
  given [ 6666603 ] : 6666603 0.190476 (=40/(7*30)) 84.6291
	best keyword for cluster 6666603 is PF08524 with Jaccard = 0.9231	[ 12	0	1100198	1 ]	1.0000 0.9231
sibling [ 6666603 ] : 6715736 0.0727273 (=52/(55*13)) 94.8341
	best keyword for cluster 6715736 is PF03879 with Jaccard = 0.9444	[ 17	1	1100193	0 ]	0.9444 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF08524 ( PF08524   rRNA processing )
	B> PF03879 ( PF03879   Cgr1 family )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF08524 nor PF03879 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 502 ) 6675685_PF00676_PF02780            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00676  is 6669739 with Jaccard = 0.9228	|PF00676|=653 [ 610	8	1099550	43 ]
 parent [ 6669739 ] : 6675685 0.146716 (=143089/(674*1447)) 87.0398
  given [ 6669739 ] : 6669739 0.153005 (=308/(3*671)) 85.4565
	best keyword for cluster 6669739 is PF00676 with Jaccard = 0.9228	[ 610	8	1099550	43 ]	0.9871 0.9342
sibling [ 6669739 ] : 6672641 0.152835 (=221/(1*1446)) 86.2285
	best keyword for cluster 6672641 is PF02780 with Jaccard = 0.7813	[ 1036	234	1098885	56 ]	0.8157 0.9487

	SUGGESTING RELATEDNESS OF:   
	A> PF00676 ( PF00676   Dehydrogenase E1 component )
	B> PF02780 ( PF02780   Transketolase, C-terminal domain )

	Only A has a clan ( CL0254.3 ).

	the two keywords coincide on Uniref90 proteins: |PF00676| = 653 ,	 |PF02780| = 1092 , 	|PF00676^PF02780| = 45	( 6.9% and 4.1% )

	both PF00676 and PF02780 have PDB structures
	PF00676 	   c.36.1.11
	
	SUPERFAM mapping significantly overlapping:
     1	PF02780 SSF52922 0.877 (average over 3553 mutual instances, PF02780 3663 appearances, SSF52922 11092 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 503 ) 6666407_PF00109_PF02803            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00108  is 6654688 with Jaccard = 0.9224	|PF00108|=920 [ 915	72	1099219	5 ]
 parent [ 6654688 ] : 6666407 0.19718 (=674108/(2735*1250)) 84.5803
  given [ 6654688 ] : 6654688 0.217774 (=272/(1*1249)) 81.6154
	best keyword for cluster 6654688 is PF02803 with Jaccard = 0.9323	[ 923	64	1099221	3 ]	0.9352 0.9968
sibling [ 6654688 ] : 6646279 0.21558 (=2355/(4*2731)) 78.8238
	best keyword for cluster 6646279 is PF00109 with Jaccard = 0.7916	[ 2009	469	1097673	60 ]	0.8107 0.9710

	SUGGESTING RELATEDNESS OF:   
	A> PF02803 ( PF02803   Thiolase, C-terminal domain )
	B> PF00109 ( PF00109   Beta-ketoacyl synthase, N-terminal domain )

	they come from the same clan: CL0046.10 : PF02803	PF02801	PF00109	PF01154	PF08392	PF00195	PF02797	PF08545	PF08541	PF00108

	the two keywords coincide on Uniref90 proteins: |PF00109| = 2069 ,	 |PF02803| = 926 , 	|PF00109^PF02803| = 8	( 0.4% and 0.9% )

	both PF02803 and PF00109 have PDB structures
		PF00109 	   c.95.1.1

	SUPERFAM mapping significantly overlapping:
     1	PF02803 SSF53901 0.956 (average over 3202 mutual instances, PF02803 3237 appearances, SSF53901 32336 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 504 ) 6720290_PF00834_PF02749            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01729  is 6526941 with Jaccard = 0.9223	|PF01729|=294 [ 273	2	1099915	21 ]
 parent [ 6526941 ] : 6720290 0.0666635 (=8551/(299*429)) 95.4559
  given [ 6526941 ] : 6526941 0.768456 (=229/(1*298)) 24.094
	best keyword for cluster 6526941 is PF02749 with Jaccard = 0.9375	[ 270	5	1099923	13 ]	0.9818 0.9541
sibling [ 6526941 ] : 6647292 0.251808 (=6615/(355*74)) 79.1065
	best keyword for cluster 6647292 is PF00834 with Jaccard = 0.8157	[ 323	71	1099815	2 ]	0.8198 0.9938

	SUGGESTING RELATEDNESS OF:   
	A> PF02749 ( PF02749   Quinolinate phosphoribosyl transferase, N-terminal domain )
	B> PF00834 ( PF00834   Ribulose-phosphate 3 epimerase family )

	Only B has a clan ( CL0036.17 ).

	the two keywords coincide on Uniref90 proteins: |PF00834| = 325 ,	 |PF02749| = 283 , 	|PF00834^PF02749| = 1	( 0.3% and 0.4% )

	both PF02749 and PF00834 have PDB structures
		
	SUPERFAM mapping significantly overlapping:
     1	PF00834 SSF51366 0.922 (average over 1075 mutual instances, PF00834 1075 appearances, SSF51366 8168 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:
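The %LINKAGE values of this record follow the legend formula |existing_edges| / (|cluster1| * |cluster2|), and the parent's two sizes should equal the sizes of its two children. A small sketch of that consistency check is given below; the function name linkage and the tuple layout are assumptions made for illustration only.

```python
def linkage(existing_edges, size1, size2):
    """Proportion of existing BLAST edges out of all possible cross edges."""
    return existing_edges / (size1 * size2)


# (edges, |cluster1|, |cluster2|) copied from record 504.
given = (229, 1, 298)      # cluster 6526941
sibling = (6615, 355, 74)  # cluster 6647292
parent = (8551, 299, 429)  # cluster 6720290

given_link = linkage(*given)
sibling_link = linkage(*sibling)
parent_link = linkage(*parent)

# The size of a cluster is |cluster1| + |cluster2| (see the legend), so the
# parent should be built from clusters of exactly these two sizes.
given_size = given[1] + given[2]
sibling_size = sibling[1] + sibling[2]
sizes_consistent = (given_size, sibling_size) == (parent[1], parent[2])

print(f"given   {given_link:.6g}")
print(f"sibling {sibling_link:.6g}")
print(f"parent  {parent_link:.6g}")
print("parent sizes match children:", sizes_consistent)
```

The low parent linkage relative to the two children reflects how few edges cross between the merged clusters compared with the edges inside each one.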






-------------------====== ( 505 ) 6707441_PF05985_PF06751            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF05985  is 6430145 with Jaccard = 0.9216	|PF05985|=51 [ 47	0	1100160	4 ]
 parent [ 6430145 ] : 6707441 0.0646401 (=185/(53*54)) 93.5371
  given [ 6430145 ] : 6430145 1 (=52/(1*52)) 0.288961
	best keyword for cluster 6430145 is PF05985 with Jaccard = 0.9216	[ 47	0	1100160	4 ]	1.0000 0.9216
sibling [ 6430145 ] : 6622681 0.384615 (=40/(2*52)) 71.6019
	best keyword for cluster 6622681 is PF06751 with Jaccard = 1.0000	[ 48	0	1100163	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF05985 ( PF05985   Ethanolamine ammonia-lyase light chain (EutC) )
	B> PF06751 ( PF06751   Ethanolamine ammonia lyase large subunit (EutB) )

	Neither A nor B are assigned a clan.

	the two keywords coincide on Uniref90 proteins: |PF05985| = 51 ,	 |PF06751| = 48 , 	|PF05985^PF06751| = 4	( 7.8% and 8.3% )

	Neither PF05985 nor PF06751 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 506 ) 6772684_PF02687_PF05341            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02687  is 6754914 with Jaccard = 0.9207	|PF02687|=1752 [ 1613	0	1098459	139 ]
 parent [ 6754914 ] : 6772684 0.00222387 (=309/(1957*71)) 99.7956
  given [ 6754914 ] : 6754914 0.0148301 (=688/(24*1933)) 98.9635
	best keyword for cluster 6754914 is PF02687 with Jaccard = 0.9207	[ 1613	0	1098459	139 ]	1.0000 0.9207
sibling [ 6754914 ] : 6770973 0.0142857 (=1/(1*70)) 99.7429
	best keyword for cluster 6770973 is PF05341 with Jaccard = 0.9583	[ 23	1	1100187	0 ]	0.9583 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF02687 ( PF02687   Predicted permease )
	B> PF05341 ( PF05341   Protein of unknown function (DUF708) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF02687 nor PF05341 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 507 ) 6690984_PF00228_PF04592            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00228  is 6608811 with Jaccard = 0.9206	|PF00228|=63 [ 58	0	1100148	5 ]
 parent [ 6608811 ] : 6690984 0.154971 (=106/(9*76)) 90.4818
  given [ 6608811 ] : 6608811 0.397516 (=192/(7*69)) 66.4337
	best keyword for cluster 6608811 is PF00228 with Jaccard = 0.9206	[ 58	0	1100148	5 ]	1.0000 0.9206
sibling [ 6608811 ] : 6423749 1 (=8/(1*8)) 0.154538
	best keyword for cluster 6423749 is PF04592 with Jaccard = 1.0000	[ 8	0	1100203	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF00228 ( PF00228   Bowman-Birk serine protease inhibitor family )
	B> PF04592 ( PF04592   Selenoprotein P, N terminal region )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF00228 has a PDB structure (may not be up to date)
	PF00228 	   g.3.13.1  j.38.1.1
	
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 508 ) 6740710_PF05277_PF05990            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF05990  is 6579232 with Jaccard = 0.9206	|PF05990|=63 [ 58	0	1100148	5 ]
 parent [ 6579232 ] : 6740710 0.0262857 (=138/(70*75)) 97.848
  given [ 6579232 ] : 6579232 0.508772 (=493/(19*51)) 53.4905
	best keyword for cluster 6579232 is PF05990 with Jaccard = 0.9206	[ 58	0	1100148	5 ]	1.0000 0.9206
sibling [ 6579232 ] : 6729790 0.0540541 (=4/(1*74)) 96.7027
	best keyword for cluster 6729790 is PF05277 with Jaccard = 1.0000	[ 52	0	1100159	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF05990 ( PF05990   Alpha/beta hydrolase of unknown function (DUF900) )
	B> PF05277 ( PF05277   Protein of unknown function (DUF726) )

	Only A has a clan ( CL0028.14 ).
	the two keywords do not coincide on UniRef90 proteins

	Neither PF05990 nor PF05277 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 509 ) 6745274_PF02457_PF07949            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF07949  is 6726164 with Jaccard = 0.9206	|PF07949|=63 [ 58	0	1100148	5 ]
 parent [ 6726164 ] : 6745274 0.0184584 (=307/(84*198)) 98.2536
  given [ 6726164 ] : 6726164 0.042735 (=20/(6*78)) 96.2522
	best keyword for cluster 6726164 is PF07949 with Jaccard = 0.9206	[ 58	0	1100148	5 ]	1.0000 0.9206
sibling [ 6726164 ] : 6730413 0.0435835 (=162/(21*177)) 96.7648
	best keyword for cluster 6730413 is PF02457 with Jaccard = 1.0000	[ 154	0	1100057	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF07949 ( PF07949   YbbR-like protein )
	B> PF02457 ( PF02457   Domain of unknown function DUF147 )

	Neither A nor B are assigned a clan.

	the two keywords coincide on Uniref90 proteins: |PF02457| = 154 ,	 |PF07949| = 63 , 	|PF02457^PF07949| = 4	( 2.6% and 6.3% )

	only  PF07949 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 510 ) 6776861_PF05160_PF07235            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF05160  is 6768345 with Jaccard = 0.9200	|PF05160|=25 [ 23	0	1100186	2 ]
 parent [ 6768345 ] : 6776861 0.00170036 (=9/(67*79)) 99.8986
  given [ 6768345 ] : 6768345 0.0035014 (=5/(51*28)) 99.6521
	best keyword for cluster 6768345 is PF05160 with Jaccard = 0.9200	[ 23	0	1100186	2 ]	1.0000 0.9200
sibling [ 6768345 ] : 6751579 0.0192982 (=11/(57*10)) 98.7395
	best keyword for cluster 6751579 is PF07235 with Jaccard = 1.0000	[ 27	0	1100184	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF05160 ( PF05160   DSS1/SEM1 family )
	B> PF07235 ( PF07235   Protein of unknown function (DUF1427) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF05160 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 511 ) 6756286_PF02957_PF08197            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF08197  is 6495001 with Jaccard = 0.9200	|PF08197|=25 [ 23	0	1100186	2 ]
 parent [ 6495001 ] : 6756286 0.00952381 (=46/(23*210)) 99.049
  given [ 6495001 ] : 6495001 0.954545 (=21/(1*22)) 9.78709
	best keyword for cluster 6495001 is PF08197 with Jaccard = 0.9200	[ 23	0	1100186	2 ]	1.0000 0.9200
sibling [ 6495001 ] : 6753250 0.0138889 (=48/(192*18)) 98.8547
	best keyword for cluster 6753250 is PF02957 with Jaccard = 0.9353	[ 159	9	1100041	2 ]	0.9464 0.9876

	SUGGESTING RELATEDNESS OF:   
	A> PF08197 ( PF08197   pORF2a truncated protein )
	B> PF02957 ( PF02957   TT viral ORF2 )

	Neither A nor B are assigned a clan.

	the two keywords coincide on Uniref90 proteins: |PF02957| = 161 ,	 |PF08197| = 25 , 	|PF02957^PF08197| = 2	( 1.2% and 8.0% )

	Neither PF08197 nor PF02957 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 512 ) 6713103_PF00535_PF04464            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04464  is 6699175 with Jaccard = 0.9189	|PF04464|=148 [ 136	0	1100063	12 ]
 parent [ 6699175 ] : 6713103 0.0587994 (=41373/(170*4139)) 94.424
  given [ 6699175 ] : 6699175 0.099375 (=159/(10*160)) 92.0341
	best keyword for cluster 6699175 is PF04464 with Jaccard = 0.9189	[ 136	0	1100063	12 ]	1.0000 0.9189
sibling [ 6699175 ] : 6711399 0.0686322 (=5091/(18*4121)) 94.1537
	best keyword for cluster 6711399 is PF00535 with Jaccard = 0.8802	[ 3496	139	1096239	337 ]	0.9618 0.9121

	SUGGESTING RELATEDNESS OF:   
	A> PF04464 ( PF04464   CDP-Glycerol:Poly(glycerophosphate) glycerophosphotransferase )
	B> PF00535 ( PF00535   Glycosyl transferase family 2 )

	A and B come from different clans ( CL0113.8 , CL0110.6 ).

	the two keywords coincide on Uniref90 proteins: |PF00535| = 3833 ,	 |PF04464| = 148 , 	|PF00535^PF04464| = 31	( 0.8% and 20.9% )

	only  PF04464 has a PDB structure (may not be up to date)
		PF00535 	   c.68.1.1

	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:
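The two percentages in the coincidence line of this record appear to be the shared protein count divided by each keyword's own count. A minimal sketch under that assumption follows; the function name overlap_percentages is hypothetical and the counts are copied from the record.

```python
def overlap_percentages(count_a, count_b, shared):
    """Share of each keyword's proteins that also carry the other keyword."""
    return 100.0 * shared / count_a, 100.0 * shared / count_b


# |PF00535| = 3833, |PF04464| = 148, |PF00535^PF04464| = 31 (record 512).
pct_pf00535, pct_pf04464 = overlap_percentages(3833, 148, 31)

print(f"{pct_pf00535:.1f}% of PF00535 proteins also carry PF04464")
print(f"{pct_pf04464:.1f}% of PF04464 proteins also carry PF00535")
```

The asymmetry is worth keeping in mind during inspection: roughly a fifth of the smaller family co-occurs with the larger one, while the reverse fraction is under one percent.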






-------------------====== ( 513 ) 6496462_PF00195_PF08392            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF08392  is 6450070 with Jaccard = 0.9189	|PF08392|=71 [ 68	3	1100137	3 ]
 parent [ 6450070 ] : 6496462 0.915768 (=15286/(78*214)) 10.0544
  given [ 6450070 ] : 6450070 0.987013 (=76/(1*77)) 1.30931
	best keyword for cluster 6450070 is PF08392 with Jaccard = 0.9189	[ 68	3	1100137	3 ]	0.9577 0.9577
sibling [ 6450070 ] : 6483175 0.941141 (=1551/(8*206)) 6.43543
	best keyword for cluster 6483175 is PF00195 with Jaccard = 0.8858	[ 194	5	1099992	20 ]	0.9749 0.9065

	SUGGESTING RELATEDNESS OF:   
	A> PF08392 ( PF08392   FAE1/Type III polyketide synthase-like protein )
	B> PF00195 ( PF00195   Chalcone and stilbene synthases, N-terminal domain )

	they come from the same clan: CL0046.10 : PF02803	PF02801	PF00109	PF01154	PF08392	PF00195	PF02797	PF08545	PF08541	PF00108
	the two keywords do not coincide on UniRef90 proteins

	only  PF08392 has a PDB structure (may not be up to date)
		PF00195 	   c.95.1.2

	SUPERFAM mapping significantly overlapping:
     1	PF00195 SSF53901 0.548 (average over 1376 mutual instances, PF00195 1385 appearances, SSF53901 32336 appearances)
     2	PF08392 SSF53901 0.710 (average over 216 mutual instances, PF08392 218 appearances, SSF53901 32336 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 514 ) 6713039_PF04235_PF07786            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04235  is 6616824 with Jaccard = 0.9176	|PF04235|=78 [ 78	7	1100126	0 ]
 parent [ 6616824 ] : 6713039 0.0715755 (=905/(109*116)) 94.4147
  given [ 6616824 ] : 6616824 0.345804 (=684/(23*86)) 69.1724
	best keyword for cluster 6616824 is PF04235 with Jaccard = 0.9176	[ 78	7	1100126	0 ]	0.9176 1.0000
sibling [ 6616824 ] : 6701367 0.0982786 (=314/(45*71)) 92.4182
	best keyword for cluster 6701367 is PF07786 with Jaccard = 0.9167	[ 33	2	1100175	1 ]	0.9429 0.9706

	SUGGESTING RELATEDNESS OF:   
	A> PF04235 ( PF04235   Protein of unknown function (DUF418) )
	B> PF07786 ( PF07786   Protein of unknown function (DUF1624) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF04235 nor PF07786 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 515 ) 6774979_PF04325_PF05268            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04325  is 6678665 with Jaccard = 0.9175	|PF04325|=97 [ 89	0	1100114	8 ]
 parent [ 6678665 ] : 6774979 0.00163934 (=13/(130*61)) 99.8553
  given [ 6678665 ] : 6678665 0.160862 (=679/(67*63)) 87.8355
	best keyword for cluster 6678665 is PF04325 with Jaccard = 0.9175	[ 89	0	1100114	8 ]	1.0000 0.9175
sibling [ 6678665 ] : 6772826 0.0166667 (=1/(1*60)) 99.8
	best keyword for cluster 6772826 is PF05268 with Jaccard = 0.6667	[ 12	6	1100193	0 ]	0.6667 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF04325 ( PF04325   Protein of unknown function (DUF465) )
	B> PF05268 ( PF05268   Phage tail fibre adhesin Gp38 )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF04325 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 516 ) 6717220_PF08209_PF08313            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF08313  is 6683156 with Jaccard = 0.9167	|PF08313|=36 [ 33	0	1100175	3 ]
 parent [ 6683156 ] : 6717220 0.0571429 (=88/(44*35)) 95.0528
  given [ 6683156 ] : 6683156 0.130081 (=16/(3*41)) 88.9781
	best keyword for cluster 6683156 is PF08313 with Jaccard = 0.9167	[ 33	0	1100175	3 ]	1.0000 0.9167
sibling [ 6683156 ] : 6711396 0.0588235 (=2/(1*34)) 94.1529
	best keyword for cluster 6711396 is PF08209 with Jaccard = 1.0000	[ 19	0	1100192	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF08313 ( PF08313   SCA7 )
	B> PF08209 ( PF08209   Sgf11 (transcriptional regulation protein) )

	Neither A nor B are assigned a clan.

	the two keywords coincide on Uniref90 proteins: |PF08209| = 19 ,	 |PF08313| = 36 , 	|PF08209^PF08313| = 1	( 5.3% and 2.8% )

	Neither PF08313 nor PF08209 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 517 ) 6765258_PF04326_PF04703            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04326  is 6760897 with Jaccard = 0.9155	|PF04326|=207 [ 195	6	1099998	12 ]
 parent [ 6760897 ] : 6765258 0.00657058 (=62/(28*337)) 99.5231
  given [ 6760897 ] : 6760897 0.00890269 (=43/(15*322)) 99.3127
	best keyword for cluster 6760897 is PF04326 with Jaccard = 0.9155	[ 195	6	1099998	12 ]	0.9701 0.9420
sibling [ 6760897 ] : 6754104 0.0125 (=2/(8*20)) 98.9125
	best keyword for cluster 6754104 is PF04703 with Jaccard = 0.8889	[ 8	0	1100202	1 ]	1.0000 0.8889

	SUGGESTING RELATEDNESS OF:   
	A> PF04326 ( PF04326   Divergent AAA domain )
	B> PF04703 ( PF04703   FaeA-like protein )

	Only B has a clan ( CL0123.12 ).
	the two keywords do not coincide on UniRef90 proteins

	Neither PF04326 nor PF04703 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 518 ) 6714871_PF01170_PF01555            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01555  is 6669111 with Jaccard = 0.9146	|PF01555|=532 [ 514	30	1099649	18 ]
 parent [ 6669111 ] : 6714871 0.0729302 (=11094/(626*243)) 94.7007
  given [ 6669111 ] : 6669111 0.180676 (=561/(5*621)) 85.3135
	best keyword for cluster 6669111 is PF01555 with Jaccard = 0.9146	[ 514	30	1099649	18 ]	0.9449 0.9662
sibling [ 6669111 ] : 6705075 0.0742678 (=71/(4*239)) 93.1176
	best keyword for cluster 6705075 is PF01170 with Jaccard = 0.6875	[ 198	5	1099923	85 ]	0.9754 0.6996

	SUGGESTING RELATEDNESS OF:   
	A> PF01555 ( PF01555   DNA methylase )
	B> PF01170 ( PF01170   Putative RNA methylase family UPF0020 )

	they come from the same clan: CL0102.14 : PF06962	PF00398	PF06325	PF03291	PF01135	PF01358	PF06460	PF01189	PF05401	PF01234	PF01555	PF02384	PF07942	PF05175	PF05063	PF07109	PF02475	PF07021	PF08003	PF05148	PF01795	PF02390	PF01596	PF00891	PF09445	PF08242	PF08241	PF05971	PF02086	PF02527	PF08704	PF01728	PF01269	PF07669	PF06080	PF05891	PF05430	PF04816	PF04672	PF04445	PF04378	PF01861	PF03269	PF03141	PF07757	PF07279	PF05219	PF08123	PF00145	PF03602	PF02353	PF01739	PF06859	PF09243	PF01564	PF03848	PF05724	PF02005	PF05958	PF01209	PF01170

	the two keywords coincide on Uniref90 proteins: |PF01170| = 283 ,	 |PF01555| = 532 , 	|PF01170^PF01555| = 1	( 0.4% and 0.2% )

	only  PF01555 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 519 ) 6751290_PF02207_PF02617            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02617  is 6559223 with Jaccard = 0.9137	|PF02617|=197 [ 180	0	1100014	17 ]
 parent [ 6559223 ] : 6751290 0.0161439 (=559/(199*174)) 98.718
  given [ 6559223 ] : 6559223 0.593056 (=4697/(144*55)) 46.0208
	best keyword for cluster 6559223 is PF02617 with Jaccard = 0.9137	[ 180	0	1100014	17 ]	1.0000 0.9137
sibling [ 6559223 ] : 6723009 0.0523725 (=383/(71*103)) 95.8572
	best keyword for cluster 6723009 is PF02207 with Jaccard = 0.8106	[ 107	9	1100079	16 ]	0.9224 0.8699

	SUGGESTING RELATEDNESS OF:   
	A> PF02617 ( PF02617   ATP-dependent Clp protease adaptor protein ClpS )
	B> PF02207 ( PF02207   Putative zinc finger in N-recognin (UBR box) )

	Neither A nor B are assigned a clan.

	the two keywords coincide on Uniref90 proteins: |PF02207| = 123 ,	 |PF02617| = 197 , 	|PF02207^PF02617| = 15	( 12.2% and 7.6% )

	only  PF02617 has a PDB structure (may not be up to date)
	PF02617 	   d.45.1.2
	
	SUPERFAM mapping significantly overlapping:
     1	PF02617 SSF54736 0.941 (average over 609 mutual instances, PF02617 612 appearances, SSF54736 1680 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 520 ) 6694609_PF00809_PF01288            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01288  is 6568011 with Jaccard = 0.9118	|PF01288|=306 [ 279	0	1099905	27 ]
 parent [ 6568011 ] : 6694609 0.0884057 (=10681/(313*386)) 91.2336
  given [ 6568011 ] : 6568011 0.531169 (=818/(5*308)) 50.5696
	best keyword for cluster 6568011 is PF01288 with Jaccard = 0.9118	[ 279	0	1099905	27 ]	1.0000 0.9118
sibling [ 6568011 ] : 6680245 0.137511 (=158/(3*383)) 88.2278
	best keyword for cluster 6680245 is PF00809 with Jaccard = 0.6455	[ 335	1	1099692	183 ]	0.9970 0.6467

	SUGGESTING RELATEDNESS OF:   
	A> PF01288 ( PF01288   7,8-dihydro-6-hydroxymethylpterin-pyrophosphokinase (HPPK) )
	B> PF00809 ( PF00809   Pterin binding enzyme )

	Neither A nor B are assigned a clan.

	the two keywords coincide on Uniref90 proteins: |PF00809| = 518 ,	 |PF01288| = 306 , 	|PF00809^PF01288| = 34	( 6.6% and 11.1% )

	both PF01288 and PF00809 have PDB structures
	PF01288 	   d.58.30.1
	
	SUPERFAM mapping significantly overlapping:
     1	PF00809 SSF51717 0.783 (average over 1837 mutual instances, PF00809 3831 appearances, SSF51717 4784 appearances)
     2	PF01288 SSF55083 0.821 (average over 930 mutual instances, PF01288 1027 appearances, SSF55083 1126 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 521 ) 6732270_PF01061_PF03379            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF03379  is 6606312 with Jaccard = 0.9115	|PF03379|=110 [ 103	3	1100098	7 ]
 parent [ 6606312 ] : 6732270 0.0383941 (=7443/(122*1589)) 96.9763
  given [ 6606312 ] : 6606312 0.377679 (=423/(10*112)) 64.7614
	best keyword for cluster 6606312 is PF03379 with Jaccard = 0.9115	[ 103	3	1100098	7 ]	0.9717 0.9364
sibling [ 6606312 ] : 6703361 0.0853535 (=39843/(1200*389)) 92.7892
	best keyword for cluster 6703361 is PF01061 with Jaccard = 0.6168	[ 1109	8	1098413	681 ]	0.9928 0.6196

	SUGGESTING RELATEDNESS OF:   
	A> PF03379 ( PF03379   CcmB protein )
	B> PF01061 ( PF01061   ABC-2 type transporter )

	they come from the same clan: CL0181.5 : PF01061	PF03379	PF06182
	the two keywords do not coincide on UniRef90 proteins

	Neither PF03379 nor PF01061 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 522 ) 6762033_PF00432_PF03936            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF03936  is 6756742 with Jaccard = 0.9103	|PF03936|=280 [ 274	21	1099910	6 ]
 parent [ 6756742 ] : 6762033 0.00836484 (=1593/(360*529)) 99.3721
  given [ 6756742 ] : 6756742 0.0121238 (=224/(62*298)) 99.0753
	best keyword for cluster 6756742 is PF03936 with Jaccard = 0.9103	[ 274	21	1099910	6 ]	0.9288 0.9786
sibling [ 6756742 ] : 6757010 0.0141383 (=137/(19*510)) 99.0916
	best keyword for cluster 6757010 is PF00432 with Jaccard = 0.9106	[ 336	14	1099842	19 ]	0.9600 0.9465

	SUGGESTING RELATEDNESS OF:   
	A> PF03936 ( PF03936   Terpene synthase family, metal binding domain )
	B> PF00432 ( PF00432   Prenyltransferase and squalene oxidase repeat )

	Neither A nor B are assigned a clan.

	the two keywords coincide on Uniref90 proteins: |PF00432| = 355 ,	 |PF03936| = 280 , 	|PF00432^PF03936| = 2	( 0.6% and 0.7% )

	both PF03936 and PF00432 have PDB structures
		
	SUPERFAM mapping significantly overlapping:
     1	PF03936 SSF48576 0.798 (average over 833 mutual instances, PF03936 1514 appearances, SSF48576 4885 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 523 ) 6754976_PF00160_PF00254            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00254  is 6723775 with Jaccard = 0.9093	|PF00254|=1037 [ 972	32	1099142	65 ]
 parent [ 6723775 ] : 6754976 0.010703 (=13835/(1125*1149)) 98.9684
  given [ 6723775 ] : 6723775 0.0520375 (=710/(12*1137)) 95.9682
	best keyword for cluster 6723775 is PF00254 with Jaccard = 0.9093	[ 972	32	1099142	65 ]	0.9681 0.9373
sibling [ 6723775 ] : 6722487 0.0440123 (=2139/(45*1080)) 95.7861
	best keyword for cluster 6722487 is PF00160 with Jaccard = 0.9289	[ 980	34	1099156	41 ]	0.9665 0.9598

	SUGGESTING RELATEDNESS OF:   
	A> PF00254 ( PF00254   FKBP-type peptidyl-prolyl cis-trans isomerase )
	B> PF00160 ( PF00160   Cyclophilin type peptidyl-prolyl cis-trans isomerase/CLD )

	Neither A nor B are assigned a clan.

	the two keywords coincide on Uniref90 proteins: |PF00160| = 1021 ,	 |PF00254| = 1037 , 	|PF00160^PF00254| = 13	( 1.3% and 1.3% )

	both PF00254 and PF00160 have PDB structures
	PF00254 	   d.26.1.1
	
	SUPERFAM mapping significantly overlapping:
     1	PF00160 SSF50891 0.950 (average over 2953 mutual instances, PF00160 3038 appearances, SSF50891 3533 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 524 ) 6677258_PF02048_PF02058            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02048  is 6431160 with Jaccard = 0.9091	|PF02048|=11 [ 10	0	1100200	1 ]
 parent [ 6431160 ] : 6677258 0.235294 (=40/(10*17)) 87.5005
  given [ 6431160 ] : 6431160 1 (=24/(4*6)) 0.315756
	best keyword for cluster 6431160 is PF02048 with Jaccard = 0.9091	[ 10	0	1100200	1 ]	1.0000 0.9091
sibling [ 6431160 ] : 6562710 0.5625 (=9/(1*16)) 49.1244
	best keyword for cluster 6562710 is PF02058 with Jaccard = 1.0000	[ 16	0	1100195	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF02048 ( PF02048   Heat-stable enterotoxin )
	B> PF02058 ( PF02058   Guanylin precursor )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	both PF02048 and PF02058 have PDB structures
		PF02058 	   d.234.1.1

	SUPERFAM mapping significantly overlapping:
     1	PF02058 SSF89890 0.800 (average over 26 mutual instances, PF02058 26 appearances, SSF89890 27 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 525 ) 6715034_PF03382_PF05215            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF03382  is 6659162 with Jaccard = 0.9091	|PF03382|=97 [ 90	2	1100112	7 ]
 parent [ 6659162 ] : 6715034 0.0605546 (=428/(114*62)) 94.7298
  given [ 6659162 ] : 6659162 0.174383 (=113/(6*108)) 83.1401
	best keyword for cluster 6659162 is PF03382 with Jaccard = 0.9091	[ 90	2	1100112	7 ]	0.9783 0.9278
sibling [ 6659162 ] : 6708330 0.0859539 (=41/(9*53)) 93.6908
	best keyword for cluster 6708330 is PF05215 with Jaccard = 0.8333	[ 5	1	1100205	0 ]	0.8333 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF03382 ( PF03382   Mycoplasma protein of unknown function, DUF285 )
	B> PF05215 ( PF05215   Spiralin )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF03382 nor PF05215 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 526 ) 6728401_PF04208_PF04210            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04210  is 6427583 with Jaccard = 0.9091	|PF04210|=11 [ 10	0	1100200	1 ]
 parent [ 6427583 ] : 6728401 0.0354839 (=11/(10*31)) 96.5323
  given [ 6427583 ] : 6427583 1 (=9/(1*9)) 0.225379
	best keyword for cluster 6427583 is PF04210 with Jaccard = 0.9091	[ 10	0	1100200	1 ]	1.0000 0.9091
sibling [ 6427583 ] : 6689018 0.10101 (=20/(22*9)) 90.0609
	best keyword for cluster 6689018 is PF04208 with Jaccard = 1.0000	[ 21	0	1100190	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF04210 ( PF04210   Tetrahydromethanopterin S-methyltransferase, subunit G )
	B> PF04208 ( PF04208   Tetrahydromethanopterin S-methyltransferase, subunit A )

	Neither A nor B are assigned a clan.

	the two keywords coincide on UniRef90 proteins: |PF04208| = 21 ,	 |PF04210| = 11 , 	|PF04208^PF04210| = 1	( 4.8% and 9.1% )
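
	The two percentages in parentheses appear to be the shared protein count divided by each keyword's own size. A minimal sketch of that arithmetic follows; the function name overlap_percentages is hypothetical and the one-decimal rounding is an assumption based on the printed values.

```python
def overlap_percentages(size_a, size_b, shared):
    """Percentage of each keyword's proteins that also carry the other keyword."""
    pct_a = 100 * shared / size_a
    pct_b = 100 * shared / size_b
    # The report prints one decimal place.
    return round(pct_a, 1), round(pct_b, 1)


# |PF04208| = 21, |PF04210| = 11, one shared protein.
print(overlap_percentages(21, 11, 1))
```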

	Neither PF04210 nor PF04208 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 527 ) 6516428_PF05279_PF07169            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF07169  is 6442429 with Jaccard = 0.9091	|PF07169|=11 [ 10	0	1100200	1 ]
 parent [ 6442429 ] : 6516428 0.836842 (=159/(10*19)) 18.2885
  given [ 6442429 ] : 6442429 1 (=9/(1*9)) 0.79231
	best keyword for cluster 6442429 is PF07169 with Jaccard = 0.9091	[ 10	0	1100200	1 ]	1.0000 0.9091
sibling [ 6442429 ] : 6499147 0.892857 (=75/(7*12)) 11.0598
	best keyword for cluster 6499147 is PF05279 with Jaccard = 0.8261	[ 19	0	1100188	4 ]	1.0000 0.8261

	SUGGESTING RELATEDNESS OF:   
	A> PF07169 ( PF07169   Triadin )
	B> PF05279 ( PF05279   Aspartyl beta-hydroxylase N-terminal region )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF07169 nor PF05279 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 528 ) 6656573_PF01625_PF01641            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01625  is 6609366 with Jaccard = 0.9089	|PF01625|=428 [ 389	0	1099783	39 ]
 parent [ 6609366 ] : 6656573 0.178414 (=26683/(431*347)) 82.2245
  given [ 6609366 ] : 6609366 0.416279 (=179/(1*430)) 66.6318
	best keyword for cluster 6609366 is PF01625 with Jaccard = 0.9089	[ 389	0	1099783	39 ]	1.0000 0.9089
sibling [ 6609366 ] : 6654707 0.229717 (=470/(6*341)) 81.6259
	best keyword for cluster 6654707 is PF01641 with Jaccard = 0.8918	[ 305	0	1099869	37 ]	1.0000 0.8918
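
	The parent / given / sibling lines follow the layout described in the legend: relation [ best ] : cluster %LINKAGE (=edges/(|cluster1|*|cluster2|)) ProtoLevel. The sketch below is one possible way to parse such a line and recompute the linkage from its own fraction. The regular expression, the function name parse_relation and the dictionary keys are all illustrative assumptions and are not part of the tool that produced this report.

```python
import re

# Matches lines such as
#   " parent [ 6609366 ] : 6656573 0.178414 (=26683/(431*347)) 82.2245"
RELATION_LINE = re.compile(
    r"^\s*(?P<relation>parent|given|sibling)\s+\[\s*(?P<best>\d+)\s*\]\s*:\s*"
    r"(?P<cluster>\d+)\s+(?P<linkage>\S+)\s+"
    r"\(=(?P<edges>\d+)/\((?P<n1>\d+)\*(?P<n2>\d+)\)\)\s+(?P<level>\S+)\s*$"
)


def parse_relation(line):
    """Return the fields of one relation line as a dict, or None if it does not match."""
    m = RELATION_LINE.match(line)
    if m is None:
        return None
    edges, n1, n2 = int(m["edges"]), int(m["n1"]), int(m["n2"])
    return {
        "relation": m["relation"],
        "best_cluster": int(m["best"]),
        "cluster": int(m["cluster"]),
        "linkage": float(m["linkage"]),            # value as printed
        "recomputed_linkage": edges / (n1 * n2),   # existing edges / possible edges
        "size": n1 + n2,                           # cluster size per the legend
        "proto_level": float(m["level"]),
    }


rec = parse_relation(" parent [ 6609366 ] : 6656573 0.178414 (=26683/(431*347)) 82.2245")
print(rec["size"], rec["linkage"], rec["recomputed_linkage"])
```

	Comparing linkage with recomputed_linkage is a cheap consistency check when records are edited by hand.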

	SUGGESTING RELATEDNESS OF:   
	A> PF01625 ( PF01625   Peptide methionine sulfoxide reductase )
	B> PF01641 ( PF01641   SelR domain )

	Only B has a clan ( CL0080.7 ).

	the two keywords coincide on UniRef90 proteins: |PF01625| = 428 ,	 |PF01641| = 342 , 	|PF01625^PF01641| = 69	( 16.1% and 20.2% )

	both PF01625 and PF01641 have PDB structures
		
	SUPERFAM mapping significantly overlapping:
     1	PF01641 SSF51316 0.909 (average over 1096 mutual instances, PF01641 1348 appearances, SSF51316 1572 appearances)
     2	PF01625 SSF55068 0.877 (average over 1339 mutual instances, PF01625 1591 appearances, SSF55068 1587 appearances)
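
	Each SUPERFAM mapping line gives a Pfam id, an SSF id, the averaged agreement, and three counts. The sketch below shows one way such a line could be read into a record. The pattern, the function name parse_mapping and the derived field pfam_fraction (mutual instances divided by Pfam appearances) are illustrative assumptions; pfam_fraction in particular is not a value reported anywhere in this file.

```python
import re

# Matches lines such as
#   "     1\tPF01641 SSF51316 0.909 (average over 1096 mutual instances, "
#   "PF01641 1348 appearances, SSF51316 1572 appearances)"
MAPPING_LINE = re.compile(
    r"^\s*\d+\s+(?P<pfam>PF\d+)\s+(?P<ssf>SSF\d+)\s+(?P<agreement>[\d.]+)\s+"
    r"\(average over (?P<mutual>\d+) mutual instances, "
    r"(?P=pfam) (?P<pfam_n>\d+) appearances, "
    r"(?P=ssf) (?P<ssf_n>\d+) appearances\)\s*$"
)


def parse_mapping(line):
    """Return the fields of one Pfam-to-SSF mapping line, or None if it does not match."""
    m = MAPPING_LINE.match(line)
    if m is None:
        return None
    mutual, pfam_n, ssf_n = int(m["mutual"]), int(m["pfam_n"]), int(m["ssf_n"])
    return {
        "pfam": m["pfam"],
        "ssf": m["ssf"],
        "agreement": float(m["agreement"]),
        "mutual": mutual,
        "pfam_appearances": pfam_n,
        "ssf_appearances": ssf_n,
        # Derived for illustration only: share of Pfam appearances that are mutual.
        "pfam_fraction": mutual / pfam_n,
    }


line = ("     1\tPF01641 SSF51316 0.909 (average over 1096 mutual instances, "
        "PF01641 1348 appearances, SSF51316 1572 appearances)")
print(parse_mapping(line))
```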


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 529 ) 6686617_PF00476_PF00752            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00752  is 6682097 with Jaccard = 0.9076	|PF00752|=218 [ 216	20	1099973	2 ]
 parent [ 6682097 ] : 6686617 0.12149 (=19501/(615*261)) 89.5745
  given [ 6682097 ] : 6682097 0.121622 (=63/(2*259)) 88.7149
	best keyword for cluster 6682097 is PF00752 with Jaccard = 0.9076	[ 216	20	1099973	2 ]	0.9153 0.9908
sibling [ 6682097 ] : 6665138 0.159041 (=292/(3*612)) 84.293
	best keyword for cluster 6665138 is PF00476 with Jaccard = 0.7633	[ 416	114	1099666	15 ]	0.7849 0.9652

	SUGGESTING RELATEDNESS OF:   
	A> PF00752 ( PF00752   XPG N-terminal domain )
	B> PF00476 ( PF00476   DNA polymerase family A )

	Only A has a clan ( CL0280.2 ).
	the two keywords do not coincide on UniRef90 proteins

	both PF00752 and PF00476 have PDB structures
		PF00476 	   e.8.1.1

	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 530 ) 6674654_PF01600_PF01601            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01600  is 6597009 with Jaccard = 0.9071	|PF01600|=183 [ 166	0	1100028	17 ]
 parent [ 6597009 ] : 6674654 0.141921 (=1061/(178*42)) 86.803
  given [ 6597009 ] : 6597009 0.397661 (=476/(7*171)) 60.2341
	best keyword for cluster 6597009 is PF01600 with Jaccard = 0.9071	[ 166	0	1100028	17 ]	1.0000 0.9071
sibling [ 6597009 ] : 6651052 0.203125 (=65/(32*10)) 80.3359
	best keyword for cluster 6651052 is PF01601 with Jaccard = 0.6667	[ 30	4	1100166	11 ]	0.8824 0.7317

	SUGGESTING RELATEDNESS OF:   
	A> PF01600 ( PF01600   Coronavirus S1 glycoprotein )
	B> PF01601 ( PF01601   Coronavirus S2 glycoprotein )

	Neither A nor B are assigned a clan.

	the two keywords coincide on UniRef90 proteins: |PF01600| = 183 ,	 |PF01601| = 41 , 	|PF01600^PF01601| = 20	( 10.9% and 48.8% )

	only  PF01600 has a PDB structure (may not be up to date)
		PF01601 	   h.3.3.1

	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 531 ) 6635535_PF03741_PF04332            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04332  is 6246423 with Jaccard = 0.9070	|PF04332|=43 [ 39	0	1100168	4 ]
 parent [ 6246423 ] : 6635535 0.29117 (=4620/(41*387)) 75.885
  given [ 6246423 ] : 6246423 1 (=78/(2*39)) 1.32111e-13
	best keyword for cluster 6246423 is PF04332 with Jaccard = 0.9070	[ 39	0	1100168	4 ]	1.0000 0.9070
sibling [ 6246423 ] : 6619220 0.316062 (=122/(1*386)) 70.0756
	best keyword for cluster 6619220 is PF03741 with Jaccard = 0.9971	[ 339	0	1099871	1 ]	1.0000 0.9971

	SUGGESTING RELATEDNESS OF:   
	A> PF04332 ( PF04332   Protein of unknown function (DUF475) )
	B> PF03741 ( PF03741   Integral membrane protein TerC family )

	Only A has a clan ( CL0015.13 ).
	the two keywords do not coincide on UniRef90 proteins

	Neither PF04332 nor PF03741 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:
     1	PF04332 SSF103473 0.864 (average over 16 mutual instances, PF04332 17 appearances, SSF103473 39293 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 532 ) 6750299_PF01796_PF07431            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01796  is 6715575 with Jaccard = 0.9048	|PF01796|=189 [ 171	0	1100022	18 ]
 parent [ 6715575 ] : 6750299 0.0163317 (=52/(199*16)) 98.6405
  given [ 6715575 ] : 6715575 0.0608466 (=115/(189*10)) 94.8109
	best keyword for cluster 6715575 is PF01796 with Jaccard = 0.9048	[ 171	0	1100022	18 ]	1.0000 0.9048
sibling [ 6715575 ] : 6731553 0.047619 (=3/(9*7)) 96.8921
	best keyword for cluster 6731553 is PF07431 with Jaccard = 0.8750	[ 7	0	1100203	1 ]	1.0000 0.8750

	SUGGESTING RELATEDNESS OF:   
	A> PF01796 ( PF01796   Domain of unknown function DUF35 )
	B> PF07431 ( PF07431   Protein of unknown function (DUF1512) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF01796 nor PF07431 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 533 ) 6684095_PF00741_PF05121            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF05121  is 6505331 with Jaccard = 0.9048	|PF05121|=21 [ 19	0	1100190	2 ]
 parent [ 6505331 ] : 6684095 0.113997 (=158/(21*66)) 89.1211
  given [ 6505331 ] : 6505331 0.9 (=18/(1*20)) 13.4524
	best keyword for cluster 6505331 is PF05121 with Jaccard = 0.9048	[ 19	0	1100190	2 ]	1.0000 0.9048
sibling [ 6505331 ] : 6632731 0.25 (=32/(2*64)) 75.3185
	best keyword for cluster 6632731 is PF00741 with Jaccard = 0.9206	[ 58	0	1100148	5 ]	1.0000 0.9206

	SUGGESTING RELATEDNESS OF:   
	A> PF05121 ( PF05121   Gas vesicle protein K )
	B> PF00741 ( PF00741   Gas vesicle protein )

	Neither A nor B are assigned a clan.

	the two keywords coincide on UniRef90 proteins: |PF00741| = 63 ,	 |PF05121| = 21 , 	|PF00741^PF05121| = 5	( 7.9% and 23.8% )

	Neither PF05121 nor PF00741 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 534 ) 6751190_PF01074_PF03065            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF07748  is 6728822 with Jaccard = 0.9045	|PF07748|=163 [ 161	15	1100033	2 ]
 parent [ 6728822 ] : 6751190 0.0169542 (=517/(158*193)) 98.7098
  given [ 6728822 ] : 6728822 0.0351064 (=33/(188*5)) 96.5871
	best keyword for cluster 6728822 is PF01074 with Jaccard = 0.9435	[ 167	9	1100034	1 ]	0.9489 0.9940
sibling [ 6728822 ] : 6720735 0.0449561 (=41/(6*152)) 95.5102
	best keyword for cluster 6720735 is PF03065 with Jaccard = 0.9500	[ 133	0	1100071	7 ]	1.0000 0.9500

	SUGGESTING RELATEDNESS OF:   
	A> PF01074 ( PF01074   Glycosyl hydrolases family 38 N-terminal domain )
	B> PF03065 ( PF03065   Glycosyl hydrolase family 57 )

	they come from the same clan: CL0158.6 : PF01074	PF03065	PF01522
	the two keywords do not coincide on UniRef90 proteins

	both PF01074 and PF03065 have PDB structures
	PF01074 	   c.6.2.1
	PF03065 	   c.6.2.2

	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 535 ) 6750289_PF01427_PF02557            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02557  is 6723467 with Jaccard = 0.9036	|PF02557|=160 [ 150	6	1100045	10 ]
 parent [ 6723467 ] : 6750289 0.0186265 (=377/(88*230)) 98.6391
  given [ 6723467 ] : 6723467 0.0500424 (=649/(131*99)) 95.9206
	best keyword for cluster 6723467 is PF02557 with Jaccard = 0.9036	[ 150	6	1100045	10 ]	0.9615 0.9375
sibling [ 6723467 ] : 6602156 0.402299 (=35/(1*87)) 62.7467
	best keyword for cluster 6602156 is PF01427 with Jaccard = 0.9512	[ 78	2	1100129	2 ]	0.9750 0.9750

	SUGGESTING RELATEDNESS OF:   
	A> PF02557 ( PF02557   D-alanyl-D-alanine carboxypeptidase )
	B> PF01427 ( PF01427   D-ala-D-ala dipeptidase )

	they come from the same clan: CL0170.6 : PF01085	PF01427	PF05951	PF08291	PF03411	PF02557
	the two keywords do not coincide on UniRef90 proteins

	both PF02557 and PF01427 have PDB structures
		
	SUPERFAM mapping significantly overlapping:
     1	PF02557 SSF55166 0.819 (average over 234 mutual instances, PF02557 248 appearances, SSF55166 1247 appearances)
     2	PF01427 SSF55166 0.969 (average over 259 mutual instances, PF01427 262 appearances, SSF55166 1247 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 536 ) 6696267_PF06225_PF06227            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF06225  is 6649280 with Jaccard = 0.9032	|PF06225|=28 [ 28	3	1100180	0 ]
 parent [ 6649280 ] : 6696267 0.101307 (=31/(34*9)) 91.6104
  given [ 6649280 ] : 6649280 0.223443 (=61/(13*21)) 79.8006
	best keyword for cluster 6649280 is PF06225 with Jaccard = 0.9032	[ 28	3	1100180	0 ]	0.9032 1.0000
sibling [ 6649280 ] : 6663638 0.25 (=2/(1*8)) 84
	best keyword for cluster 6663638 is PF06227 with Jaccard = 1.0000	[ 3	0	1100208	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF06225 ( PF06225   Poxvirus A4/B15 family )
	B> PF06227 ( PF06227   Orthopoxvirus N1 protein )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF06225 nor PF06227 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 537 ) 6729962_PF00457_PF01522            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01522  is 6718873 with Jaccard = 0.9029	|PF01522|=911 [ 828	6	1099294	83 ]
 parent [ 6718873 ] : 6729962 0.0338904 (=4490/(142*933)) 96.7181
  given [ 6718873 ] : 6718873 0.0529158 (=343/(7*926)) 95.2614
	best keyword for cluster 6718873 is PF01522 with Jaccard = 0.9029	[ 828	6	1099294	83 ]	0.9928 0.9089
sibling [ 6718873 ] : 6708529 0.0835979 (=79/(135*7)) 93.7245
	best keyword for cluster 6708529 is PF00457 with Jaccard = 0.9853	[ 134	1	1100075	1 ]	0.9926 0.9926

	SUGGESTING RELATEDNESS OF:   
	A> PF01522 ( PF01522   Polysaccharide deacetylase )
	B> PF00457 ( PF00457   Glycosyl hydrolases family 11 )

	A and B come from different clans ( CL0158.6 , CL0004.14 ).

	the two keywords coincide on UniRef90 proteins: |PF00457| = 135 ,	 |PF01522| = 911 , 	|PF00457^PF01522| = 7	( 5.2% and 0.8% )

	both PF01522 and PF00457 have PDB structures
		PF00457 	   b.29.1.11

	SUPERFAM mapping significantly overlapping:
     1	PF00457 SSF49899 0.930 (average over 314 mutual instances, PF00457 406 appearances, SSF49899 14070 appearances)
     2	PF01522 SSF88713 0.586 (average over 2652 mutual instances, PF01522 2828 appearances, SSF88713 4598 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 538 ) 6605561_PF00441_PF01756            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01756  is 6602354 with Jaccard = 0.9028	|PF01756|=130 [ 130	14	1100067	0 ]
 parent [ 6602354 ] : 6605561 0.382416 (=125697/(2107*156)) 64.4188
  given [ 6602354 ] : 6602354 0.387778 (=349/(150*6)) 62.9528
	best keyword for cluster 6602354 is PF01756 with Jaccard = 0.9028	[ 130	14	1100067	0 ]	0.9028 1.0000
sibling [ 6602354 ] : 6581114 0.516144 (=1087/(1*2106)) 54.0987
	best keyword for cluster 6581114 is PF00441 with Jaccard = 0.9416	[ 1870	30	1098225	86 ]	0.9842 0.9560

	SUGGESTING RELATEDNESS OF:   
	A> PF01756 ( PF01756   Acyl-CoA oxidase )
	B> PF00441 ( PF00441   Acyl-CoA dehydrogenase, C-terminal domain )

	they come from the same clan: CL0087.7 : PF01756	PF00441	PF08028

	the two keywords coincide on UniRef90 proteins: |PF00441| = 1956 ,	 |PF01756| = 130 , 	|PF00441^PF01756| = 53	( 2.7% and 40.8% )

	both PF01756 and PF00441 have PDB structures
	PF01756 	   a.29.3.2
	PF00441 	   a.29.3.1

	SUPERFAM mapping significantly overlapping:
     1	PF00441 SSF47203 0.910 (average over 6570 mutual instances, PF00441 13147 appearances, SSF47203 17996 appearances)
     2	PF01756 SSF47203 0.935 (average over 253 mutual instances, PF01756 485 appearances, SSF47203 17996 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 539 ) 6608783_PF00135_PF07859            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00135  is 6582935 with Jaccard = 0.9021	|PF00135|=797 [ 719	0	1099414	78 ]
 parent [ 6582935 ] : 6608783 0.373768 (=260498/(747*933)) 66.4102
  given [ 6582935 ] : 6582935 0.520805 (=776/(2*745)) 54.6842
	best keyword for cluster 6582935 is PF00135 with Jaccard = 0.9021	[ 719	0	1099414	78 ]	1.0000 0.9021
sibling [ 6582935 ] : 6597184 0.440143 (=1228/(3*930)) 60.3925
	best keyword for cluster 6597184 is PF07859 with Jaccard = 0.9227	[ 716	40	1099435	20 ]	0.9471 0.9728

	SUGGESTING RELATEDNESS OF:   
	A> PF00135 ( PF00135   Carboxylesterase )
	B> PF07859 ( PF07859   alpha/beta hydrolase fold )

	they come from the same clan: CL0028.14 : PF05728	PF00975	PF07519	PF06850	PF07819	PF00326	PF05576	PF05577	PF02129	PF00450	PF02089	PF03403	PF03096	PF01764	PF01674	PF00151	PF03583	PF02450	PF03959	PF00756	PF06028	PF05990	PF05677	PF05057	PF04301	PF08538	PF07176	PF06821	PF06500	PF06342	PF06259	PF01738	PF01083	PF00135	PF07224	PF08840	PF05448	PF02273	PF08386	PF07859	PF02230	PF00561	PF06057
	the two keywords do not coincide on UniRef90 proteins

	both PF00135 and PF07859 have PDB structures
	PF00135 	   c.69.1.1  c.69.1.17
	PF07859 	   c.69.1.2

	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 540 ) 6759091_PF01637_PF06846            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01637  is 6740504 with Jaccard = 0.9016	|PF01637|=168 [ 165	15	1100028	3 ]
 parent [ 6740504 ] : 6759091 0.0113533 (=188/(29*571)) 99.2146
  given [ 6740504 ] : 6740504 0.0292702 (=2371/(263*308)) 97.8279
	best keyword for cluster 6740504 is PF01637 with Jaccard = 0.9016	[ 165	15	1100028	3 ]	0.9167 0.9821
sibling [ 6740504 ] : 6672132 0.171569 (=35/(12*17)) 86.0559
	best keyword for cluster 6672132 is PF06846 with Jaccard = 1.0000	[ 11	0	1100200	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF01637 ( PF01637   Archaeal ATPase )
	B> PF06846 ( PF06846   Protein of unknown function (DUF1245) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF01637 nor PF06846 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 541 ) 6696840_PF03400_PF03811            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF03400  is 6690597 with Jaccard = 0.9014	|PF03400|=69 [ 64	2	1100140	5 ]
 parent [ 6690597 ] : 6696840 0.089158 (=773/(102*85)) 91.7053
  given [ 6690597 ] : 6690597 0.130952 (=11/(1*84)) 90.3792
	best keyword for cluster 6690597 is PF03400 with Jaccard = 0.9014	[ 64	2	1100140	5 ]	0.9697 0.9275
sibling [ 6690597 ] : 6648687 0.237403 (=457/(77*25)) 79.5642
	best keyword for cluster 6648687 is PF03811 with Jaccard = 0.8197	[ 50	5	1100150	6 ]	0.9091 0.8929

	SUGGESTING RELATEDNESS OF:   
	A> PF03400 ( PF03400   IS1 transposase )
	B> PF03811 ( PF03811   Insertion element protein )

	Only B has a clan ( CL0123.12 ).

	the two keywords coincide on UniRef90 proteins: |PF03400| = 69 ,	 |PF03811| = 56 , 	|PF03400^PF03811| = 2	( 2.9% and 3.6% )

	Neither PF03400 nor PF03811 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 542 ) 6723432_PF02275_PF03417            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF03417  is 6667352 with Jaccard = 0.9000	|PF03417|=50 [ 45	0	1100161	5 ]
 parent [ 6667352 ] : 6723432 0.0491453 (=253/(99*52)) 95.9147
  given [ 6667352 ] : 6667352 0.171852 (=116/(27*25)) 84.8142
	best keyword for cluster 6667352 is PF03417 with Jaccard = 0.9000	[ 45	0	1100161	5 ]	1.0000 0.9000
sibling [ 6667352 ] : 6559634 0.551546 (=107/(2*97)) 46.4772
	best keyword for cluster 6559634 is PF02275 with Jaccard = 0.7699	[ 87	0	1100098	26 ]	1.0000 0.7699

	SUGGESTING RELATEDNESS OF:   
	A> PF03417 ( PF03417   Acyl-coenzyme A:6-aminopenicillanic acid acyl-transferase )
	B> PF02275 ( PF02275   Linear amide C-N hydrolases, choloylglycine hydrolase family )

	they come from the same clan: CL0052.11 : PF00227	PF03577	PF01804	PF01019	PF00310	PF02275	PF01112	PF03417
	the two keywords do not coincide on UniRef90 proteins

	only  PF03417 has a PDB structure (may not be up to date)
		PF02275 	   d.153.1.3

	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 543 ) 6693364_PF00023_PF01412            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01412  is 6678069 with Jaccard = 0.8989	|PF01412|=351 [ 329	15	1099845	22 ]
 parent [ 6678069 ] : 6693364 0.101111 (=146245/(375*3857)) 90.9881
  given [ 6678069 ] : 6678069 0.136792 (=203/(4*371)) 87.6937
	best keyword for cluster 6678069 is PF01412 with Jaccard = 0.8989	[ 329	15	1099845	22 ]	0.9564 0.9373
sibling [ 6678069 ] : 6691620 0.0987318 (=8330/(22*3835)) 90.6005
	best keyword for cluster 6691620 is PF00023 with Jaccard = 0.7263	[ 3124	223	1095910	954 ]	0.9334 0.7661

	SUGGESTING RELATEDNESS OF:   
	A> PF01412 ( PF01412   Putative GTPase activating protein for Arf )
	B> PF00023 ( PF00023   Ankyrin repeat )

	Neither A nor B are assigned a clan.

	the two keywords coincide on UniRef90 proteins: |PF00023| = 4078 ,	 |PF01412| = 351 , 	|PF00023^PF01412| = 87	( 2.1% and 24.8% )

	both PF01412 and PF00023 have PDB structures
	PF01412 	   g.45.1.1
	PF00023 	   d.211.1.1  i.11.1.1

	SUPERFAM mapping significantly overlapping:
     1	PF01412 SSF57863 0.943 (average over 744 mutual instances, PF01412 1021 appearances, SSF57863 1344 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 544 ) 6708789_PF02774_PF02800            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02774  is 6592245 with Jaccard = 0.8988	|PF02774|=521 [ 506	42	1099648	15 ]
 parent [ 6592245 ] : 6708789 0.0773083 (=44486/(607*948)) 93.7601
  given [ 6592245 ] : 6592245 0.45068 (=40810/(264*343)) 58.0641
	best keyword for cluster 6592245 is PF02774 with Jaccard = 0.8988	[ 506	42	1099648	15 ]	0.9234 0.9712
sibling [ 6592245 ] : 6702179 0.161563 (=153/(1*947)) 92.5745
	best keyword for cluster 6702179 is PF02800 with Jaccard = 0.9434	[ 833	48	1099328	2 ]	0.9455 0.9976

	SUGGESTING RELATEDNESS OF:   
	A> PF02774 ( PF02774   Semialdehyde dehydrogenase, dimerisation domain )
	B> PF02800 ( PF02800   Glyceraldehyde 3-phosphate dehydrogenase, C-terminal domain )

	they come from the same clan: CL0139.6 : PF02800	PF02774
	the two keywords do not coincide on UniRef90 proteins

	both PF02774 and PF02800 have PDB structures
		PF02800 	   d.81.1.1

	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 545 ) 6697766_PF03151_PF08449            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF08449  is 6695887 with Jaccard = 0.8957	|PF08449|=114 [ 103	1	1100096	11 ]
 parent [ 6695887 ] : 6697766 0.100189 (=3400/(303*112)) 91.8457
  given [ 6695887 ] : 6695887 0.0917431 (=30/(3*109)) 91.5001
	best keyword for cluster 6695887 is PF08449 with Jaccard = 0.8957	[ 103	1	1100096	11 ]	0.9904 0.9035
sibling [ 6695887 ] : 6615930 0.334001 (=5167/(65*238)) 68.8752
	best keyword for cluster 6615930 is PF03151 with Jaccard = 0.8301	[ 254	6	1099905	46 ]	0.9769 0.8467

	SUGGESTING RELATEDNESS OF:   
	A> PF08449 ( PF08449   UAA transporter family )
	B> PF03151 ( PF03151   Triose-phosphate Transporter family )

	they come from the same clan: CL0184.5 : PF07857	PF04342	PF00892	PF05653	PF06027	PF00893	PF04142	PF06379	PF06800	PF03151	PF08449	PF02694
	the two keywords do not coincide on UniRef90 proteins

	Neither PF08449 nor PF03151 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 546 ) 6732346_PF02498_PF08346            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02498  is 6725496 with Jaccard = 0.8941	|PF02498|=233 [ 211	3	1099975	22 ]
 parent [ 6725496 ] : 6732346 0.0319589 (=1809/(178*318)) 96.9893
  given [ 6725496 ] : 6725496 0.0459426 (=261/(19*299)) 96.1671
	best keyword for cluster 6725496 is PF02498 with Jaccard = 0.8941	[ 211	3	1099975	22 ]	0.9860 0.9056
sibling [ 6725496 ] : 6707402 0.0669749 (=478/(117*61)) 93.5264
	best keyword for cluster 6707402 is PF08346 with Jaccard = 0.7231	[ 47	16	1100146	2 ]	0.7460 0.9592

	SUGGESTING RELATEDNESS OF:   
	A> PF02498 ( PF02498   BRO family, N-terminal domain )
	B> PF08346 ( PF08346   AntA/AntB antirepressor )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF02498 nor PF08346 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 547 ) 6781628_PF04159_PF04277            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04277  is 6761709 with Jaccard = 0.8929	|PF04277|=55 [ 50	1	1100155	5 ]
 parent [ 6761709 ] : 6781628 0.00047619 (=4/(84*100)) 99.9752
  given [ 6761709 ] : 6761709 0.0108108 (=8/(10*74)) 99.356
	best keyword for cluster 6761709 is PF04277 with Jaccard = 0.8929	[ 50	1	1100155	5 ]	0.9804 0.9091
sibling [ 6761709 ] : 6779296 0.00080289 (=2/(53*47)) 99.9423
	best keyword for cluster 6779296 is PF04159 with Jaccard = 1.0000	[ 4	0	1100207	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF04277 ( PF04277   Oxaloacetate decarboxylase, gamma chain )
	B> PF04159 ( PF04159   NB glycoprotein )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF04277 nor PF04159 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 548 ) 6768152_PF01105_PF04776            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04776  is 6740322 with Jaccard = 0.8929	|PF04776|=27 [ 25	1	1100183	2 ]
 parent [ 6740322 ] : 6768152 0.00428762 (=61/(41*347)) 99.645
  given [ 6740322 ] : 6740322 0.0252101 (=6/(34*7)) 97.8073
	best keyword for cluster 6740322 is PF04776 with Jaccard = 0.8929	[ 25	1	1100183	2 ]	0.9615 0.9259
sibling [ 6740322 ] : 6766010 0.00581395 (=6/(3*344)) 99.5556
	best keyword for cluster 6766010 is PF01105 with Jaccard = 0.6914	[ 177	0	1099955	79 ]	1.0000 0.6914

	SUGGESTING RELATEDNESS OF:   
	A> PF04776 ( PF04776   Protein of unknown function (DUF626) )
	B> PF01105 ( PF01105   emp24/gp25L/p24 family/GOLD )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF04776 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:
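Linkage check for this record. Each relation line reports %LINKAGE as |existing_edges|/(|cluster1|*|cluster2|), as the legend states. The sketch below parses the three relation lines of this record and recomputes that ratio. The regular expression and the names parse_relation and recomputed_linkage are assumptions made for illustration, not the original tooling.

```python
import re

# Matches lines such as
# " parent [ 6740322 ] : 6768152 0.00428762 (=61/(41*347)) 99.645"
RELATION = re.compile(
    r"^\s*(parent|given|sibling)\s*\[\s*(\d+)\s*\]\s*:\s*(\d+)\s+"
    r"([0-9.eE+-]+)\s+\(=(\d+)/\((\d+)\*(\d+)\)\)\s+([0-9.]+)\s*$"
)


def parse_relation(line):
    """Split one relation line into its named fields."""
    m = RELATION.match(line)
    if m is None:
        raise ValueError(f"not a relation line: {line!r}")
    relation, best, cluster, linkage, edges, n1, n2, level = m.groups()
    return {
        "relation": relation, "best": int(best), "cluster": int(cluster),
        "linkage": float(linkage), "edges": int(edges),
        "n1": int(n1), "n2": int(n2), "level": float(level),
    }


def recomputed_linkage(rec):
    """edges / (|cluster1| * |cluster2|), as defined in the legend."""
    return rec["edges"] / (rec["n1"] * rec["n2"])


lines = [
    " parent [ 6740322 ] : 6768152 0.00428762 (=61/(41*347)) 99.645",
    "  given [ 6740322 ] : 6740322 0.0252101 (=6/(34*7)) 97.8073",
    "sibling [ 6740322 ] : 6766010 0.00581395 (=6/(3*344)) 99.5556",
]
records = {rec["relation"]: rec for rec in map(parse_relation, lines)}

for name, rec in records.items():
    print(name, rec["linkage"], round(recomputed_linkage(rec), 8))

# In this record the parent's two factors equal the sizes (n1 + n2)
# of the given and sibling clusters: 34 + 7 = 41 and 3 + 344 = 347.
child_sizes = sorted(records[k]["n1"] + records[k]["n2"] for k in ("given", "sibling"))
parent_factors = sorted((records["parent"]["n1"], records["parent"]["n2"]))
print(child_sizes == parent_factors)
```

The size comparison at the end follows the legend's statement that a cluster's size is |cluster1| + |cluster2|; it is shown only for this record.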






-------------------====== ( 549 ) 6576467_PF02434_PF04646            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02434  is 6535808 with Jaccard = 0.8913	|PF02434|=45 [ 41	1	1100165	4 ]
 parent [ 6535808 ] : 6576467 0.518541 (=867/(38*44)) 52.6529
  given [ 6535808 ] : 6535808 0.730159 (=230/(9*35)) 29.372
	best keyword for cluster 6535808 is PF02434 with Jaccard = 0.8913	[ 41	1	1100165	4 ]	0.9762 0.9111
sibling [ 6535808 ] : 6514429 0.835227 (=294/(22*16)) 17.294
	best keyword for cluster 6514429 is PF04646 with Jaccard = 0.8333	[ 20	0	1100187	4 ]	1.0000 0.8333

	SUGGESTING RELATEDNESS OF:   
	A> PF02434 ( PF02434   Fringe-like )
	B> PF04646 ( PF04646   Protein of unknown function, DUF604 )

	Neither A nor B are assigned a clan.

	the two keywords coincide on Uniref90 proteins: |PF02434| = 45 ,	 |PF04646| = 24 , 	|PF02434^PF04646| = 1	( 2.2% and 4.2% )

	Neither PF02434 nor PF04646 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:
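Co-occurrence check for this record. The two percentages on the "coincide on Uniref90 proteins" line are the shared protein count divided by each keyword's own count. A minimal sketch follows; the function name overlap_percentages is hypothetical, and the Jaccard of the two keyword sets is an extra value that the report itself does not print.

```python
def overlap_percentages(size_a, size_b, shared):
    """Percentage of each keyword's proteins that also carry the other keyword."""
    return 100.0 * shared / size_a, 100.0 * shared / size_b


def keyword_jaccard(size_a, size_b, shared):
    """Jaccard index of the two keyword protein sets (not printed in the report)."""
    return shared / (size_a + size_b - shared)


# |PF02434| = 45, |PF04646| = 24, one shared UniRef90 protein.
pct_a, pct_b = overlap_percentages(45, 24, 1)
print(f"( {pct_a:.1f}% and {pct_b:.1f}% )")  # prints "( 2.2% and 4.2% )"
print(f"{keyword_jaccard(45, 24, 1):.4f}")   # prints "0.0147"
```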






-------------------====== ( 550 ) 6664257_PF00430_PF05103            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00430  is 6615487 with Jaccard = 0.8909	|PF00430|=394 [ 351	0	1099817	43 ]
 parent [ 6615487 ] : 6664257 0.193881 (=17160/(406*218)) 84.1173
  given [ 6615487 ] : 6615487 0.335271 (=7615/(67*339)) 68.6452
	best keyword for cluster 6615487 is PF00430 with Jaccard = 0.8909	[ 351	0	1099817	43 ]	1.0000 0.8909
sibling [ 6615487 ] : 6642868 0.269444 (=1843/(180*38)) 77.9038
	best keyword for cluster 6642868 is PF05103 with Jaccard = 0.8919	[ 99	7	1100100	5 ]	0.9340 0.9519

	SUGGESTING RELATEDNESS OF:   
	A> PF00430 ( PF00430   ATP synthase B/B' CF(0) )
	B> PF05103 ( PF05103   DivIVA protein )

	Only A has a clan ( CL0255.4 ).
	the two keywords do not coincide on UniRef90 proteins

	only  PF00430 has a PDB structure (may not be up to date)
	PF00430 	   f.23.21.1  j.35.1.1
	
	SUPERFAM mapping significantly overlapping:
     1	PF00430 SSF82607 0.684 (average over 1 mutual instances, PF00430 10 appearances, SSF82607 761 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:
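Record header check. Each record's title line carries a running index followed by an identifier of the form <cluster id>_<Pfam accession>_<Pfam accession>. In this record the leading number (6664257) is the cluster shown on the "parent" line. The sketch below splits such a title line into its parts; the regular expression and the name parse_header are assumptions for illustration only.

```python
import re

# Matches the core of a title line such as
# "-------------------====== ( 550 ) 6664257_PF00430_PF05103   ==========---"
HEADER = re.compile(r"\(\s*(\d+)\s*\)\s+(\d+)_(PF\d{5})_(PF\d{5})")


def parse_header(line):
    """Return (record index, cluster id, first keyword, second keyword)."""
    m = HEADER.search(line)
    if m is None:
        raise ValueError(f"not a record header: {line!r}")
    index, cluster, kw1, kw2 = m.groups()
    return int(index), int(cluster), kw1, kw2


header = ("-------------------====== ( 550 ) 6664257_PF00430_PF05103"
          "            ==========------------------")
print(parse_header(header))  # prints (550, 6664257, 'PF00430', 'PF05103')
```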






-------------------====== ( 551 ) 6653706_PF00505_PF03531            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00505  is 6648722 with Jaccard = 0.8889	|PF00505|=864 [ 768	0	1099347	96 ]
 parent [ 6648722 ] : 6653706 0.195702 (=10838/(65*852)) 81.2733
  given [ 6648722 ] : 6648722 0.240575 (=1423/(7*845)) 79.5867
	best keyword for cluster 6648722 is PF00505 with Jaccard = 0.8889	[ 768	0	1099347	96 ]	1.0000 0.8889
sibling [ 6648722 ] : 6624418 0.301333 (=226/(15*50)) 72.328
	best keyword for cluster 6624418 is PF03531 with Jaccard = 0.7667	[ 46	14	1100151	0 ]	0.7667 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF00505 ( PF00505   HMG (high mobility group) box )
	B> PF03531 ( PF03531   Structure-specific recognition protein (SSRP1) )

	A and B come from different clans ( CL0114.6 , CL0215.5 ).

	the two keywords coincide on Uniref90 proteins: |PF00505| = 864 ,	 |PF03531| = 46 , 	|PF00505^PF03531| = 17	( 2.0% and 37.0% )

	only  PF00505 has a PDB structure (may not be up to date)
	PF00505 	   a.21.1.1
	
	SUPERFAM mapping significantly overlapping:
     1	PF00505 SSF47095 0.800 (average over 2604 mutual instances, PF00505 2716 appearances, SSF47095 3113 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:
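SUPERFAM line check. The mapping line of this record gives a Pfam/SSF pair, the average agreement, and three counts (mutual instances and the appearances of each signature). The sketch below extracts those fields and derives the fraction of each signature's appearances that are mutual. That derived fraction is not a value the report prints, and the names parse_ssf_line and mutual_fractions are hypothetical.

```python
import re

SSF_LINE = re.compile(
    r"^\s*\d+\s+(PF\d{5})\s+(SSF\d+)\s+([0-9.]+)\s+"
    r"\(average over (\d+) mutual instances, "
    r"\1 (\d+) appearances, \2 (\d+) appearances\)\s*$"
)


def parse_ssf_line(line):
    """Split one SUPERFAM mapping line into its named fields."""
    m = SSF_LINE.match(line)
    if m is None:
        raise ValueError(f"not a SUPERFAM mapping line: {line!r}")
    pfam, ssf, agreement, mutual, n_pfam, n_ssf = m.groups()
    return {
        "pfam": pfam, "ssf": ssf, "agreement": float(agreement),
        "mutual": int(mutual), "pfam_count": int(n_pfam), "ssf_count": int(n_ssf),
    }


def mutual_fractions(rec):
    """Share of each signature's appearances that co-occur with the other."""
    return rec["mutual"] / rec["pfam_count"], rec["mutual"] / rec["ssf_count"]


line = ("     1\tPF00505 SSF47095 0.800 (average over 2604 mutual instances, "
        "PF00505 2716 appearances, SSF47095 3113 appearances)")
rec = parse_ssf_line(line)
frac_pfam, frac_ssf = mutual_fractions(rec)
print(rec["pfam"], rec["ssf"], rec["agreement"])
print(round(frac_pfam, 3), round(frac_ssf, 3))
```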






-------------------====== ( 552 ) 6647472_PF03168_PF07427            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF07427  is 6625919 with Jaccard = 0.8889	|PF07427|=9 [ 8	0	1100202	1 ]
 parent [ 6625919 ] : 6647472 0.254261 (=179/(64*11)) 79.2263
  given [ 6625919 ] : 6625919 0.277778 (=5/(2*9)) 72.9905
	best keyword for cluster 6625919 is PF07427 with Jaccard = 0.8889	[ 8	0	1100202	1 ]	1.0000 0.8889
sibling [ 6625919 ] : 6607992 0.394917 (=404/(31*33)) 65.8917
	best keyword for cluster 6607992 is PF03168 with Jaccard = 1.0000	[ 26	0	1100185	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF07427 ( PF07427   Protein of unknown function (DUF1511) )
	B> PF03168 ( PF03168   Late embryogenesis abundant protein )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF07427 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 553 ) 6681333_PF03032_PF08018            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF08018  is 6651888 with Jaccard = 0.8889	|PF08018|=27 [ 24	0	1100184	3 ]
 parent [ 6651888 ] : 6681333 0.128254 (=202/(25*63)) 88.5089
  given [ 6651888 ] : 6651888 0.282609 (=13/(2*23)) 80.637
	best keyword for cluster 6651888 is PF08018 with Jaccard = 0.8889	[ 24	0	1100184	3 ]	1.0000 0.8889
sibling [ 6651888 ] : 6678437 0.127119 (=30/(4*59)) 87.805
	best keyword for cluster 6678437 is PF03032 with Jaccard = 0.6769	[ 44	0	1100146	21 ]	1.0000 0.6769

	SUGGESTING RELATEDNESS OF:   
	A> PF08018 ( PF08018   Frog antimicrobial peptide )
	B> PF03032 ( PF03032   Brevenin/esculentin/gaegurin/rugosin family )

	Neither A nor B are assigned a clan.

	the two keywords coincide on Uniref90 proteins: |PF03032| = 65 ,	 |PF08018| = 28 , 	|PF03032^PF08018| = 8	( 12.3% and 28.6% )

	Neither PF08018 nor PF03032 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 554 ) 6625262_PF00308_PF01695            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01695  is 6607912 with Jaccard = 0.8885	|PF01695|=292 [ 287	31	1099888	5 ]
 parent [ 6607912 ] : 6625262 0.314167 (=52665/(417*402)) 72.5964
  given [ 6607912 ] : 6607912 0.365012 (=603/(4*413)) 65.8168
	best keyword for cluster 6607912 is PF01695 with Jaccard = 0.8885	[ 287	31	1099888	5 ]	0.9025 0.9829
sibling [ 6607912 ] : 6622739 0.28625 (=229/(2*400)) 71.6495
	best keyword for cluster 6622739 is PF00308 with Jaccard = 0.9633	[ 315	12	1099884	0 ]	0.9633 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF01695 ( PF01695   IstB-like ATP binding protein )
	B> PF00308 ( PF00308   Bacterial dnaA protein )

	they come from the same clan: CL0023.26 : PF02367	PF02534	PF02463	PF01202	PF00158	PF08542	PF03215	PF05729	PF00488	PF01078	PF00493	PF08433	PF01695	PF00437	PF05872	PF06144	PF00308	PF01583	PF00005	PF08298	PF07728	PF07726	PF07724	PF00004	PF05707
	the two keywords do not coincide on UniRef90 proteins

	only  PF01695 has a PDB structure (may not be up to date)
		PF00308 	   c.37.1.20

	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 555 ) 6651089_PF01583_PF01747            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01747  is 6541006 with Jaccard = 0.8881	|PF01747|=134 [ 119	0	1100077	15 ]
 parent [ 6541006 ] : 6651089 0.197373 (=5409/(203*135)) 80.3545
  given [ 6541006 ] : 6541006 0.680451 (=181/(2*133)) 33.1475
	best keyword for cluster 6541006 is PF01747 with Jaccard = 0.8881	[ 119	0	1100077	15 ]	1.0000 0.8881
sibling [ 6541006 ] : 6536186 0.738333 (=443/(3*200)) 29.6961
	best keyword for cluster 6536186 is PF01583 with Jaccard = 0.7027	[ 182	0	1099952	77 ]	1.0000 0.7027

	SUGGESTING RELATEDNESS OF:   
	A> PF01747 ( PF01747   ATP-sulfurylase )
	B> PF01583 ( PF01583   Adenylylsulphate kinase )

	Only B has a clan ( CL0023.26 ).

	the two keywords coincide on Uniref90 proteins: |PF01583| = 259 ,	 |PF01747| = 134 , 	|PF01583^PF01747| = 33	( 12.7% and 24.6% )

	both PF01747 and PF01583 have PDB structures
		PF01583 	   c.37.1.15  c.37.1.4

	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 556 ) 6705094_PF00891_PF05891            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00891  is 6648966 with Jaccard = 0.8862	|PF00891|=380 [ 366	33	1099798	14 ]
 parent [ 6648966 ] : 6705094 0.0886865 (=1636/(43*429)) 93.1238
  given [ 6648966 ] : 6648966 0.220012 (=741/(8*421)) 79.6904
	best keyword for cluster 6648966 is PF00891 with Jaccard = 0.8862	[ 366	33	1099798	14 ]	0.9173 0.9632
sibling [ 6648966 ] : 6610234 0.371795 (=58/(39*4)) 66.8698
	best keyword for cluster 6610234 is PF05891 with Jaccard = 0.9286	[ 39	3	1100169	0 ]	0.9286 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF00891 ( PF00891   O-methyltransferase )
	B> PF05891 ( PF05891   Eukaryotic protein of unknown function (DUF858) )

	they come from the same clan: CL0102.14 : PF06962	PF00398	PF06325	PF03291	PF01135	PF01358	PF06460	PF01189	PF05401	PF01234	PF01555	PF02384	PF07942	PF05175	PF05063	PF07109	PF02475	PF07021	PF08003	PF05148	PF01795	PF02390	PF01596	PF00891	PF09445	PF08242	PF08241	PF05971	PF02086	PF02527	PF08704	PF01728	PF01269	PF07669	PF06080	PF05891	PF05430	PF04816	PF04672	PF04445	PF04378	PF01861	PF03269	PF03141	PF07757	PF07279	PF05219	PF08123	PF00145	PF03602	PF02353	PF01739	PF06859	PF09243	PF01564	PF03848	PF05724	PF02005	PF05958	PF01209	PF01170
	the two keywords do not coincide on UniRef90 proteins

	both PF00891 and PF05891 have PDB structures
		PF05891 	   c.66.1.42

	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 557 ) 6667538_PF02493_PF07661            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02493  is 6662460 with Jaccard = 0.8847	|PF02493|=449 [ 399	2	1099760	50 ]
 parent [ 6662460 ] : 6667538 0.17733 (=12406/(159*440)) 84.8563
  given [ 6662460 ] : 6662460 0.180619 (=315/(4*436)) 83.7714
	best keyword for cluster 6662460 is PF02493 with Jaccard = 0.8847	[ 399	2	1099760	50 ]	0.9950 0.8886
sibling [ 6662460 ] : 6648173 0.228632 (=107/(3*156)) 79.4327
	best keyword for cluster 6648173 is PF07661 with Jaccard = 0.9138	[ 106	1	1100095	9 ]	0.9907 0.9217

	SUGGESTING RELATEDNESS OF:   
	A> PF02493 ( PF02493   MORN repeat )
	B> PF07661 ( PF07661   MORN repeat variant )

	they come from the same clan: CL0251.3 : PF07661	PF02493

	the two keywords coincide on Uniref90 proteins: |PF02493| = 449 ,	 |PF07661| = 115 , 	|PF02493^PF07661| = 2	( 0.4% and 1.7% )

	only  PF02493 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:
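Clan line check. When both keywords share a clan, the record lists the clan identifier followed by its member accessions. The sketch below parses that line and confirms that both suggested keywords appear among the members. The regular expression and the name parse_clan_line are assumptions for illustration, and the suffix after the dot in the clan identifier is kept as-is without interpretation.

```python
import re

CLAN_LINE = re.compile(r"same clan:\s*(CL\d+(?:\.\d+)?)\s*:\s*(.*)$")


def parse_clan_line(line):
    """Return (clan identifier, list of member Pfam accessions)."""
    m = CLAN_LINE.search(line)
    if m is None:
        raise ValueError(f"not a shared-clan line: {line!r}")
    return m.group(1), m.group(2).split()


line = "\tthey come from the same clan: CL0251.3 : PF07661\tPF02493"
clan, members = parse_clan_line(line)

keyword_a, keyword_b = "PF02493", "PF07661"  # A and B of this record
print(clan, members)
print(keyword_a in members and keyword_b in members)  # prints True
```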






-------------------====== ( 558 ) 6716047_PF00590_PF02602            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00590  is 6685379 with Jaccard = 0.8814	|PF00590|=1270 [ 1122	3	1098938	148 ]
 parent [ 6685379 ] : 6716047 0.0535455 (=21959/(300*1367)) 94.8904
  given [ 6685379 ] : 6685379 0.133467 (=11625/(67*1300)) 89.3396
	best keyword for cluster 6685379 is PF00590 with Jaccard = 0.8814	[ 1122	3	1098938	148 ]	0.9973 0.8835
sibling [ 6685379 ] : 6669701 0.165375 (=1792/(42*258)) 85.4332
	best keyword for cluster 6669701 is PF02602 with Jaccard = 0.8272	[ 249	0	1099910	52 ]	1.0000 0.8272

	SUGGESTING RELATEDNESS OF:   
	A> PF00590 ( PF00590   Tetrapyrrole (Corrin/Porphyrin) Methylases )
	B> PF02602 ( PF02602   Uroporphyrinogen-III synthase HemD )

	Neither A nor B are assigned a clan.

	the two keywords coincide on Uniref90 proteins: |PF00590| = 1270 ,	 |PF02602| = 301 , 	|PF00590^PF02602| = 58	( 4.6% and 19.3% )

	both PF00590 and PF02602 have PDB structures
		PF02602 	   c.113.1.1

	SUPERFAM mapping significantly overlapping:
     1	PF02602 SSF69618 0.927 (average over 844 mutual instances, PF02602 1033 appearances, SSF69618 1108 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 559 ) 6715698_PF02945_PF03175            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF03175  is 6692611 with Jaccard = 0.8812	|PF03175|=101 [ 89	0	1100110	12 ]
 parent [ 6692611 ] : 6715698 0.0615757 (=483/(74*106)) 94.8307
  given [ 6692611 ] : 6692611 0.107843 (=44/(4*102)) 90.7988
	best keyword for cluster 6692611 is PF03175 with Jaccard = 0.8812	[ 89	0	1100110	12 ]	1.0000 0.8812
sibling [ 6692611 ] : 6664680 0.184942 (=253/(36*38)) 84.2093
	best keyword for cluster 6664680 is PF02945 with Jaccard = 0.8462	[ 33	6	1100172	0 ]	0.8462 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF03175 ( PF03175   DNA polymerase type B, organellar and viral )
	B> PF02945 ( PF02945   Recombination endonuclease VII )

	A and B come from different clans ( CL0194.5 , CL0263.2 ).
	the two keywords do not coincide on UniRef90 proteins

	both PF03175 and PF02945 have PDB structures
		PF02945 	   d.4.1.5

	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 560 ) 6698382_PF00046_PF00412            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00046  is 6695559 with Jaccard = 0.8808	|PF00046|=3370 [ 3171	230	1096611	199 ]
 parent [ 6695559 ] : 6698382 0.086379 (=207982/(645*3733)) 91.9378
  given [ 6695559 ] : 6695559 0.114319 (=21467/(51*3682)) 91.4494
	best keyword for cluster 6695559 is PF00046 with Jaccard = 0.8808	[ 3171	230	1096611	199 ]	0.9324 0.9409
sibling [ 6695559 ] : 6686524 0.128315 (=329/(4*641)) 89.5542
	best keyword for cluster 6686524 is PF00412 with Jaccard = 0.7463	[ 562	32	1099458	159 ]	0.9461 0.7795

	SUGGESTING RELATEDNESS OF:   
	A> PF00046 ( PF00046   Homeobox domain )
	B> PF00412 ( PF00412   LIM domain )

	Only A has a clan ( CL0123.12 ).

	the two keywords coincide on Uniref90 proteins: |PF00046| = 3370 ,	 |PF00412| = 721 , 	|PF00046^PF00412| = 105	( 3.1% and 14.6% )

	both PF00046 and PF00412 have PDB structures
	PF00046 	   a.4.1.1  j.92.1.1
	
	SUPERFAM mapping significantly overlapping:
     1	PF00046 SSF46689 0.773 (average over 9143 mutual instances, PF00046 9568 appearances, SSF46689 68153 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 561 ) 6743478_PF00551_PF02769            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02769  is 6686008 with Jaccard = 0.8803	|PF02769|=1069 [ 1059	134	1099008	10 ]
 parent [ 6686008 ] : 6743478 0.0193508 (=24548/(1367*928)) 98.0966
  given [ 6686008 ] : 6686008 0.120242 (=52036/(869*498)) 89.476
	best keyword for cluster 6686008 is PF02769 with Jaccard = 0.8803	[ 1059	134	1099008	10 ]	0.8877 0.9906
sibling [ 6686008 ] : 6733475 0.0442287 (=41/(1*927)) 97.1063
	best keyword for cluster 6733475 is PF00551 with Jaccard = 0.9203	[ 808	7	1099333	63 ]	0.9914 0.9277

	SUGGESTING RELATEDNESS OF:   
	A> PF02769 ( PF02769   AIR synthase related protein, C-terminal domain )
	B> PF00551 ( PF00551   Formyl transferase )

	Neither A nor B are assigned a clan.

	the two keywords coincide on Uniref90 proteins: |PF00551| = 871 ,	 |PF02769| = 1069 , 	|PF00551^PF02769| = 30	( 3.4% and 2.8% )

	both PF02769 and PF00551 have PDB structures
	PF02769 	   d.139.1.1
	PF00551 	   c.65.1.1

	SUPERFAM mapping significantly overlapping:
     1	PF02769 SSF56042 0.865 (average over 3183 mutual instances, PF02769 6538 appearances, SSF56042 6282 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 562 ) 6628159_PF01579_PF03236            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01579  is 6621820 with Jaccard = 0.8800	|PF01579|=49 [ 44	1	1100161	5 ]
 parent [ 6621820 ] : 6628159 0.295733 (=506/(29*59)) 73.7389
  given [ 6621820 ] : 6621820 0.307018 (=35/(2*57)) 71.2946
	best keyword for cluster 6621820 is PF01579 with Jaccard = 0.8800	[ 44	1	1100161	5 ]	0.9778 0.8980
sibling [ 6621820 ] : 6555485 0.607143 (=102/(8*21)) 43.0595
	best keyword for cluster 6555485 is PF03236 with Jaccard = 0.9000	[ 18	2	1100191	0 ]	0.9000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF01579 ( PF01579   Domain of unknown function DUF19 )
	B> PF03236 ( PF03236   Domain of unknown function DUF263 )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF01579 nor PF03236 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 563 ) 6706799_PF03544_PF05569            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF03544  is 6705391 with Jaccard = 0.8798	|PF03544|=234 [ 227	24	1099953	7 ]
 parent [ 6705391 ] : 6706799 0.0705605 (=3298/(82*570)) 93.4258
  given [ 6705391 ] : 6705391 0.0848109 (=287/(6*564)) 93.1689
	best keyword for cluster 6705391 is PF03544 with Jaccard = 0.8798	[ 227	24	1099953	7 ]	0.9044 0.9701
sibling [ 6705391 ] : 6649350 0.225 (=36/(2*80)) 79.8615
	best keyword for cluster 6649350 is PF05569 with Jaccard = 0.6700	[ 67	1	1100111	32 ]	0.9853 0.6768

	SUGGESTING RELATEDNESS OF:   
	A> PF03544 ( PF03544   Gram-negative bacterial tonB protein )
	B> PF05569 ( PF05569   BlaR1 peptidase M56 )

	Only B has a clan ( CL0150.6 ).

	the two keywords coincide on Uniref90 proteins: |PF03544| = 234 ,	 |PF05569| = 99 , 	|PF03544^PF05569| = 13	( 5.6% and 13.1% )

	only  PF03544 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 564 ) 6729734_PF00320_PF04855            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00320  is 6728520 with Jaccard = 0.8789	|PF00320|=385 [ 341	3	1099823	44 ]
 parent [ 6728520 ] : 6729734 0.037004 (=746/(48*420)) 96.6962
  given [ 6728520 ] : 6728520 0.0373171 (=153/(10*410)) 96.548
	best keyword for cluster 6728520 is PF00320 with Jaccard = 0.8789	[ 341	3	1099823	44 ]	0.9913 0.8857
sibling [ 6728520 ] : 6671414 0.141304 (=13/(2*46)) 85.8777
	best keyword for cluster 6671414 is PF04855 with Jaccard = 1.0000	[ 43	0	1100168	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF00320 ( PF00320   GATA zinc finger )
	B> PF04855 ( PF04855   SNF5 / SMARCB1 / INI1 )

	Only A has a clan ( CL0167.10 ).

	the two keywords coincide on Uniref90 proteins: |PF00320| = 385 ,	 |PF04855| = 43 , 	|PF00320^PF04855| = 2	( 0.5% and 4.7% )

	only  PF00320 has a PDB structure (may not be up to date)
	PF00320 	   g.39.1.1
	
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 565 ) 6680903_PF04717_PF06890            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF06890  is 6463517 with Jaccard = 0.8788	|PF06890|=33 [ 29	0	1100178	4 ]
 parent [ 6463517 ] : 6680903 0.134473 (=346/(31*83)) 88.41
  given [ 6463517 ] : 6463517 0.976923 (=127/(26*5)) 2.79599
	best keyword for cluster 6463517 is PF06890 with Jaccard = 0.8788	[ 29	0	1100178	4 ]	1.0000 0.8788
sibling [ 6463517 ] : 6669671 0.166915 (=224/(22*61)) 85.4093
	best keyword for cluster 6669671 is PF04717 with Jaccard = 0.9273	[ 51	2	1100156	2 ]	0.9623 0.9623

	SUGGESTING RELATEDNESS OF:   
	A> PF06890 ( PF06890   Bacteriophage Mu Gp45 protein )
	B> PF04717 ( PF04717   Phage-related baseplate assembly protein )

	Neither A nor B are assigned a clan.

	the two keywords coincide on Uniref90 proteins: |PF04717| = 53 ,	 |PF06890| = 33 , 	|PF04717^PF06890| = 3	( 5.7% and 9.1% )

	Neither PF06890 nor PF04717 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 566 ) 6598068_PF02797_PF08541            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF08541  is 6563150 with Jaccard = 0.8767	|PF08541|=448 [ 398	6	1099757	50 ]
 parent [ 6563150 ] : 6598068 0.427155 (=60902/(469*304)) 60.6578
  given [ 6563150 ] : 6563150 0.534261 (=499/(2*467)) 49.6942
	best keyword for cluster 6563150 is PF08541 with Jaccard = 0.8767	[ 398	6	1099757	50 ]	0.9851 0.8884
sibling [ 6563150 ] : 6597133 0.402318 (=243/(2*302)) 60.3392
	best keyword for cluster 6597133 is PF02797 with Jaccard = 0.8990	[ 258	24	1099924	5 ]	0.9149 0.9810

	SUGGESTING RELATEDNESS OF:   
	A> PF08541 ( PF08541   3-Oxoacyl-[acyl-carrier-protein (ACP)] synthase III C terminal )
	B> PF02797 ( PF02797   Chalcone and stilbene synthases, C-terminal domain )

	they come from the same clan: CL0046.10 : PF02803	PF02801	PF00109	PF01154	PF08392	PF00195	PF02797	PF08545	PF08541	PF00108
	the two keywords do not coincide on UniRef90 proteins

	both PF08541 and PF02797 have PDB structures
		PF02797 	   c.95.1.2

	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 567 ) 6738378_PF00753_PF07522            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00753  is 6735710 with Jaccard = 0.8752	|PF00753|=3135 [ 2897	175	1096901	238 ]
 parent [ 6735710 ] : 6738378 0.0311918 (=10091/(81*3994)) 97.6263
  given [ 6735710 ] : 6735710 0.038499 (=65869/(488*3506)) 97.3396
	best keyword for cluster 6735710 is PF00753 with Jaccard = 0.8752	[ 2897	175	1096901	238 ]	0.9430 0.9241
sibling [ 6735710 ] : 6717203 0.0579151 (=30/(74*7)) 95.049
	best keyword for cluster 6717203 is PF07522 with Jaccard = 0.9153	[ 54	3	1100152	2 ]	0.9474 0.9643

	SUGGESTING RELATEDNESS OF:   
	A> PF00753 ( PF00753   Metallo-beta-lactamase superfamily )
	B> PF07522 ( PF07522   DNA repair metallo-beta-lactamase )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF00753 has a PDB structure (may not be up to date)
	PF00753 	   d.157.1.1  d.157.1.2  d.157.1.3  d.157.1.7  d.157.1.9
	
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 568 ) 6581327_PF03516_PF05474            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF05474  is 6534316 with Jaccard = 0.8750	|PF05474|=15 [ 14	1	1100195	1 ]
 parent [ 6534316 ] : 6581327 0.478632 (=112/(18*13)) 54.1525
  given [ 6534316 ] : 6534316 0.732143 (=41/(4*14)) 28.4559
	best keyword for cluster 6534316 is PF05474 with Jaccard = 0.8750	[ 14	1	1100195	1 ]	0.9333 0.9333
sibling [ 6534316 ] : 6560600 0.545455 (=12/(2*11)) 47.2386
	best keyword for cluster 6560600 is PF03516 with Jaccard = 0.7143	[ 5	2	1100204	0 ]	0.7143 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF05474 ( PF05474   Semenogelin )
	B> PF03516 ( PF03516   Filaggrin )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF05474 nor PF03516 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 569 ) 6712639_PF00004_PF05695            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF05695  is 6701117 with Jaccard = 0.8750	|PF05695|=35 [ 35	5	1100171	0 ]
 parent [ 6701117 ] : 6712639 0.0646679 (=30521/(65*7261)) 94.3402
  given [ 6701117 ] : 6701117 0.0778689 (=19/(4*61)) 92.3812
	best keyword for cluster 6701117 is PF05695 with Jaccard = 0.8750	[ 35	5	1100171	0 ]	0.8750 1.0000
sibling [ 6701117 ] : 6711543 0.0669631 (=24623/(51*7210)) 94.1712
	best keyword for cluster 6711543 is PF00004 with Jaccard = 0.6365	[ 3979	2107	1093960	165 ]	0.6538 0.9602

	SUGGESTING RELATEDNESS OF:   
	A> PF05695 ( PF05695   Plant protein of unknown function (DUF825) )
	B> PF00004 ( PF00004   ATPase family associated with various cellular activities (AAA) )

	Only B has a clan ( CL0023.26 ).

	the two keywords coincide on Uniref90 proteins: |PF00004| = 4144 ,	 |PF05695| = 35 , 	|PF00004^PF05695| = 17	( 0.4% and 48.6% )

	only  PF05695 has a PDB structure (may not be up to date)
		PF00004 	   c.37.1.1  c.37.1.20

	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 570 ) 6737235_PF00033_PF03161            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF03161  is 6632032 with Jaccard = 0.8710	|PF03161|=93 [ 81	0	1100118	12 ]
 parent [ 6632032 ] : 6737235 0.0250365 (=7058/(86*3278)) 97.5055
  given [ 6632032 ] : 6632032 0.267857 (=45/(2*84)) 75.1462
	best keyword for cluster 6632032 is PF03161 with Jaccard = 0.8710	[ 81	0	1100118	12 ]	1.0000 0.8710
sibling [ 6632032 ] : 6735269 0.042722 (=140/(1*3277)) 97.2996
	best keyword for cluster 6735269 is PF00033 with Jaccard = 0.9322	[ 2927	199	1097071	14 ]	0.9363 0.9952

	SUGGESTING RELATEDNESS OF:   
	A> PF03161 ( PF03161   LAGLIDADG DNA endonuclease family )
	B> PF00033 ( PF00033   Cytochrome b(N-terminal)/b6/petB )

	Neither A nor B are assigned a clan.

	the two keywords coincide on Uniref90 proteins: |PF00033| = 2941 ,	 |PF03161| = 93 , 	|PF00033^PF03161| = 3	( 0.1% and 3.2% )

	both PF03161 and PF00033 have PDB structures
	PF03161 	   d.95.2.1
	PF00033 	   f.21.1.2

	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 571 ) 6724780_PF03724_PF04170            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF03724  is 6685476 with Jaccard = 0.8692	|PF03724|=107 [ 93	0	1100104	14 ]
 parent [ 6685476 ] : 6724780 0.0420338 (=291/(43*161)) 96.0822
  given [ 6685476 ] : 6685476 0.112609 (=618/(49*112)) 89.369
	best keyword for cluster 6685476 is PF03724 with Jaccard = 0.8692	[ 93	0	1100104	14 ]	1.0000 0.8692
sibling [ 6685476 ] : 6666141 0.158537 (=13/(2*41)) 84.5004
	best keyword for cluster 6666141 is PF04170 with Jaccard = 0.9524	[ 20	1	1100190	0 ]	0.9524 1.0000
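The parent / given / sibling lines follow the legend's layout: relation [ best cluster ] : cluster, linkage, (=edges/(|cluster1|*|cluster2|)), ProtoLevel. The sketch below parses one such line, recomputes the linkage from the printed fraction, and derives the cluster size as |cluster1| + |cluster2|. The regular expression and field names are assumptions made for illustration, based only on the lines shown in this report.

```python
import re

# Pattern for lines such as:
#  parent [ 6685476 ] : 6724780 0.0420338 (=291/(43*161)) 96.0822
RELATION_LINE = re.compile(
    r"^\s*(?P<relation>parent|given|sibling)\s*\[\s*(?P<best>\d+)\s*\]\s*:\s*"
    r"(?P<cluster>\d+)\s+(?P<linkage>\S+)\s+"
    r"\(=(?P<edges>\d+)/\((?P<n1>\d+)\*(?P<n2>\d+)\)\)\s+(?P<level>\S+)\s*$"
)


def parse_relation_line(line):
    """Return a dict of fields for a relation line, or None if it does not match."""
    m = RELATION_LINE.match(line)
    if m is None:
        return None
    edges, n1, n2 = int(m["edges"]), int(m["n1"]), int(m["n2"])
    return {
        "relation": m["relation"],
        "best_cluster": m["best"],
        "cluster": m["cluster"],
        "reported_linkage": float(m["linkage"]),
        "recomputed_linkage": edges / (n1 * n2),  # existing edges / possible edges
        "cluster_size": n1 + n2,                  # per the legend
        "proto_level": float(m["level"]),
    }


rec = parse_relation_line(" parent [ 6685476 ] : 6724780 0.0420338 (=291/(43*161)) 96.0822")
print(rec["cluster_size"], round(rec["recomputed_linkage"], 7))
```

In this record the sizes are consistent across the tree: the given cluster has 49 + 112 = 161 members and the sibling has 2 + 41 = 43, which are the two factors in the parent's denominator.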

	SUGGESTING RELATEDNESS OF:   
	A> PF03724 ( PF03724   Domain of unknown function (306) )
	B> PF04170 ( PF04170   Uncharacterized lipoprotein NlpE involved in copper resistance )

	Neither A nor B are assigned a clan.

	the two keywords coincide on Uniref90 proteins: |PF03724| = 107 ,	 |PF04170| = 20 , 	|PF03724^PF04170| = 1	( 0.9% and 5.0% )

	Neither PF03724 nor PF04170 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 572 ) 6774576_PF02674_PF06900            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02674  is 6772591 with Jaccard = 0.8684	|PF02674|=188 [ 165	2	1100021	23 ]
 parent [ 6772591 ] : 6774576 0.00203037 (=23/(48*236)) 99.8453
  given [ 6772591 ] : 6772591 0.00274725 (=15/(26*210)) 99.7927
	best keyword for cluster 6772591 is PF02674 with Jaccard = 0.8684	[ 165	2	1100021	23 ]	0.9880 0.8777
sibling [ 6772591 ] : 6770289 0.0037037 (=2/(18*30)) 99.7204
	best keyword for cluster 6770289 is PF06900 with Jaccard = 1.0000	[ 5	0	1100206	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF02674 ( PF02674   Colicin V production protein )
	B> PF06900 ( PF06900   Protein of unknown function (DUF1270) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF02674 nor PF06900 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 573 ) 6721711_PF04271_PF04492            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04271  is 6711946 with Jaccard = 0.8673	|PF04271|=92 [ 85	6	1100113	7 ]
 parent [ 6711946 ] : 6721711 0.0456731 (=475/(52*200)) 95.6656
  given [ 6711946 ] : 6711946 0.0652778 (=235/(20*180)) 94.2464
	best keyword for cluster 6711946 is PF04271 with Jaccard = 0.8673	[ 85	6	1100113	7 ]	0.9341 0.9239
sibling [ 6711946 ] : 6673410 0.139881 (=94/(24*28)) 86.4985
	best keyword for cluster 6673410 is PF04492 with Jaccard = 0.6667	[ 18	5	1100184	4 ]	0.7826 0.8182

	SUGGESTING RELATEDNESS OF:   
	A> PF04271 ( PF04271   DnaD-like domain )
	B> PF04492 ( PF04492   Bacteriophage replication protein O )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF04271 nor PF04492 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 574 ) 6622276_PF02309_PF02362            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02309  is 6606108 with Jaccard = 0.8662	|PF02309|=142 [ 123	0	1100069	19 ]
 parent [ 6606108 ] : 6622276 0.293164 (=6459/(153*144)) 71.4924
  given [ 6606108 ] : 6606108 0.369718 (=105/(2*142)) 64.5477
	best keyword for cluster 6606108 is PF02309 with Jaccard = 0.8662	[ 123	0	1100069	19 ]	1.0000 0.8662
sibling [ 6606108 ] : 6599331 0.405721 (=1773/(38*115)) 61.2884
	best keyword for cluster 6599331 is PF02362 with Jaccard = 0.6256	[ 137	1	1099992	81 ]	0.9928 0.6284

	SUGGESTING RELATEDNESS OF:   
	A> PF02309 ( PF02309   AUX/IAA family )
	B> PF02362 ( PF02362   B3 DNA binding domain )

	Neither A nor B are assigned a clan.

	the two keywords coincide on Uniref90 proteins: |PF02309| = 142 ,	 |PF02362| = 218 , 	|PF02309^PF02362| = 14	( 9.9% and 6.4% )

	only  PF02309 has a PDB structure (may not be up to date)
		PF02362 	   b.142.1.2

	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 575 ) 6475621_PF00749_PF03950            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF03950  is 6323607 with Jaccard = 0.8655	|PF03950|=216 [ 193	7	1099988	23 ]
 parent [ 6323607 ] : 6475621 0.959779 (=101845/(217*489)) 4.78999
  given [ 6323607 ] : 6323607 1 (=5610/(30*187)) 7.13173e-08
	best keyword for cluster 6323607 is PF03950 with Jaccard = 0.8655	[ 193	7	1099988	23 ]	0.9650 0.8935
sibling [ 6323607 ] : 6445136 0.991964 (=5678/(12*477)) 0.950106
	best keyword for cluster 6445136 is PF00749 with Jaccard = 0.6706	[ 450	0	1099540	221 ]	1.0000 0.6706

	SUGGESTING RELATEDNESS OF:   
	A> PF03950 ( PF03950   tRNA synthetases class I (E and Q), anti-codon binding domain )
	B> PF00749 ( PF00749   tRNA synthetases class I (E and Q), catalytic domain )

	Only B has a clan ( CL0038.9 ).

	the two keywords coincide on Uniref90 proteins: |PF00749| = 671 ,	 |PF03950| = 216 , 	|PF00749^PF03950| = 210	( 31.3% and 97.2% )

	both PF03950 and PF00749 have PDB structures
		
	SUPERFAM mapping significantly overlapping:
     1	PF03950 SSF50715 0.905 (average over 619 mutual instances, PF03950 710 appearances, SSF50715 2049 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 576 ) 6753665_PF00337_PF04099            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00337  is 6752320 with Jaccard = 0.8651	|PF00337|=212 [ 186	3	1099996	26 ]
 parent [ 6752320 ] : 6753665 0.0114014 (=343/(138*218)) 98.8814
  given [ 6752320 ] : 6752320 0.0143541 (=27/(9*209)) 98.7911
	best keyword for cluster 6752320 is PF00337 with Jaccard = 0.8651	[ 186	3	1099996	26 ]	0.9841 0.8774
sibling [ 6752320 ] : 6681927 0.134615 (=630/(78*60)) 88.6667
	best keyword for cluster 6681927 is PF04099 with Jaccard = 0.7653	[ 75	23	1100113	0 ]	0.7653 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF00337 ( PF00337   Galactoside-binding lectin )
	B> PF04099 ( PF04099   Sybindin-like family )

	A and B come from different clans ( CL0004.14 , CL0212.4 ).

	the two keywords coincide on Uniref90 proteins: |PF00337| = 212 ,	 |PF04099| = 75 , 	|PF00337^PF04099| = 2	( 0.9% and 2.7% )

	only  PF00337 has a PDB structure (may not be up to date)
	PF00337 	   b.29.1.3
	
	SUPERFAM mapping significantly overlapping:
     1	PF00337 SSF49899 0.926 (average over 447 mutual instances, PF00337 453 appearances, SSF49899 14070 appearances)
     2	PF04099 SSF64356 0.954 (average over 167 mutual instances, PF04099 170 appearances, SSF64356 1711 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 577 ) 6595370_PF00386_PF01391            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00386  is 6579905 with Jaccard = 0.8649	|PF00386|=174 [ 160	11	1100026	14 ]
 parent [ 6579905 ] : 6595370 0.42048 (=68339/(182*893)) 59.5189
  given [ 6579905 ] : 6579905 0.481875 (=2313/(32*150)) 53.6781
	best keyword for cluster 6579905 is PF00386 with Jaccard = 0.8649	[ 160	11	1100026	14 ]	0.9357 0.9195
sibling [ 6579905 ] : 6593540 0.434637 (=12335/(33*860)) 58.72
	best keyword for cluster 6593540 is PF01391 with Jaccard = 0.6285	[ 751	36	1099016	408 ]	0.9543 0.6480

	SUGGESTING RELATEDNESS OF:   
	A> PF00386 ( PF00386   C1q domain )
	B> PF01391 ( PF01391   Collagen triple helix repeat (20 copies) )

	Only A has a clan ( CL0100.7 ).

	the two keywords coincide on Uniref90 proteins: |PF00386| = 174 ,	 |PF01391| = 1159 , 	|PF00386^PF01391| = 90	( 51.7% and 7.8% )

	both PF00386 and PF01391 have PDB structures
	PF00386 	   b.22.1.1
	PF01391 	   d.169.1.5  h.1.1.1

	SUPERFAM mapping significantly overlapping:
     1	PF00386 SSF49842 0.910 (average over 373 mutual instances, PF00386 377 appearances, SSF49842 1081 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 578 ) 6682017_PF01895_PF02690            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01895  is 6601900 with Jaccard = 0.8649	|PF01895|=277 [ 256	19	1099915	21 ]
 parent [ 6601900 ] : 6682017 0.125154 (=6827/(319*171)) 88.6983
  given [ 6601900 ] : 6601900 0.413011 (=5555/(50*269)) 62.519
	best keyword for cluster 6601900 is PF01895 with Jaccard = 0.8649	[ 256	19	1099915	21 ]	0.9309 0.9242
sibling [ 6601900 ] : 6630073 0.257396 (=87/(2*169)) 74.6247
	best keyword for cluster 6630073 is PF02690 with Jaccard = 0.9936	[ 155	1	1100055	0 ]	0.9936 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF01895 ( PF01895   PhoU family )
	B> PF02690 ( PF02690   Na+/Pi-cotransporter )

	Neither A nor B are assigned a clan.

	the two keywords coincide on Uniref90 proteins: |PF01895| = 277 ,	 |PF02690| = 155 , 	|PF01895^PF02690| = 11	( 4.0% and 7.1% )

	only  PF01895 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 579 ) 6741682_PF02992_PF03004            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF03004  is 6735857 with Jaccard = 0.8630	|PF03004|=71 [ 63	2	1100138	8 ]
 parent [ 6735857 ] : 6741682 0.0225235 (=2994/(134*992)) 97.9417
  given [ 6735857 ] : 6735857 0.0282258 (=35/(10*124)) 97.3586
	best keyword for cluster 6735857 is PF03004 with Jaccard = 0.8630	[ 63	2	1100138	8 ]	0.9692 0.8873
sibling [ 6735857 ] : 6737777 0.0292634 (=29/(1*991)) 97.5643
	best keyword for cluster 6737777 is PF02992 with Jaccard = 0.7841	[ 385	98	1099720	8 ]	0.7971 0.9796

	SUGGESTING RELATEDNESS OF:   
	A> PF03004 ( PF03004   Plant transposase (Ptta/En/Spm family) )
	B> PF02992 ( PF02992   Transposase family tnp2 )

	Neither A nor B are assigned a clan.

	the two keywords coincide on Uniref90 proteins: |PF02992| = 393 ,	 |PF03004| = 71 , 	|PF02992^PF03004| = 5	( 1.3% and 7.0% )

	Neither PF03004 nor PF02992 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 580 ) 6693891_PF00132_PF00483            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00132  is 6676082 with Jaccard = 0.8619	|PF00132|=2358 [ 2053	24	1097829	305 ]
 parent [ 6676082 ] : 6693891 0.101199 (=372306/(2438*1509)) 91.0764
  given [ 6676082 ] : 6676082 0.162283 (=9014/(23*2415)) 87.1826
	best keyword for cluster 6676082 is PF00132 with Jaccard = 0.8619	[ 2053	24	1097829	305 ]	0.9884 0.8707
sibling [ 6676082 ] : 6677300 0.133046 (=401/(2*1507)) 87.5139
	best keyword for cluster 6677300 is PF00483 with Jaccard = 0.7386	[ 1297	58	1098455	401 ]	0.9572 0.7638

	SUGGESTING RELATEDNESS OF:   
	A> PF00132 ( PF00132   Bacterial transferase hexapeptide (three repeats) )
	B> PF00483 ( PF00483   Nucleotidyl transferase )

	Only B has a clan ( CL0110.6 ).

	the two keywords coincide on Uniref90 proteins: |PF00132| = 2358 ,	 |PF00483| = 1698 , 	|PF00132^PF00483| = 331	( 14.0% and 19.5% )

	both PF00132 and PF00483 have PDB structures
	PF00132 	   b.81.1.1  b.81.1.2  b.81.1.3  b.81.1.5  b.81.1.6
	
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 581 ) 6608488_PF00378_PF00725            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00378  is 6602365 with Jaccard = 0.8614	|PF00378|=2033 [ 1752	1	1098177	281 ]
 parent [ 6602365 ] : 6608488 0.346156 (=428907/(644*1924)) 66.0146
  given [ 6602365 ] : 6602365 0.413111 (=1588/(2*1922)) 62.9686
	best keyword for cluster 6602365 is PF00378 with Jaccard = 0.8614	[ 1752	1	1098177	281 ]	0.9994 0.8618
sibling [ 6602365 ] : 6593490 0.479459 (=922/(3*641)) 58.6697
	best keyword for cluster 6593490 is PF00725 with Jaccard = 0.9474	[ 576	12	1099603	20 ]	0.9796 0.9664

	SUGGESTING RELATEDNESS OF:   
	A> PF00378 ( PF00378   Enoyl-CoA hydratase/isomerase family )
	B> PF00725 ( PF00725   3-hydroxyacyl-CoA dehydrogenase, C-terminal domain )

	A and B come from different clans ( CL0127.6 , CL0106.7 ).

	the two keywords coincide on Uniref90 proteins: |PF00378| = 2033 ,	 |PF00725| = 596 , 	|PF00378^PF00725| = 241	( 11.9% and 40.4% )

	both PF00378 and PF00725 have PDB structures
	PF00378 	   c.14.1.3
	PF00725 	   a.100.1.3

	SUPERFAM mapping significantly overlapping:
     1	PF00725 SSF48179 0.569 (average over 1877 mutual instances, PF00725 3749 appearances, SSF48179 20570 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 582 ) 6692694_PF00795_PF02540            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00795  is 6676809 with Jaccard = 0.8609	|PF00795|=1201 [ 1034	0	1099010	167 ]
 parent [ 6676809 ] : 6692694 0.10092 (=53595/(459*1157)) 90.8178
  given [ 6676809 ] : 6676809 0.1492 (=31811/(230*927)) 87.4004
	best keyword for cluster 6676809 is PF00795 with Jaccard = 0.8609	[ 1034	0	1099010	167 ]	1.0000 0.8609
sibling [ 6676809 ] : 6651913 0.228995 (=8280/(358*101)) 80.6531
	best keyword for cluster 6651913 is PF02540 with Jaccard = 0.7392	[ 326	83	1099770	32 ]	0.7971 0.9106

	SUGGESTING RELATEDNESS OF:   
	A> PF00795 ( PF00795   Carbon-nitrogen hydrolase )
	B> PF02540 ( PF02540   NAD synthase )

	Only B has a clan ( CL0039.7 ).

	the two keywords coincide on Uniref90 proteins: |PF00795| = 1201 ,	 |PF02540| = 358 , 	|PF00795^PF02540| = 150	( 12.5% and 41.9% )

	both PF00795 and PF02540 have PDB structures
	PF00795 	   d.160.1.1  d.160.1.2
	
	SUPERFAM mapping significantly overlapping:
     1	PF00795 SSF56317 0.652 (average over 3308 mutual instances, PF00795 3415 appearances, SSF56317 3928 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 583 ) 6636556_PF00346_PF00374            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00374  is 6600211 with Jaccard = 0.8606	|PF00374|=287 [ 247	0	1099924	40 ]
 parent [ 6600211 ] : 6636556 0.271283 (=26452/(281*347)) 76.0845
  given [ 6600211 ] : 6600211 0.456989 (=255/(2*279)) 61.7438
	best keyword for cluster 6600211 is PF00374 with Jaccard = 0.8606	[ 247	0	1099924	40 ]	1.0000 0.8606
sibling [ 6600211 ] : 6612724 0.347826 (=240/(2*345)) 67.6607
	best keyword for cluster 6612724 is PF00346 with Jaccard = 0.9904	[ 309	3	1099899	0 ]	0.9904 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF00374 ( PF00374   Nickel-dependent hydrogenase )
	B> PF00346 ( PF00346   Respiratory-chain NADH dehydrogenase, 49 Kd subunit )

	Neither A nor B are assigned a clan.

	the two keywords coincide on Uniref90 proteins: |PF00346| = 309 ,	 |PF00374| = 287 , 	|PF00346^PF00374| = 37	( 12.0% and 12.9% )

	only  PF00374 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 584 ) 6687705_PF06151_PF08395            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF08395  is 6658598 with Jaccard = 0.8598	|PF08395|=164 [ 141	0	1100047	23 ]
 parent [ 6658598 ] : 6687705 0.131944 (=475/(25*144)) 89.7932
  given [ 6658598 ] : 6658598 0.198354 (=241/(9*135)) 82.987
	best keyword for cluster 6658598 is PF08395 with Jaccard = 0.8598	[ 141	0	1100047	23 ]	1.0000 0.8598
sibling [ 6658598 ] : 6613064 0.378788 (=25/(22*3)) 67.8268
	best keyword for cluster 6613064 is PF06151 with Jaccard = 0.8333	[ 20	4	1100187	0 ]	0.8333 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF08395 ( PF08395   7tm Chemosensory receptor )
	B> PF06151 ( PF06151   Trehalose receptor )

	they come from the same clan: CL0176.5 : PF02949	PF08395	PF03268	PF06151
	the two keywords do not coincide on UniRef90 proteins

	Neither PF08395 nor PF06151 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 585 ) 6737788_PF05514_PF07681            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF07681  is 6729047 with Jaccard = 0.8595	|PF07681|=443 [ 422	48	1099720	21 ]
 parent [ 6729047 ] : 6737788 0.0283 (=280/(17*582)) 97.5656
  given [ 6729047 ] : 6729047 0.0440906 (=1340/(58*524)) 96.6143
	best keyword for cluster 6729047 is PF07681 with Jaccard = 0.8595	[ 422	48	1099720	21 ]	0.8979 0.9526
sibling [ 6729047 ] : 6701508 0.1 (=6/(12*5)) 92.4433
	best keyword for cluster 6701508 is PF05514 with Jaccard = 0.9231	[ 12	1	1100198	0 ]	0.9231 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF07681 ( PF07681   DoxX )
	B> PF05514 ( PF05514   HR-like lesion-inducing )

	Only A has a clan ( CL0131.6 ).
	the two keywords do not coincide on UniRef90 proteins

	Neither PF07681 nor PF05514 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 586 ) 6646199_PF00131_PF01439            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01439  is 6630315 with Jaccard = 0.8581	|PF01439|=144 [ 133	11	1100056	11 ]
 parent [ 6630315 ] : 6646199 0.292884 (=9261/(186*170)) 78.7596
  given [ 6630315 ] : 6630315 0.375 (=369/(6*164)) 74.6983
	best keyword for cluster 6630315 is PF01439 with Jaccard = 0.8581	[ 133	11	1100056	11 ]	0.9236 0.9236
sibling [ 6630315 ] : 6628254 0.315164 (=1353/(27*159)) 73.8056
	best keyword for cluster 6628254 is PF00131 with Jaccard = 0.7431	[ 81	23	1100102	5 ]	0.7788 0.9419

	SUGGESTING RELATEDNESS OF:   
	A> PF01439 ( PF01439   Metallothionein )
	B> PF00131 ( PF00131   Metallothionein )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF01439 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 587 ) 6556301_PF04877_PF07132            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04877  is 5618755 with Jaccard = 0.8571	|PF04877|=7 [ 6	0	1100204	1 ]
 parent [ 5618755 ] : 6556301 0.625 (=30/(6*8)) 43.9395
  given [ 5618755 ] : 5618755 1 (=9/(3*3)) 2.47778e-77
	best keyword for cluster 5618755 is PF04877 with Jaccard = 0.8571	[ 6	0	1100204	1 ]	1.0000 0.8571
sibling [ 5618755 ] : 6363179 1 (=7/(1*7)) 3.91429e-05
	best keyword for cluster 6363179 is PF07132 with Jaccard = 0.8750	[ 7	1	1100203	0 ]	0.8750 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF04877 ( PF04877   HrpZ )
	B> PF07132 ( PF07132   Harpin protein (HrpN) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF04877 nor PF07132 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 588 ) 6734062_PF01663_PF01676            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01663  is 6689075 with Jaccard = 0.8527	|PF01663|=306 [ 301	47	1099858	5 ]
 parent [ 6689075 ] : 6734062 0.0374571 (=6857/(439*417)) 97.1713
  given [ 6689075 ] : 6689075 0.124902 (=640/(12*427)) 90.0879
	best keyword for cluster 6689075 is PF01663 with Jaccard = 0.8527	[ 301	47	1099858	5 ]	0.8649 0.9837
sibling [ 6689075 ] : 6713780 0.0596591 (=735/(385*32)) 94.5262
	best keyword for cluster 6713780 is PF01676 with Jaccard = 0.9446	[ 341	11	1099850	9 ]	0.9688 0.9743

	SUGGESTING RELATEDNESS OF:   
	A> PF01663 ( PF01663   Type I phosphodiesterase / nucleotide pyrophosphatase )
	B> PF01676 ( PF01676   Metalloenzyme superfamily )

	they come from the same clan: CL0088.10 : PF00884	PF01663	PF08665	PF01676	PF02995	PF07394	PF00245
	the two keywords do not coincide on UniRef90 proteins

	both PF01663 and PF01676 have PDB structures
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 589 ) 6674985_PF05646_PF07019            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF05646  is 6664519 with Jaccard = 0.8519	|PF05646|=27 [ 23	0	1100184	4 ]
 parent [ 6664519 ] : 6674985 0.16129 (=110/(22*31)) 86.8959
  given [ 6664519 ] : 6664519 0.166667 (=18/(27*4)) 84.1916
	best keyword for cluster 6664519 is PF05646 with Jaccard = 0.8519	[ 23	0	1100184	4 ]	1.0000 0.8519
sibling [ 6664519 ] : 6602333 0.4 (=16/(2*20)) 62.9261
	best keyword for cluster 6602333 is PF07019 with Jaccard = 0.9444	[ 17	1	1100193	0 ]	0.9444 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF05646 ( PF05646   Protein of unknown function (DUF786) )
	B> PF07019 ( PF07019   Rab5-interacting protein (Rab5ip) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF05646 nor PF07019 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 590 ) 6651854_PF00069_PF08311            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF08311  is 6580339 with Jaccard = 0.8491	|PF08311|=46 [ 45	7	1100158	1 ]
 parent [ 6580339 ] : 6651854 0.225494 (=165733/(58*12672)) 80.6142
  given [ 6580339 ] : 6580339 0.5 (=56/(2*56)) 53.832
	best keyword for cluster 6580339 is PF08311 with Jaccard = 0.8491	[ 45	7	1100158	1 ]	0.8654 0.9783
sibling [ 6580339 ] : 6650440 0.237647 (=57132/(19*12653)) 80.1955
	best keyword for cluster 6650440 is PF00069 with Jaccard = 0.7752	[ 10205	1790	1087046	1170 ]	0.8508 0.8971

	SUGGESTING RELATEDNESS OF:   
	A> PF08311 ( PF08311   Mad3/BUB1 homology region 1 )
	B> PF00069 ( PF00069   Protein kinase domain )

	Only B has a clan ( CL0016.14 ).

	the two keywords coincide on Uniref90 proteins: |PF00069| = 11375 ,	 |PF08311| = 46 , 	|PF00069^PF08311| = 24	( 0.2% and 52.2% )

	only  PF08311 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:
     1	PF00069 SSF56112 0.797 (average over 32363 mutual instances, PF00069 36405 appearances, SSF56112 66637 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 591 ) 6468685_PF02134_PF05237            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02134  is 6428878 with Jaccard = 0.8485	|PF02134|=157 [ 140	8	1100046	17 ]
 parent [ 6428878 ] : 6468685 0.970412 (=79959/(149*553)) 3.51969
  given [ 6428878 ] : 6428878 0.997629 (=4208/(38*111)) 0.254901
	best keyword for cluster 6428878 is PF02134 with Jaccard = 0.8485	[ 140	8	1100046	17 ]	0.9459 0.8917
sibling [ 6428878 ] : 6461015 0.980108 (=53608/(129*424)) 2.44493
	best keyword for cluster 6461015 is PF05237 with Jaccard = 0.6440	[ 331	178	1099697	5 ]	0.6503 0.9851

	SUGGESTING RELATEDNESS OF:   
	A> PF02134 ( PF02134   Repeat in ubiquitin-activating (UBA) protein )
	B> PF05237 ( PF05237   MoeZ/MoeB domain )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	both PF02134 and PF05237 have PDB structures
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 592 ) 6778122_PF04406_PF06882            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF06882  is 6770500 with Jaccard = 0.8485	|PF06882|=64 [ 56	2	1100145	8 ]
 parent [ 6770500 ] : 6778122 0.000804934 (=65/(412*196)) 99.9224
  given [ 6770500 ] : 6770500 0.00282636 (=112/(153*259)) 99.7276
	best keyword for cluster 6770500 is PF06882 with Jaccard = 0.8485	[ 56	2	1100145	8 ]	0.9655 0.8750
sibling [ 6770500 ] : 6774757 0.0019971 (=11/(34*162)) 99.8496
	best keyword for cluster 6774757 is PF04406 with Jaccard = 0.9500	[ 57	3	1100151	0 ]	0.9500 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF06882 ( PF06882   Protein of unknown function (DUF1263) )
	B> PF04406 ( PF04406   Type IIB DNA topoisomerase )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF06882 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 593 ) 6754793_PF04977_PF04999            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04977  is 6713845 with Jaccard = 0.8462	|PF04977|=208 [ 176	0	1100003	32 ]
 parent [ 6713845 ] : 6754793 0.0135887 (=481/(207*171)) 98.9564
  given [ 6713845 ] : 6713845 0.0691942 (=407/(34*173)) 94.5381
	best keyword for cluster 6713845 is PF04977 with Jaccard = 0.8462	[ 176	0	1100003	32 ]	1.0000 0.8462
sibling [ 6713845 ] : 6745174 0.0247302 (=110/(32*139)) 98.247
	best keyword for cluster 6745174 is PF04999 with Jaccard = 0.7558	[ 65	21	1100125	0 ]	0.7558 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF04977 ( PF04977   Septum formation initiator )
	B> PF04999 ( PF04999   Cell division protein FtsL )

	they come from the same clan: CL0225.3 : PF04977	PF04999

	the two keywords coincide on Uniref90 proteins: |PF04977| = 208 ,	 |PF04999| = 65 , 	|PF04977^PF04999| = 1	( 0.5% and 1.5% )

	Neither PF04977 nor PF04999 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 594 ) 6643783_PF00060_PF00497            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00497  is 6639432 with Jaccard = 0.8442	|PF00497|=995 [ 840	0	1099216	155 ]
 parent [ 6639432 ] : 6643783 0.252819 (=79745/(303*1041)) 78.0743
  given [ 6639432 ] : 6639432 0.245878 (=2535/(10*1031)) 76.851
	best keyword for cluster 6639432 is PF00497 with Jaccard = 0.8442	[ 840	0	1099216	155 ]	1.0000 0.8442
sibling [ 6639432 ] : 6629027 0.273288 (=487/(6*297)) 74.0437
	best keyword for cluster 6629027 is PF00060 with Jaccard = 0.8930	[ 242	29	1099940	0 ]	0.8930 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF00497 ( PF00497   Bacterial extracellular solute-binding proteins, family 3 )
	B> PF00060 ( PF00060   Ligand-gated ion channel )

	Only A has a clan ( CL0177.7 ).

	the two keywords coincide on UniRef90 proteins: |PF00060| = 242 ,	 |PF00497| = 995 , 	|PF00060^PF00497| = 5	( 2.1% and 0.5% )

	both PF00497 and PF00060 have PDB structures
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 595 ) 6646403_PF02117_PF02175            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02117  is 6561562 with Jaccard = 0.8438	|PF02117|=64 [ 54	0	1100147	10 ]
 parent [ 6561562 ] : 6646403 0.229237 (=817/(54*66)) 78.9067
  given [ 6561562 ] : 6561562 0.529412 (=81/(3*51)) 48.0511
	best keyword for cluster 6561562 is PF02117 with Jaccard = 0.8438	[ 54	0	1100147	10 ]	1.0000 0.8438
sibling [ 6561562 ] : 6626574 0.302419 (=75/(4*62)) 73.1491
	best keyword for cluster 6626574 is PF02175 with Jaccard = 0.7021	[ 33	12	1100164	2 ]	0.7333 0.9429

	SUGGESTING RELATEDNESS OF:   
	A> PF02117 ( PF02117   C.elegans Sra family integral membrane protein )
	B> PF02175 ( PF02175   C.elegans integral membrane protein Srb )

	they come from the same clan: CL0138.6 : PF02117	PF02175	PF03125
	the two keywords do not coincide on UniRef90 proteins

	Neither PF02117 nor PF02175 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 596 ) 6753694_PF05154_PF07754            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF05154  is 6725797 with Jaccard = 0.8417	|PF05154|=139 [ 117	0	1100072	22 ]
 parent [ 6725797 ] : 6753694 0.0157154 (=1347/(176*487)) 98.8841
  given [ 6725797 ] : 6725797 0.0416667 (=82/(164*12)) 96.208
	best keyword for cluster 6725797 is PF05154 with Jaccard = 0.8417	[ 117	0	1100072	22 ]	1.0000 0.8417
sibling [ 6725797 ] : 6750033 0.0229825 (=131/(12*475)) 98.6209
	best keyword for cluster 6750033 is PF07754 with Jaccard = 0.6667	[ 18	8	1100184	1 ]	0.6923 0.9474

	SUGGESTING RELATEDNESS OF:   
	A> PF05154 ( PF05154   TM2 domain )
	B> PF07754 ( PF07754   Domain of unknown function (DUF1610) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF05154 nor PF07754 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 597 ) 6613166_PF00441_PF08028            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02770  is 6605561 with Jaccard = 0.8409	|PF02770|=1925 [ 1813	231	1098055	112 ]
 parent [ 6605561 ] : 6613166 0.341475 (=217145/(281*2263)) 67.8683
  given [ 6605561 ] : 6605561 0.382416 (=125697/(2107*156)) 64.4188
	best keyword for cluster 6605561 is PF00441 with Jaccard = 0.9296	[ 1927	117	1098138	29 ]	0.9428 0.9852
sibling [ 6605561 ] : 6555437 0.599206 (=4379/(252*29)) 43.0147
	best keyword for cluster 6555437 is PF08028 with Jaccard = 0.8674	[ 229	23	1099947	12 ]	0.9087 0.9502

	SUGGESTING RELATEDNESS OF:   
	A> PF00441 ( PF00441   Acyl-CoA dehydrogenase, C-terminal domain )
	B> PF08028 ( PF08028   Acyl-CoA dehydrogenase, C-terminal domain )

	they come from the same clan: CL0087.7 : PF01756	PF00441	PF08028
	the two keywords do not coincide on UniRef90 proteins

	only  PF00441 has a PDB structure (may not be up to date)
	PF00441 	   a.29.3.1
	
	SUPERFAM mapping significantly overlapping:
     1	PF08028 SSF47203 0.844 (average over 653 mutual instances, PF08028 1323 appearances, SSF47203 17996 appearances)
     2	PF00441 SSF47203 0.910 (average over 6570 mutual instances, PF00441 13147 appearances, SSF47203 17996 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 598 ) 6516641_PF00759_PF02927            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02927  is 6455799 with Jaccard = 0.8409	|PF02927|=39 [ 37	5	1100167	2 ]
 parent [ 6455799 ] : 6516641 0.84375 (=7425/(44*200)) 18.4615
  given [ 6455799 ] : 6455799 0.983333 (=472/(20*24)) 1.8588
	best keyword for cluster 6455799 is PF02927 with Jaccard = 0.8409	[ 37	5	1100167	2 ]	0.8810 0.9487
sibling [ 6455799 ] : 6514071 0.831633 (=652/(4*196)) 17.0268
	best keyword for cluster 6514071 is PF00759 with Jaccard = 0.7471	[ 195	0	1099950	66 ]	1.0000 0.7471

	SUGGESTING RELATEDNESS OF:   
	A> PF02927 ( PF02927   N-terminal ig-like domain of cellulase )
	B> PF00759 ( PF00759   Glycosyl hydrolase family 9 )

	Only B has a clan ( CL0059.10 ).

	the two keywords coincide on UniRef90 proteins: |PF00759| = 261 ,	 |PF02927| = 39 , 	|PF00759^PF02927| = 39	( 14.9% and 100.0% )

	both PF02927 and PF00759 have PDB structures
	PF02927 	   b.1.18.2  b.18.1.24
	PF00759 	   a.102.1.2

	SUPERFAM mapping significantly overlapping:
     1	PF00759 SSF48208 0.965 (average over 621 mutual instances, PF00759 956 appearances, SSF48208 6032 appearances)
     2	PF02927 SSF81296 0.845 (average over 77 mutual instances, PF02927 221 appearances, SSF81296 30857 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 599 ) 6709420_PF00266_PF01053            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00266  is 6704692 with Jaccard = 0.8407	|PF00266|=1537 [ 1304	14	1098660	233 ]
 parent [ 6704692 ] : 6709420 0.0748695 (=117157/(1046*1496)) 93.8493
  given [ 6704692 ] : 6704692 0.0738941 (=441/(4*1492)) 93.014
	best keyword for cluster 6704692 is PF00266 with Jaccard = 0.8407	[ 1304	14	1098660	233 ]	0.9894 0.8484
sibling [ 6704692 ] : 6687938 0.123536 (=11055/(94*952)) 89.8447
	best keyword for cluster 6687938 is PF01053 with Jaccard = 0.8562	[ 810	131	1099265	5 ]	0.8608 0.9939

	SUGGESTING RELATEDNESS OF:   
	A> PF00266 ( PF00266   Aminotransferase class-V )
	B> PF01053 ( PF01053   Cys/Met metabolism PLP-dependent enzyme )

	they come from the same clan: CL0061.8 : PF05889	PF00464	PF03841	PF00282	PF01276	PF02347	PF01041	PF01053	PF01212	PF00266	PF00202	PF00155	PF06838	PF04864

	the two keywords coincide on UniRef90 proteins: |PF00266| = 1537 ,	 |PF01053| = 815 , 	|PF00266^PF01053| = 1	( 0.1% and 0.1% )

	both PF00266 and PF01053 have PDB structures
	PF00266 	   c.67.1.3  c.67.1.4
	PF01053 	   c.67.1.3

	SUPERFAM mapping significantly overlapping:
     1	PF00266 SSF53383 0.863 (average over 4864 mutual instances, PF00266 4914 appearances, SSF53383 34644 appearances)
     2	PF01053 SSF53383 0.965 (average over 2570 mutual instances, PF01053 2583 appearances, SSF53383 34644 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 600 ) 6676383_PF00969_PF07654            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00969  is 6668343 with Jaccard = 0.8404	|PF00969|=1642 [ 1380	0	1098569	262 ]
 parent [ 6668343 ] : 6676383 0.1361 (=403911/(1433*2071)) 87.2854
  given [ 6668343 ] : 6668343 0.161995 (=1387/(6*1427)) 85.0401
	best keyword for cluster 6668343 is PF00969 with Jaccard = 0.8404	[ 1380	0	1098569	262 ]	1.0000 0.8404
sibling [ 6668343 ] : 6672045 0.150629 (=1556/(5*2066)) 86.0136
	best keyword for cluster 6672045 is PF07654 with Jaccard = 0.6767	[ 1411	592	1098126	82 ]	0.7044 0.9451

	SUGGESTING RELATEDNESS OF:   
	A> PF00969 ( PF00969   Class II histocompatibility antigen, beta domain )
	B> PF07654 ( PF07654   Immunoglobulin C1-set domain )

	Only B has a clan ( CL0011.18 ).

	the two keywords coincide on UniRef90 proteins: |PF00969| = 1642 ,	 |PF07654| = 1493 , 	|PF00969^PF07654| = 281	( 17.1% and 18.8% )

	both PF00969 and PF07654 have PDB structures
		PF07654 	   b.1.1.2

	SUPERFAM mapping significantly overlapping:
     1	PF00969 SSF54452 0.877 (average over 8969 mutual instances, PF00969 8969 appearances, SSF54452 25772 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 601 ) 6635446_PF03534_PF05593            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF03534  is 6514144 with Jaccard = 0.8400	|PF03534|=23 [ 21	2	1100186	2 ]
 parent [ 6514144 ] : 6635446 0.253386 (=3180/(25*502)) 75.8558
  given [ 6514144 ] : 6514144 0.833333 (=20/(1*24)) 17.076
	best keyword for cluster 6514144 is PF03534 with Jaccard = 0.8400	[ 21	2	1100186	2 ]	0.9130 0.9130
sibling [ 6514144 ] : 6633457 0.287075 (=1688/(12*490)) 75.4311
	best keyword for cluster 6633457 is PF05593 with Jaccard = 0.9171	[ 321	11	1099861	18 ]	0.9669 0.9469

	SUGGESTING RELATEDNESS OF:   
	A> PF03534 ( PF03534   Salmonella virulence plasmid 65kDa B protein )
	B> PF05593 ( PF05593   RHS Repeat )

	Neither A nor B are assigned a clan.

	the two keywords coincide on UniRef90 proteins: |PF03534| = 23 ,	 |PF05593| = 339 , 	|PF03534^PF05593| = 4	( 17.4% and 1.2% )

	Neither PF03534 nor PF05593 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 602 ) 6703032_PF00332_PF03198            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00332  is 6657782 with Jaccard = 0.8398	|PF00332|=313 [ 304	49	1099849	9 ]
 parent [ 6657782 ] : 6703032 0.0838167 (=3150/(437*86)) 92.7213
  given [ 6657782 ] : 6657782 0.209872 (=7220/(334*103)) 82.6347
	best keyword for cluster 6657782 is PF00332 with Jaccard = 0.8398	[ 304	49	1099849	9 ]	0.8612 0.9712
sibling [ 6657782 ] : 6676339 0.136546 (=34/(83*3)) 87.2635
	best keyword for cluster 6676339 is PF03198 with Jaccard = 0.9878	[ 81	1	1100129	0 ]	0.9878 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF00332 ( PF00332   Glycosyl hydrolases family 17 )
	B> PF03198 ( PF03198   Glycolipid anchored surface protein (GAS1) )

	they come from the same clan: CL0058.10 : PF07971	PF02446	PF03198	PF02324	PF02057	PF01630	PF07745	PF02449	PF01229	PF01301	PF01055	PF02055	PF00933	PF02836	PF02156	PF01183	PF00728	PF00704	PF00332	PF01373	PF00331	PF00232	PF02638	PF00150	PF00128	PF02065
	the two keywords do not coincide on UniRef90 proteins

	only  PF00332 has a PDB structure (may not be up to date)
	PF00332 	   c.1.8.3
	
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 603 ) 6670332_PF00534_PF08323            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF08323  is 6656397 with Jaccard = 0.8397	|PF08323|=281 [ 262	31	1099899	19 ]
 parent [ 6656397 ] : 6670332 0.167036 (=198174/(336*3531)) 85.6249
  given [ 6656397 ] : 6656397 0.22006 (=147/(2*334)) 82.1344
	best keyword for cluster 6656397 is PF08323 with Jaccard = 0.8397	[ 262	31	1099899	19 ]	0.8942 0.9324
sibling [ 6656397 ] : 6667843 0.17602 (=1863/(3*3528)) 84.954
	best keyword for cluster 6667843 is PF00534 with Jaccard = 0.8054	[ 3112	7	1096347	745 ]	0.9978 0.8068

	SUGGESTING RELATEDNESS OF:   
	A> PF08323 ( PF08323   Starch synthase catalytic domain )
	B> PF00534 ( PF00534   Glycosyl transferases group 1 )

	Only B has a clan ( CL0113.8 ).

	the two keywords coincide on UniRef90 proteins: |PF00534| = 3857 ,	 |PF08323| = 281 , 	|PF00534^PF08323| = 237	( 6.1% and 84.3% )

	both PF08323 and PF00534 have PDB structures
	PF08323 	   c.87.1.8
	PF00534 	   c.87.1.8

	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 604 ) 6707025_PF00102_PF00782            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00782  is 6702473 with Jaccard = 0.8354	|PF00782|=648 [ 614	87	1099476	34 ]
 parent [ 6702473 ] : 6707025 0.0898443 (=53604/(697*856)) 93.4771
  given [ 6702473 ] : 6702473 0.110245 (=2644/(29*827)) 92.6183
	best keyword for cluster 6702473 is PF00782 with Jaccard = 0.8354	[ 614	87	1099476	34 ]	0.8759 0.9475
sibling [ 6702473 ] : 6678150 0.147908 (=410/(4*693)) 87.7265
	best keyword for cluster 6678150 is PF00102 with Jaccard = 0.8047	[ 618	49	1099443	101 ]	0.9265 0.8595

	SUGGESTING RELATEDNESS OF:   
	A> PF00782 ( PF00782   Dual specificity phosphatase, catalytic domain )
	B> PF00102 ( PF00102   Protein-tyrosine phosphatase )

	they come from the same clan: CL0031.8 : PF00102	PF04273	PF00782	PF05706	PF03162
	the two keywords do not coincide on UniRef90 proteins

	both PF00782 and PF00102 have PDB structures
	PF00782 	   c.45.1.1
	
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 605 ) 6551245_PF00971_PF01045            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01045  is 6325717 with Jaccard = 0.8333	|PF01045|=6 [ 5	0	1100205	1 ]
 parent [ 6325717 ] : 6551245 0.6 (=69/(23*5)) 40
  given [ 6325717 ] : 6325717 1 (=4/(1*4)) 1e-07
	best keyword for cluster 6325717 is PF01045 with Jaccard = 0.8333	[ 5	0	1100205	1 ]	1.0000 0.8333
sibling [ 6325717 ] : 6275750 1 (=76/(19*4)) 2.17657e-11
	best keyword for cluster 6275750 is PF00971 with Jaccard = 0.8519	[ 23	0	1100184	4 ]	1.0000 0.8519

	SUGGESTING RELATEDNESS OF:   
	A> PF01045 ( PF01045   EIAV glycoprotein, gp45 )
	B> PF00971 ( PF00971   EIAV coat protein, gp90 )

	Neither A nor B are assigned a clan.

	the two keywords coincide on UniRef90 proteins: |PF00971| = 27 ,	 |PF01045| = 6 , 	|PF00971^PF01045| = 4	( 14.8% and 66.7% )

	Neither PF01045 nor PF00971 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 606 ) 6475606_PF01509_PF08068            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF08068  is 6457596 with Jaccard = 0.8333	|PF08068|=65 [ 60	7	1100139	5 ]
 parent [ 6457596 ] : 6475606 0.959075 (=19826/(304*68)) 4.78632
  given [ 6457596 ] : 6457596 0.980114 (=1035/(44*24)) 2.03885
	best keyword for cluster 6457596 is PF08068 with Jaccard = 0.8333	[ 60	7	1100139	5 ]	0.8955 0.9231
sibling [ 6457596 ] : 6413094 0.999554 (=6717/(24*280)) 0.046832
	best keyword for cluster 6413094 is PF01509 with Jaccard = 0.8023	[ 280	0	1099862	69 ]	1.0000 0.8023

	SUGGESTING RELATEDNESS OF:   
	A> PF08068 ( PF08068   DKCLD (NUC011) domain )
	B> PF01509 ( PF01509   TruB family pseudouridylate synthase (N terminal domain) )

	Neither A nor B are assigned a clan.

	the two keywords coincide on UniRef90 proteins: |PF01509| = 349 ,	 |PF08068| = 65 , 	|PF01509^PF08068| = 63	( 18.1% and 96.9% )

	both PF08068 and PF01509 have PDB structures
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 607 ) 6561138_PF01037_PF08394            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF08394  is 6336483 with Jaccard = 0.8333	|PF08394|=12 [ 10	0	1100199	2 ]
 parent [ 6336483 ] : 6561138 0.554545 (=4758/(10*858)) 47.8405
  given [ 6336483 ] : 6336483 1 (=21/(3*7)) 5.85996e-07
	best keyword for cluster 6336483 is PF08394 with Jaccard = 0.8333	[ 10	0	1100199	2 ]	1.0000 0.8333
sibling [ 6336483 ] : 6558108 0.602662 (=5615/(11*847)) 45.1333
	best keyword for cluster 6558108 is PF01037 with Jaccard = 0.8715	[ 739	18	1099363	91 ]	0.9762 0.8904

	SUGGESTING RELATEDNESS OF:   
	A> PF08394 ( PF08394   Archaeal TRASH domain )
	B> PF01037 ( PF01037   AsnC family )

	A and B come from different clans ( CL0175.5 , CL0032.9 ).
	the two keywords do not coincide on UniRef90 proteins

	only  PF08394 has a PDB structure (may not be up to date)
		PF01037 	   d.58.4.2

	SUPERFAM mapping significantly overlapping:
     1	PF01037 SSF54909 0.871 (average over 3219 mutual instances, PF01037 3221 appearances, SSF54909 7040 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 608 ) 6667472_PF00102_PF02206            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00102  is 6659227 with Jaccard = 0.8331	|PF00102|=719 [ 599	0	1099492	120 ]
 parent [ 6659227 ] : 6667472 0.155901 (=7226/(75*618)) 84.8418
  given [ 6659227 ] : 6659227 0.206026 (=506/(4*614)) 83.1854
	best keyword for cluster 6659227 is PF00102 with Jaccard = 0.8331	[ 599	0	1099492	120 ]	1.0000 0.8331
sibling [ 6659227 ] : 6659164 0.189815 (=41/(3*72)) 83.1415
	best keyword for cluster 6659164 is PF02206 with Jaccard = 0.7808	[ 57	9	1100138	7 ]	0.8636 0.8906

	SUGGESTING RELATEDNESS OF:   
	A> PF00102 ( PF00102   Protein-tyrosine phosphatase )
	B> PF02206 ( PF02206   Domain of unknown function )

	Only A has a clan ( CL0031.8 ).

	the two keywords coincide on UniRef90 proteins: |PF00102| = 719 ,	 |PF02206| = 64 , 	|PF00102^PF02206| = 16	( 2.2% and 25.0% )

	only  PF00102 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 609 ) 6753502_PF00339_PF03643            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00339  is 6745088 with Jaccard = 0.8321	|PF00339|=252 [ 228	22	1099937	24 ]
 parent [ 6745088 ] : 6753502 0.0148073 (=395/(78*342)) 98.871
  given [ 6745088 ] : 6745088 0.021021 (=63/(9*333)) 98.2416
	best keyword for cluster 6745088 is PF00339 with Jaccard = 0.8321	[ 228	22	1099937	24 ]	0.9120 0.9048
sibling [ 6745088 ] : 6744992 0.0519481 (=4/(1*77)) 98.2338
	best keyword for cluster 6744992 is PF03643 with Jaccard = 0.9853	[ 67	0	1100143	1 ]	1.0000 0.9853

	SUGGESTING RELATEDNESS OF:   
	A> PF00339 ( PF00339   Arrestin (or S-antigen), N-terminal domain )
	B> PF03643 ( PF03643   Vacuolar protein sorting-associated protein 26 )

	they come from the same clan: CL0135.6 : PF00339	PF07070	PF03643

	the two keywords coincide on UniRef90 proteins: |PF00339| = 252 ,	 |PF03643| = 68 , 	|PF00339^PF03643| = 2	( 0.8% and 2.9% )

	only  PF00339 has a PDB structure (may not be up to date)
	PF00339 	   b.1.18.11
	
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 610 ) 6733269_PF00264_PF03723            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF03722  is 6702906 with Jaccard = 0.8301	|PF03722|=171 [ 171	35	1100005	0 ]
 parent [ 6702906 ] : 6733269 0.0433528 (=3034/(216*324)) 97.0829
  given [ 6702906 ] : 6702906 0.0744186 (=16/(1*215)) 92.6929
	best keyword for cluster 6702906 is PF03723 with Jaccard = 0.9130	[ 189	17	1100004	1 ]	0.9175 0.9947
sibling [ 6702906 ] : 6732209 0.0309598 (=10/(1*323)) 96.9679
	best keyword for cluster 6732209 is PF00264 with Jaccard = 0.9864	[ 290	0	1099917	4 ]	1.0000 0.9864

	SUGGESTING RELATEDNESS OF:   
	A> PF03723 ( PF03723   Hemocyanin, ig-like domain )
	B> PF00264 ( PF00264   Common central domain of tyrosinase )

	Only B has a clan ( CL0205.5 ).
	the two keywords do not coincide on UniRef90 proteins

	both PF03723 and PF00264 have PDB structures
	PF03723 	   b.1.18.3
	
	SUPERFAM mapping significantly overlapping:
     1	PF00264 SSF48056 0.819 (average over 1601 mutual instances, PF00264 1604 appearances, SSF48056 2598 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 611 ) 6592227_PF00519_PF01057            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01057  is 6531307 with Jaccard = 0.8293	|PF01057|=41 [ 34	0	1100170	7 ]
 parent [ 6531307 ] : 6592227 0.483186 (=2845/(46*128)) 58.0319
  given [ 6531307 ] : 6531307 0.752688 (=350/(31*15)) 26.5954
	best keyword for cluster 6531307 is PF01057 with Jaccard = 0.8293	[ 34	0	1100170	7 ]	1.0000 0.8293
sibling [ 6531307 ] : 6539179 0.694444 (=175/(2*126)) 31.8946
	best keyword for cluster 6539179 is PF00519 with Jaccard = 0.9920	[ 124	1	1100086	0 ]	0.9920 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF01057 ( PF01057   Parvovirus non-structural protein NS1 )
	B> PF00519 ( PF00519   Papillomavirus helicase )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	both PF01057 and PF00519 have PDB structures
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 612 ) 6686807_PF03706_PF04329            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF03706  is 6674998 with Jaccard = 0.8273	|PF03706|=138 [ 115	1	1100072	23 ]
 parent [ 6674998 ] : 6686807 0.124997 (=4922/(169*233)) 89.6224
  given [ 6674998 ] : 6674998 0.157307 (=958/(30*203)) 86.9023
	best keyword for cluster 6674998 is PF03706 with Jaccard = 0.8273	[ 115	1	1100072	23 ]	0.9914 0.8333
sibling [ 6674998 ] : 6653191 0.228819 (=740/(22*147)) 81.0397
	best keyword for cluster 6653191 is PF04329 with Jaccard = 0.8280	[ 77	6	1100118	10 ]	0.9277 0.8851

	SUGGESTING RELATEDNESS OF:   
	A> PF03706 ( PF03706   Uncharacterised protein family (UPF0104) )
	B> PF04329 ( PF04329   Family of unknown function (DUF470) )

	Neither A nor B are assigned a clan.

	the two keywords coincide on UniRef90 proteins: |PF03706| = 138 ,	 |PF04329| = 87 , 	|PF03706^PF04329| = 13	( 9.4% and 14.9% )

	Neither PF03706 nor PF04329 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 613 ) 6692803_PF03205_PF07683            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF03205  is 6524027 with Jaccard = 0.8261	|PF03205|=138 [ 114	0	1100073	24 ]
 parent [ 6524027 ] : 6692803 0.128002 (=7504/(128*458)) 90.8378
  given [ 6524027 ] : 6524027 0.797333 (=299/(3*125)) 22.2543
	best keyword for cluster 6524027 is PF03205 with Jaccard = 0.8261	[ 114	0	1100073	24 ]	1.0000 0.8261
sibling [ 6524027 ] : 6690468 0.10022 (=182/(454*4)) 90.356
	best keyword for cluster 6690468 is PF07683 with Jaccard = 0.8228	[ 339	70	1099799	3 ]	0.8289 0.9912

	SUGGESTING RELATEDNESS OF:   
	A> PF03205 ( PF03205   Molybdopterin guanine dinucleotide synthesis protein B )
	B> PF07683 ( PF07683   Cobalamin synthesis protein cobW C-terminal domain )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	both PF03205 and PF07683 have PDB structures
		PF07683 	   d.237.1.1

	SUPERFAM mapping significantly overlapping:
     1	PF07683 SSF90002 0.980 (average over 1038 mutual instances, PF07683 1046 appearances, SSF90002 2049 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 614 ) 6701651_PF00155_PF00392            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00155  is 6694887 with Jaccard = 0.8258	|PF00155|=3377 [ 2816	33	1096801	561 ]
 parent [ 6694887 ] : 6701651 0.0820302 (=567380/(2193*3154)) 92.4733
  given [ 6694887 ] : 6694887 0.101111 (=1592/(5*3149)) 91.2929
	best keyword for cluster 6694887 is PF00155 with Jaccard = 0.8258	[ 2816	33	1096801	561 ]	0.9884 0.8339
sibling [ 6694887 ] : 6658415 0.186615 (=1634/(4*2189)) 82.8631
	best keyword for cluster 6658415 is PF00392 with Jaccard = 0.7866	[ 1950	14	1097732	515 ]	0.9929 0.7911

	SUGGESTING RELATEDNESS OF:   
	A> PF00155 ( PF00155   Aminotransferase class I and II )
	B> PF00392 ( PF00392   Bacterial regulatory proteins, gntR family )

	A and B come from different clans ( CL0061.8 , CL0123.12 ).

	the two keywords coincide on UniRef90 proteins: |PF00155| = 3377 ,	 |PF00392| = 2465 , 	|PF00155^PF00392| = 432	( 12.8% and 17.5% )

	both PF00155 and PF00392 have PDB structures
	PF00155 	   c.67.1.1  c.67.1.3  c.67.1.4
	PF00392 	   a.4.5.6

	SUPERFAM mapping significantly overlapping:
     1	PF00155 SSF53383 0.849 (average over 10819 mutual instances, PF00155 10880 appearances, SSF53383 34644 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 615 ) 6723269_PF06283_PF06439            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF06439  is 6628229 with Jaccard = 0.8246	|PF06439|=57 [ 47	0	1100154	10 ]
 parent [ 6628229 ] : 6723269 0.0421941 (=180/(54*79)) 95.8968
  given [ 6628229 ] : 6628229 0.326531 (=80/(5*49)) 73.7849
	best keyword for cluster 6628229 is PF06439 with Jaccard = 0.8246	[ 47	0	1100154	10 ]	1.0000 0.8246
sibling [ 6628229 ] : 6686033 0.135093 (=174/(23*56)) 89.4827
	best keyword for cluster 6686033 is PF06283 with Jaccard = 0.7778	[ 21	6	1100184	0 ]	0.7778 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF06439 ( PF06439   Domain of Unknown Function (DUF1080) )
	B> PF06283 ( PF06283   Protein of unknown function (DUF1037) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF06439 nor PF06283 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:
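The %LINKAGE values on the parent / given / sibling lines can be re-derived from the fraction printed in parentheses, following the legend (existing edges divided by |cluster1|*|cluster2|, cluster size |cluster1| + |cluster2|). The sketch below is a hypothetical helper, not part of the original pipeline; the regular expression and the tolerance are assumptions based on how the lines in this report are laid out.

```python
import re

# Matches ": <cluster id> <linkage> (=<edges>/(<size1>*<size2>))" on a relation line.
LINKAGE_RE = re.compile(
    r":\s*(\d+)\s+([\d.eE+-]+)\s+\(=(\d+)/\((\d+)\*(\d+)\)\)"
)


def check_linkage(line, tol=1e-5):
    """Return (cluster_id, cluster_size, recomputed_linkage, matches_report)."""
    m = LINKAGE_RE.search(line)
    if m is None:
        raise ValueError("not a parent/given/sibling relation line")
    cluster_id = m.group(1)
    reported = float(m.group(2))
    edges, size1, size2 = (int(m.group(i)) for i in (3, 4, 5))
    recomputed = edges / (size1 * size2)
    return cluster_id, size1 + size2, recomputed, abs(recomputed - reported) <= tol


# The 'given' line of this record.
print(check_linkage("  given [ 6628229 ] : 6628229 0.326531 (=80/(5*49)) 73.7849"))
```

A False in the last field would point to a transcription problem in the line rather than to anything biological.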






-------------------====== ( 616 ) 6589675_PF00063_PF00784            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00063  is 6580724 with Jaccard = 0.8237	|PF00063|=638 [ 528	3	1099570	110 ]
 parent [ 6580724 ] : 6589675 0.43471 (=12787/(53*555)) 57.1727
  given [ 6580724 ] : 6580724 0.471636 (=1297/(5*550)) 53.9676
	best keyword for cluster 6580724 is PF00063 with Jaccard = 0.8237	[ 528	3	1099570	110 ]	0.9944 0.8276
sibling [ 6580724 ] : 6552525 0.692308 (=36/(1*52)) 40.8609
	best keyword for cluster 6552525 is PF00784 with Jaccard = 0.6234	[ 48	4	1100134	25 ]	0.9231 0.6575

	SUGGESTING RELATEDNESS OF:   
	A> PF00063 ( PF00063   Myosin head (motor domain) )
	B> PF00784 ( PF00784   MyTH4 domain )

	Neither A nor B are assigned a clan.

	the two keywords coincide on UniRef90 proteins: |PF00063| = 638 ,	 |PF00784| = 73 , 	|PF00063^PF00784| = 34	( 5.3% and 46.6% )

	only  PF00063 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:
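The two percentages on the co-occurrence line appear to be the shared protein count divided by each keyword's own count, in the order the keywords are listed. A minimal sketch under that assumption (the function name is hypothetical):

```python
def overlap_percentages(size_a, size_b, size_both):
    """Percent of A's proteins and of B's proteins that carry both keywords."""
    return 100.0 * size_both / size_a, 100.0 * size_both / size_b


# |PF00063| = 638, |PF00784| = 73, |PF00063^PF00784| = 34 from this record.
pct_a, pct_b = overlap_percentages(638, 73, 34)
print(f"( {pct_a:.1f}% and {pct_b:.1f}% )")
```

Read this way, almost half of the MyTH4 proteins also carry a myosin head domain, while only a small fraction of myosin head proteins carry MyTH4, which is consistent with domain co-occurrence rather than homology between the two families.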






-------------------====== ( 617 ) 6716126_PF00076_PF01805            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00076  is 6714471 with Jaccard = 0.8228	|PF00076|=4044 [ 3441	138	1096029	603 ]
 parent [ 6714471 ] : 6716126 0.0578344 (=29681/(127*4041)) 94.9031
  given [ 6714471 ] : 6714471 0.0651384 (=1577/(6*4035)) 94.6378
	best keyword for cluster 6714471 is PF00076 with Jaccard = 0.8228	[ 3441	138	1096029	603 ]	0.9614 0.8509
sibling [ 6714471 ] : 6711407 0.0606557 (=37/(5*122)) 94.1557
	best keyword for cluster 6711407 is PF01805 with Jaccard = 0.7661	[ 95	2	1100087	27 ]	0.9794 0.7787

	SUGGESTING RELATEDNESS OF:   
	A> PF00076 ( PF00076   RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain) )
	B> PF01805 ( PF01805   Surp module )

	Only A has a clan ( CL0221.5 ).

	the two keywords coincide on UniRef90 proteins: |PF00076| = 4044 ,	 |PF01805| = 122 , 	|PF00076^PF01805| = 18	( 0.4% and 14.8% )

	both PF00076 and PF01805 have PDB structures
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 618 ) 6722123_PF00096_PF06524            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00096  is 6720078 with Jaccard = 0.8226	|PF00096|=4886 [ 4234	261	1095064	652 ]
 parent [ 6720078 ] : 6722123 0.0569141 (=6368/(21*5328)) 95.7328
  given [ 6720078 ] : 6720078 0.052689 (=2802/(10*5318)) 95.4265
	best keyword for cluster 6720078 is PF00096 with Jaccard = 0.8226	[ 4234	261	1095064	652 ]	0.9419 0.8666
sibling [ 6720078 ] : 6690125 0.122222 (=11/(6*15)) 90.2811
	best keyword for cluster 6690125 is PF06524 with Jaccard = 0.7273	[ 8	3	1100200	0 ]	0.7273 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF00096 ( PF00096   Zinc finger, C2H2 type )
	B> PF06524 ( PF06524   NOA36 protein )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF00096 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 619 ) 6756152_PF01167_PF03478            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF03478  is 6737410 with Jaccard = 0.8221	|PF03478|=212 [ 208	41	1099958	4 ]
 parent [ 6737410 ] : 6756152 0.0141183 (=377/(69*387)) 99.0409
  given [ 6737410 ] : 6737410 0.0287947 (=140/(13*374)) 97.5271
	best keyword for cluster 6737410 is PF03478 with Jaccard = 0.8221	[ 208	41	1099958	4 ]	0.8353 0.9811
sibling [ 6737410 ] : 6666452 0.157692 (=41/(65*4)) 84.5977
	best keyword for cluster 6666452 is PF01167 with Jaccard = 0.8592	[ 61	0	1100140	10 ]	1.0000 0.8592

	SUGGESTING RELATEDNESS OF:   
	A> PF03478 ( PF03478   Protein of unknown function (DUF295) )
	B> PF01167 ( PF01167   Tub family )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF03478 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:
     1	PF01167 SSF54518 0.784 (average over 177 mutual instances, PF01167 199 appearances, SSF54518 251 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 620 ) 6700926_PF05170_PF05359            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF05170  is 6698578 with Jaccard = 0.8210	|PF05170|=162 [ 133	0	1100049	29 ]
 parent [ 6698578 ] : 6700926 0.0945652 (=1218/(70*184)) 92.3402
  given [ 6698578 ] : 6698578 0.0917603 (=98/(6*178)) 91.9775
	best keyword for cluster 6698578 is PF05170 with Jaccard = 0.8210	[ 133	0	1100049	29 ]	1.0000 0.8210
sibling [ 6698578 ] : 6587305 0.477823 (=237/(8*62)) 56.2
	best keyword for cluster 6587305 is PF05359 with Jaccard = 0.9800	[ 49	1	1100161	0 ]	0.9800 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF05170 ( PF05170   AsmA family )
	B> PF05359 ( PF05359   Domain of Unknown Function (DUF748) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF05170 nor PF05359 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 621 ) 6720440_PF00014_PF00095            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00014  is 6716513 with Jaccard = 0.8202	|PF00014|=425 [ 365	20	1099766	60 ]
 parent [ 6716513 ] : 6720440 0.0496207 (=3087/(151*412)) 95.4803
  given [ 6716513 ] : 6716513 0.0722359 (=147/(5*407)) 94.971
	best keyword for cluster 6716513 is PF00014 with Jaccard = 0.8202	[ 365	20	1099766	60 ]	0.9481 0.8588
sibling [ 6716513 ] : 6691549 0.12415 (=73/(147*4)) 90.5824
	best keyword for cluster 6691549 is PF00095 with Jaccard = 0.6913	[ 103	12	1100062	34 ]	0.8957 0.7518

	SUGGESTING RELATEDNESS OF:   
	A> PF00014 ( PF00014   Kunitz/Bovine pancreatic trypsin inhibitor domain )
	B> PF00095 ( PF00095   WAP-type (Whey Acidic Protein) 'four-disulfide core' )

	Neither A nor B are assigned a clan.

	the two keywords coincide on UniRef90 proteins: |PF00014| = 426 ,	 |PF00095| = 137 , 	|PF00014^PF00095| = 31	( 7.3% and 22.6% )

	both PF00014 and PF00095 have PDB structures
	PF00014 	   g.8.1.1  g.8.1.2  k.35.1.1
	
	SUPERFAM mapping significantly overlapping:
     1	PF00095 SSF57256 0.887 (average over 257 mutual instances, PF00095 361 appearances, SSF57256 386 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 622 ) 6731550_PF00021_PF00087            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00021  is 6727290 with Jaccard = 0.8182	|PF00021|=133 [ 126	21	1100057	7 ]
 parent [ 6727290 ] : 6731550 0.0447345 (=2005/(180*249)) 96.8914
  given [ 6727290 ] : 6727290 0.050223 (=563/(59*190)) 96.3975
	best keyword for cluster 6727290 is PF00021 with Jaccard = 0.8182	[ 126	21	1100057	7 ]	0.8571 0.9474
sibling [ 6727290 ] : 6726427 0.0446927 (=8/(1*179)) 96.2886
	best keyword for cluster 6726427 is PF00087 with Jaccard = 0.9819	[ 163	2	1100045	1 ]	0.9879 0.9939

	SUGGESTING RELATEDNESS OF:   
	A> PF00021 ( PF00021   u-PAR/Ly-6 domain )
	B> PF00087 ( PF00087   Snake toxin )

	they come from the same clan: CL0117.6 : PF01064	PF06211	PF02988	PF00087	PF00021
	the two keywords do not coincide on UniRef90 proteins

	both PF00021 and PF00087 have PDB structures
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 623 ) 6446230_PF00912_PF06832            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF06832  is 6335824 with Jaccard = 0.8182	|PF06832|=54 [ 54	12	1100145	0 ]
 parent [ 6335824 ] : 6446230 0.990878 (=46708/(74*637)) 1.01074
  given [ 6335824 ] : 6335824 1 (=73/(1*73)) 5.08114e-07
	best keyword for cluster 6335824 is PF06832 with Jaccard = 0.8182	[ 54	12	1100145	0 ]	0.8182 1.0000
sibling [ 6335824 ] : 6430004 0.997592 (=14088/(23*614)) 0.282846
	best keyword for cluster 6430004 is PF00912 with Jaccard = 0.8807	[ 576	0	1099557	78 ]	1.0000 0.8807

	SUGGESTING RELATEDNESS OF:   
	A> PF06832 ( PF06832   Penicillin-Binding Protein C-terminus Family )
	B> PF00912 ( PF00912   Transglycosylase )

	Neither A nor B are assigned a clan.

	the two keywords coincide on UniRef90 proteins: |PF00912| = 654 ,	 |PF06832| = 54 , 	|PF00912^PF06832| = 53	( 8.1% and 98.1% )

	only  PF06832 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 624 ) 6756416_PF00789_PF03556            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00789  is 6733568 with Jaccard = 0.8163	|PF00789|=247 [ 231	36	1099928	16 ]
 parent [ 6733568 ] : 6756416 0.0134178 (=307/(65*352)) 99.056
  given [ 6733568 ] : 6733568 0.0374626 (=763/(73*279)) 97.1196
	best keyword for cluster 6733568 is PF00789 with Jaccard = 0.8163	[ 231	36	1099928	16 ]	0.8652 0.9352
sibling [ 6733568 ] : 6554719 0.578125 (=37/(1*64)) 42.5896
	best keyword for cluster 6554719 is PF03556 with Jaccard = 0.9831	[ 58	0	1100152	1 ]	1.0000 0.9831

	SUGGESTING RELATEDNESS OF:   
	A> PF00789 ( PF00789   UBX domain )
	B> PF03556 ( PF03556   Domain of unknown function (DUF298) )

	Only A has a clan ( CL0072.14 ).
	the two keywords do not coincide on UniRef90 proteins

	only  PF00789 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 625 ) 6585690_PF00289_PF02844            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00289  is 6562669 with Jaccard = 0.8162	|PF00289|=1011 [ 857	39	1099161	154 ]
 parent [ 6562669 ] : 6585690 0.489895 (=135458/(281*984)) 55.5826
  given [ 6562669 ] : 6562669 0.510682 (=502/(1*983)) 49.0745
	best keyword for cluster 6562669 is PF00289 with Jaccard = 0.8162	[ 857	39	1099161	154 ]	0.9565 0.8477
sibling [ 6562669 ] : 6464215 0.97491 (=544/(2*279)) 2.89589
	best keyword for cluster 6464215 is PF02844 with Jaccard = 0.8194	[ 245	7	1099912	47 ]	0.9722 0.8390

	SUGGESTING RELATEDNESS OF:   
	A> PF00289 ( PF00289   Carbamoyl-phosphate synthase L chain, N-terminal domain )
	B> PF02844 ( PF02844   Phosphoribosylglycinamide synthetase, N domain )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	both PF00289 and PF02844 have PDB structures
	PF00289 	   c.30.1.1
	PF02844 	   c.30.1.1

	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 626 ) 6611315_PF00046_PF00292            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00292  is 6572234 with Jaccard = 0.8150	|PF00292|=199 [ 163	1	1100011	36 ]
 parent [ 6572234 ] : 6611315 0.338824 (=155865/(169*2722)) 67.1606
  given [ 6572234 ] : 6572234 0.553892 (=185/(2*167)) 51.5431
	best keyword for cluster 6572234 is PF00292 with Jaccard = 0.8150	[ 163	1	1100011	36 ]	0.9939 0.8191
sibling [ 6572234 ] : 6607061 0.353877 (=31402/(33*2689)) 65.2037
	best keyword for cluster 6607061 is PF00046 with Jaccard = 0.7494	[ 2545	26	1096815	825 ]	0.9899 0.7552

	SUGGESTING RELATEDNESS OF:   
	A> PF00292 ( PF00292   'Paired box' domain )
	B> PF00046 ( PF00046   Homeobox domain )

	Only B has a clan ( CL0123.12 ).

	the two keywords coincide on UniRef90 proteins: |PF00046| = 3370 ,	 |PF00292| = 199 , 	|PF00046^PF00292| = 91	( 2.7% and 45.7% )

	both PF00292 and PF00046 have PDB structures
		PF00046 	   a.4.1.1  j.92.1.1

	SUPERFAM mapping significantly overlapping:
     1	PF00292 SSF46689 0.566 (average over 602 mutual instances, PF00292 602 appearances, SSF46689 68153 appearances)
     2	PF00046 SSF46689 0.773 (average over 9143 mutual instances, PF00046 9568 appearances, SSF46689 68153 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 627 ) 6718245_PF02552_PF02776            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02552  is 6534419 with Jaccard = 0.8095	|PF02552|=21 [ 17	0	1100190	4 ]
 parent [ 6534419 ] : 6718245 0.0652965 (=1382/(17*1245)) 95.1822
  given [ 6534419 ] : 6534419 0.733333 (=22/(15*2)) 28.5316
	best keyword for cluster 6534419 is PF02552 with Jaccard = 0.8095	[ 17	0	1100190	4 ]	1.0000 0.8095
sibling [ 6534419 ] : 6710583 0.0746774 (=463/(5*1240)) 94.0099
	best keyword for cluster 6710583 is PF02776 with Jaccard = 0.9328	[ 1028	72	1099109	2 ]	0.9345 0.9981

	SUGGESTING RELATEDNESS OF:   
	A> PF02552 ( PF02552   CO dehydrogenase beta subunit/acetyl-CoA synthase epsilon subunit )
	B> PF02776 ( PF02776   Thiamine pyrophosphate enzyme, N-terminal TPP binding domain )

	Only A has a clan ( CL0085.9 ).
	the two keywords do not coincide on UniRef90 proteins

	both PF02552 and PF02776 have PDB structures
	PF02552 	   c.31.1.6
	
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 628 ) 6755285_PF04488_PF05785            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04488  is 6752513 with Jaccard = 0.8095	|PF04488|=164 [ 153	25	1100022	11 ]
 parent [ 6752513 ] : 6755285 0.0139186 (=215/(271*57)) 98.9885
  given [ 6752513 ] : 6752513 0.017738 (=202/(219*52)) 98.8029
	best keyword for cluster 6752513 is PF04488 with Jaccard = 0.8095	[ 153	25	1100022	11 ]	0.8596 0.9329
sibling [ 6752513 ] : 6753581 0.0178571 (=1/(1*56)) 98.875
	best keyword for cluster 6753581 is PF05785 with Jaccard = 0.7000	[ 7	3	1100201	0 ]	0.7000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF04488 ( PF04488   Glycosyltransferase sugar-binding region containing DXD motif )
	B> PF05785 ( PF05785   Rho-activating domain of cytotoxic necrotizing factor )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	both PF04488 and PF05785 have PDB structures
		PF05785 	   d.194.1.1

	SUPERFAM mapping significantly overlapping:
     1	PF05785 SSF64438 0.861 (average over 21 mutual instances, PF05785 21 appearances, SSF64438 688 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 629 ) 6760626_PF00144_PF00933            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00144  is 6745544 with Jaccard = 0.8072	|PF00144|=1031 [ 833	1	1099179	198 ]
 parent [ 6745544 ] : 6760626 0.00740457 (=5569/(963*781)) 99.2978
  given [ 6745544 ] : 6745544 0.0193717 (=148/(8*955)) 98.2752
	best keyword for cluster 6745544 is PF00144 with Jaccard = 0.8072	[ 833	1	1099179	198 ]	0.9988 0.8080
sibling [ 6745544 ] : 6759392 0.00897436 (=7/(1*780)) 99.2326
	best keyword for cluster 6759392 is PF00933 with Jaccard = 0.9658	[ 649	18	1099539	5 ]	0.9730 0.9924

	SUGGESTING RELATEDNESS OF:   
	A> PF00144 ( PF00144   Beta-lactamase )
	B> PF00933 ( PF00933   Glycosyl hydrolase family 3 N terminal domain )

	A and B come from different clans ( CL0013.12 , CL0058.10 ).

	the two keywords coincide on UniRef90 proteins: |PF00144| = 1031 ,	 |PF00933| = 654 , 	|PF00144^PF00933| = 7	( 0.7% and 1.1% )

	both PF00144 and PF00933 have PDB structures
	PF00144 	   e.3.1.1
	PF00933 	   c.1.8.7

	SUPERFAM mapping significantly overlapping:
     1	PF00144 SSF56601 0.955 (average over 4139 mutual instances, PF00144 4197 appearances, SSF56601 18812 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 630 ) 6736944_PF01553_PF04028            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01553  is 6731479 with Jaccard = 0.8064	|PF01553|=1152 [ 1008	98	1098961	144 ]
 parent [ 6731479 ] : 6736944 0.0328781 (=2470/(57*1318)) 97.479
  given [ 6731479 ] : 6731479 0.0403242 (=5797/(120*1198)) 96.8814
	best keyword for cluster 6731479 is PF01553 with Jaccard = 0.8064	[ 1008	98	1098961	144 ]	0.9114 0.8750
sibling [ 6731479 ] : 6514859 0.845455 (=93/(2*55)) 17.586
	best keyword for cluster 6514859 is PF04028 with Jaccard = 0.9556	[ 43	1	1100166	1 ]	0.9773 0.9773

	SUGGESTING RELATEDNESS OF:   
	A> PF01553 ( PF01553   Acyltransferase )
	B> PF04028 ( PF04028   Domain of unknown function (DUF374) )

	Only A has a clan ( CL0228.3 ).
	the two keywords do not coincide on UniRef90 proteins

	only  PF01553 has a PDB structure (may not be up to date)
	PF01553 	   c.112.1.1
	
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 631 ) 6688186_PF01943_PF03023            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01943  is 6683234 with Jaccard = 0.8053	|PF01943|=569 [ 459	1	1099641	110 ]
 parent [ 6683234 ] : 6688186 0.11534 (=31047/(377*714)) 89.9014
  given [ 6683234 ] : 6683234 0.125408 (=1229/(14*700)) 88.998
	best keyword for cluster 6683234 is PF01943 with Jaccard = 0.8053	[ 459	1	1099641	110 ]	0.9978 0.8067
sibling [ 6683234 ] : 6682609 0.116667 (=217/(372*5)) 88.872
	best keyword for cluster 6682609 is PF03023 with Jaccard = 0.6888	[ 228	101	1099880	2 ]	0.6930 0.9913

	SUGGESTING RELATEDNESS OF:   
	A> PF01943 ( PF01943   Polysaccharide biosynthesis protein )
	B> PF03023 ( PF03023   MviN-like protein )

	they come from the same clan: CL0222.3 : PF01554	PF03023	PF01943	PF04506

	the two keywords coincide on UniRef90 proteins: |PF01943| = 569 ,	 |PF03023| = 230 , 	|PF01943^PF03023| = 1	( 0.2% and 0.4% )

	Neither PF01943 nor PF03023 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 632 ) 6688257_PF00840_PF02015            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02015  is 6441890 with Jaccard = 0.8043	|PF02015|=46 [ 37	0	1100165	9 ]
 parent [ 6441890 ] : 6688257 0.106549 (=410/(37*104)) 89.9297
  given [ 6441890 ] : 6441890 0.992424 (=131/(33*4)) 0.759409
	best keyword for cluster 6441890 is PF02015 with Jaccard = 0.8043	[ 37	0	1100165	9 ]	1.0000 0.8043
sibling [ 6441890 ] : 6625832 0.274157 (=366/(89*15)) 72.9157
	best keyword for cluster 6625832 is PF00840 with Jaccard = 0.8788	[ 87	12	1100112	0 ]	0.8788 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF02015 ( PF02015   Glycosyl hydrolase family 45 )
	B> PF00840 ( PF00840   Glycosyl hydrolase family 7 )

	A and B come from different clans ( CL0199.7 , CL0004.14 ).
	the two keywords do not coincide on UniRef90 proteins

	both PF02015 and PF00840 have PDB structures
	PF02015 	   b.52.1.1
	PF00840 	   b.29.1.10

	SUPERFAM mapping significantly overlapping:
     1	PF00840 SSF49899 0.989 (average over 255 mutual instances, PF00840 318 appearances, SSF49899 14070 appearances)
     2	PF02015 SSF50685 0.935 (average over 100 mutual instances, PF02015 129 appearances, SSF50685 2549 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 633 ) 6698573_PF00465_PF01761            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01761  is 6659594 with Jaccard = 0.8028	|PF01761|=360 [ 289	0	1099851	71 ]
 parent [ 6659594 ] : 6698573 0.0944991 (=21216/(314*715)) 91.9766
  given [ 6659594 ] : 6659594 0.175777 (=164/(311*3)) 83.2773
	best keyword for cluster 6659594 is PF01761 with Jaccard = 0.8028	[ 289	0	1099851	71 ]	1.0000 0.8028
sibling [ 6659594 ] : 6695679 0.092437 (=66/(1*714)) 91.4994
	best keyword for cluster 6695679 is PF00465 with Jaccard = 0.8457	[ 592	50	1099511	58 ]	0.9221 0.9108

	SUGGESTING RELATEDNESS OF:   
	A> PF01761 ( PF01761   3-dehydroquinate synthase )
	B> PF00465 ( PF00465   Iron-containing alcohol dehydrogenase )

	they come from the same clan: CL0224.3 : PF01761	PF00465
	the two keywords do not coincide on UniRef90 proteins

	both PF01761 and PF00465 have PDB structures
	PF01761 	   e.22.1.1
	PF00465 	   e.22.1.2

	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 634 ) 6744375_PF00929_PF02811            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00929  is 6741904 with Jaccard = 0.8025	|PF00929|=1073 [ 1032	213	1098925	41 ]
 parent [ 6741904 ] : 6744375 0.019284 (=32249/(1040*1608)) 98.1808
  given [ 6741904 ] : 6741904 0.0252419 (=1234/(31*1577)) 97.9624
	best keyword for cluster 6741904 is PF00929 with Jaccard = 0.8025	[ 1032	213	1098925	41 ]	0.8289 0.9618
sibling [ 6741904 ] : 6739449 0.0295172 (=749/(25*1015)) 97.7263
	best keyword for cluster 6739449 is PF02811 with Jaccard = 0.7896	[ 747	56	1099265	143 ]	0.9303 0.8393

	SUGGESTING RELATEDNESS OF:   
	A> PF00929 ( PF00929   Exonuclease )
	B> PF02811 ( PF02811   PHP domain )

	A and B come from different clans ( CL0219.6 , CL0034.9 ).

	the two keywords coincide on UniRef90 proteins: |PF00929| = 1073 ,	 |PF02811| = 890 , 	|PF00929^PF02811| = 55	( 5.1% and 6.2% )

	both PF00929 and PF02811 have PDB structures
	PF00929 	   c.55.3.5
	
	SUPERFAM mapping significantly overlapping:
     1	PF00929 SSF53098 0.885 (average over 3310 mutual instances, PF00929 3861 appearances, SSF53098 65670 appearances)
     2	PF02811 SSF89550 0.796 (average over 2526 mutual instances, PF02811 3397 appearances, SSF89550 5217 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 635 ) 6703698_PF00565_PF00567            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00565  is 6695678 with Jaccard = 0.8000	|PF00565|=320 [ 256	0	1099891	64 ]
 parent [ 6695678 ] : 6703698 0.08248 (=3552/(135*319)) 92.8462
  given [ 6695678 ] : 6695678 0.0981013 (=93/(3*316)) 91.4993
	best keyword for cluster 6695678 is PF00565 with Jaccard = 0.8000	[ 256	0	1099891	64 ]	1.0000 0.8000
sibling [ 6695678 ] : 6678034 0.141221 (=74/(4*131)) 87.6765
	best keyword for cluster 6678034 is PF00567 with Jaccard = 0.7548	[ 117	4	1100056	34 ]	0.9669 0.7748

	SUGGESTING RELATEDNESS OF:   
	A> PF00565 ( PF00565   Staphylococcal nuclease homologue )
	B> PF00567 ( PF00567   Tudor domain )

	Only B has a clan ( CL0049.9 ).

	the two keywords coincide on UniRef90 proteins: |PF00565| = 320 ,	 |PF00567| = 151 , 	|PF00565^PF00567| = 38	( 11.9% and 25.2% )

	only  PF00565 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:
     1	PF00565 SSF50199 0.792 (average over 816 mutual instances, PF00565 827 appearances, SSF50199 989 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 636 ) 6706354_PF06158_PF06528            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF06528  is 6557661 with Jaccard = 0.8000	|PF06528|=15 [ 12	0	1100196	3 ]
 parent [ 6557661 ] : 6706354 0.0666667 (=28/(14*30)) 93.3533
  given [ 6557661 ] : 6557661 0.692308 (=9/(1*13)) 44.9869
	best keyword for cluster 6557661 is PF06528 with Jaccard = 0.8000	[ 12	0	1100196	3 ]	1.0000 0.8000
sibling [ 6557661 ] : 6646478 0.243386 (=46/(21*9)) 78.9664
	best keyword for cluster 6646478 is PF06158 with Jaccard = 1.0000	[ 19	0	1100192	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF06528 ( PF06528   Phage P2 GpE )
	B> PF06158 ( PF06158   Phage tail protein E )

	Neither A nor B are assigned a clan.

	the two keywords coincide on UniRef90 proteins: |PF06158| = 19 ,	 |PF06528| = 15 , 	|PF06158^PF06528| = 2	( 10.5% and 13.3% )

	Neither PF06528 nor PF06158 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 637 ) 6666168_PF02121_PF02862            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02862  is 6585801 with Jaccard = 0.7975	|PF02862|=79 [ 63	0	1100132	16 ]
 parent [ 6585801 ] : 6666168 0.163288 (=733/(67*67)) 84.5138
  given [ 6585801 ] : 6585801 0.476923 (=62/(2*65)) 55.6951
	best keyword for cluster 6585801 is PF02862 with Jaccard = 0.7975	[ 63	0	1100132	16 ]	1.0000 0.7975
sibling [ 6585801 ] : 6628148 0.30303 (=20/(1*66)) 73.7258
	best keyword for cluster 6628148 is PF02121 with Jaccard = 0.8730	[ 55	8	1100148	0 ]	0.8730 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF02862 ( PF02862   DDHD domain )
	B> PF02121 ( PF02121   Phosphatidylinositol transfer protein )

	Only B has a clan ( CL0209.4 ).

	the two keywords coincide on UniRef90 proteins: |PF02121| = 55 ,	 |PF02862| = 79 , 	|PF02121^PF02862| = 11	( 20.0% and 13.9% )

	only  PF02862 has a PDB structure (may not be up to date)
		PF02121 	   d.129.3.4

	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 638 ) 6769866_PF00903_PF01261            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01261  is 6767388 with Jaccard = 0.7973	|PF01261|=992 [ 952	202	1099017	40 ]
 parent [ 6767388 ] : 6769866 0.00384732 (=15568/(3022*1339)) 99.7065
  given [ 6767388 ] : 6767388 0.00496776 (=886/(1189*150)) 99.6129
	best keyword for cluster 6767388 is PF01261 with Jaccard = 0.7973	[ 952	202	1099017	40 ]	0.8250 0.9597
sibling [ 6767388 ] : 6767046 0.00463576 (=28/(2*3020)) 99.5995
	best keyword for cluster 6767046 is PF00903 with Jaccard = 0.8550	[ 2111	196	1097742	162 ]	0.9150 0.9287

	SUGGESTING RELATEDNESS OF:   
	A> PF01261 ( PF01261   Xylose isomerase-like TIM barrel )
	B> PF00903 ( PF00903   Glyoxalase/Bleomycin resistance protein/Dioxygenase superfamily )

	A and B come from different clans ( CL0152.6 , CL0104.8 ).

	the two keywords coincide on Uniref90 proteins: |PF00903| = 2273 ,	 |PF01261| = 992 , 	|PF00903^PF01261| = 14	( 0.6% and 1.4% )

	both PF01261 and PF00903 have PDB structures
	PF01261 	   c.1.15.1  c.1.15.3  c.1.15.4  c.1.15.5
	PF00903 	   d.32.1.1  d.32.1.10  d.32.1.2  d.32.1.3  d.32.1.4  d.32.1.6  d.32.1.8

	SUPERFAM mapping significantly overlapping:
     1	PF01261 SSF51658 0.729 (average over 3023 mutual instances, PF01261 3048 appearances, SSF51658 3985 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:
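
The linkage values of record 638 follow the legend's formula, linkage = |existing_edges| / (|cluster1| * |cluster2|), with cluster size = |cluster1| + |cluster2|. The sketch below recomputes them from the numbers printed in the record. The helper names are hypothetical. The remark that the parent's two factors equal the sizes of the sibling and given clusters is an observation on this record, not a documented rule.

```python
def linkage(existing_edges, size1, size2):
    """Proportion of existing BLAST edges out of all possible edges between the two parts."""
    return existing_edges / (size1 * size2)


def cluster_size(size1, size2):
    """Cluster size as defined in the legend: |cluster1| + |cluster2|."""
    return size1 + size2


# Numbers copied from record 638.
given_linkage = linkage(886, 1189, 150)        # cluster 6767388
sibling_linkage = linkage(28, 2, 3020)         # cluster 6767046
parent_linkage = linkage(15568, 3022, 1339)    # cluster 6769866

given_size = cluster_size(1189, 150)
sibling_size = cluster_size(2, 3020)

print(f"given   linkage={given_linkage:.6g} size={given_size}")
print(f"sibling linkage={sibling_linkage:.6g} size={sibling_size}")
print(f"parent  linkage={parent_linkage:.6g}")
```

In this record the sizes computed for the sibling and given clusters (3022 and 1339) are the two factors in the parent's denominator, which is what one would expect if the parent is the merge of exactly these two clusters.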






-------------------====== ( 639 ) 6720806_PF00462_PF04908            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00462  is 6692246 with Jaccard = 0.7954	|PF00462|=902 [ 719	2	1099307	183 ]
 parent [ 6692246 ] : 6720806 0.0563709 (=1544/(33*830)) 95.531
  given [ 6692246 ] : 6692246 0.110169 (=364/(4*826)) 90.7467
	best keyword for cluster 6692246 is PF00462 with Jaccard = 0.7954	[ 719	2	1099307	183 ]	0.9972 0.7971
sibling [ 6692246 ] : 6605613 0.40625 (=13/(1*32)) 64.4917
	best keyword for cluster 6605613 is PF04908 with Jaccard = 0.8158	[ 31	1	1100173	6 ]	0.9688 0.8378

	SUGGESTING RELATEDNESS OF:   
	A> PF00462 ( PF00462   Glutaredoxin )
	B> PF04908 ( PF04908   SH3-binding, glutamic acid-rich protein )

	they come from the same clan: CL0172.11 : PF00837	PF04908	PF02630	PF08534	PF02114	PF04756	PF07449	PF02798	PF00255	PF00462	PF07912	PF06110	PF05768	PF07955	PF01323	PF01216	PF03960	PF00578	PF00085
	the two keywords do not coincide on UniRef90 proteins

	both PF00462 and PF04908 have PDB structures
	PF00462 	   c.47.1.1
	
	SUPERFAM mapping significantly overlapping:
     1	PF00462 SSF52833 0.710 (average over 2554 mutual instances, PF00462 2661 appearances, SSF52833 34965 appearances)
     2	PF04908 SSF52833 0.972 (average over 75 mutual instances, PF04908 78 appearances, SSF52833 34965 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 640 ) 6709396_PF01871_PF02900            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02900  is 6664524 with Jaccard = 0.7924	|PF02900|=236 [ 187	0	1099975	49 ]
 parent [ 6664524 ] : 6709396 0.0704607 (=2340/(162*205)) 93.8444
  given [ 6664524 ] : 6664524 0.191218 (=1768/(138*67)) 84.1975
	best keyword for cluster 6664524 is PF02900 with Jaccard = 0.7924	[ 187	0	1099975	49 ]	1.0000 0.7924
sibling [ 6664524 ] : 6648931 0.205364 (=1248/(103*59)) 79.6683
	best keyword for cluster 6648931 is PF01871 with Jaccard = 0.7143	[ 110	36	1100057	8 ]	0.7534 0.9322

	SUGGESTING RELATEDNESS OF:   
	A> PF02900 ( PF02900   Catalytic LigB subunit of aromatic ring-opening dioxygenase )
	B> PF01871 ( PF01871   AMMECR1 )

	Only A has a clan ( CL0283.2 ).

	the two keywords coincide on Uniref90 proteins: |PF01871| = 118 ,	 |PF02900| = 236 , 	|PF01871^PF02900| = 13	( 11.0% and 5.5% )

	both PF02900 and PF01871 have PDB structures
		PF01871 	   d.309.1.1

	SUPERFAM mapping significantly overlapping:
     1	PF02900 SSF53213 0.960 (average over 651 mutual instances, PF02900 671 appearances, SSF53213 704 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:
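
The two percentages on the "keywords coincide" line of record 640 appear to be the shared protein count divided by each keyword's own count, in the order the keywords are listed. The sketch below checks that reading. It is an assumption inferred from the printed numbers, and the function name is hypothetical.

```python
def coincidence_percentages(count_first, count_second, count_both):
    """Return the share of each keyword's proteins that also carry the other keyword.

    Assumes the report lists the percentages in the same order as the keyword counts.
    """
    pct_first = 100.0 * count_both / count_first
    pct_second = 100.0 * count_both / count_second
    return pct_first, pct_second


# Counts copied from record 640: |PF01871| = 118, |PF02900| = 236, intersection = 13.
pct_pf01871, pct_pf02900 = coincidence_percentages(118, 236, 13)
print(f"( {pct_pf01871:.1f}% and {pct_pf02900:.1f}% )")
```

A small overlap relative to both keyword counts, as here, suggests that the proposed relatedness is not merely an effect of the two domains co-occurring on the same proteins.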






-------------------====== ( 641 ) 6653242_PF00547_PF00699            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00699  is 6436120 with Jaccard = 0.7923	|PF00699|=130 [ 103	0	1100081	27 ]
 parent [ 6436120 ] : 6653242 0.189592 (=1971/(92*113)) 81.0735
  given [ 6436120 ] : 6436120 0.995495 (=221/(111*2)) 0.486867
	best keyword for cluster 6436120 is PF00699 with Jaccard = 0.7923	[ 103	0	1100081	27 ]	1.0000 0.7923
sibling [ 6436120 ] : 6263173 1 (=91/(1*91)) 2.5738e-12
	best keyword for cluster 6263173 is PF00547 with Jaccard = 0.7545	[ 83	0	1100101	27 ]	1.0000 0.7545

	SUGGESTING RELATEDNESS OF:   
	A> PF00699 ( PF00699   Urease beta subunit )
	B> PF00547 ( PF00547   Urease, gamma subunit )

	Neither A nor B are assigned a clan.

	the two keywords coincide on Uniref90 proteins: |PF00547| = 110 ,	 |PF00699| = 130 , 	|PF00547^PF00699| = 36	( 32.7% and 27.7% )

	both PF00699 and PF00547 have PDB structures
		
	SUPERFAM mapping significantly overlapping:
     1	PF00699 SSF51278 0.899 (average over 491 mutual instances, PF00699 687 appearances, SSF51278 729 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:
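
For bulk checking, the parent/given/sibling lines of a record such as 641 can be parsed mechanically. The sketch below is one possible way to do it, not part of the original pipeline. The regular expression and the field names are hypothetical, and the column order (relation, best cluster, cluster, linkage, edge formula, ProtoLevel) is taken from the legend.

```python
import re

# relation [ best_cluster ] : cluster linkage (=edges/(size1*size2)) proto_level
RELATION_RE = re.compile(
    r"^\s*(parent|given|sibling)\s*\[\s*(\d+)\s*\]\s*:\s*(\d+)\s+(\S+)\s+"
    r"\(=(\d+)/\((\d+)\*(\d+)\)\)\s+(\S+)\s*$"
)


def parse_relation_line(line):
    """Parse one relation line into a dict, or return None if it does not match."""
    match = RELATION_RE.match(line)
    if match is None:
        return None
    relation, best, cluster, linkage, edges, size1, size2, level = match.groups()
    return {
        "relation": relation,
        "best_cluster": int(best),
        "cluster": int(cluster),
        "linkage": float(linkage),
        "edges": int(edges),
        "size1": int(size1),
        "size2": int(size2),
        "proto_level": float(level),
    }


# Lines copied from record 641.
given = parse_relation_line("  given [ 6436120 ] : 6436120 0.995495 (=221/(111*2)) 0.486867")
sibling = parse_relation_line("sibling [ 6436120 ] : 6263173 1 (=91/(1*91)) 2.5738e-12")

for row in (given, sibling):
    recomputed = row["edges"] / (row["size1"] * row["size2"])
    print(row["relation"], row["cluster"], row["linkage"], f"{recomputed:.6g}")
```

Comparing the printed linkage with the value recomputed from the edge formula gives a quick consistency check for each parsed line.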






-------------------====== ( 642 ) 6765418_PF01127_PF02313            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01127  is 6735057 with Jaccard = 0.7904	|PF01127|=230 [ 230	61	1099920	0 ]
 parent [ 6735057 ] : 6765418 0.00629277 (=103/(496*33)) 99.5309
  given [ 6735057 ] : 6735057 0.0358274 (=2002/(173*323)) 97.2782
	best keyword for cluster 6735057 is PF01127 with Jaccard = 0.7904	[ 230	61	1099920	0 ]	0.7904 1.0000
sibling [ 6735057 ] : 6762758 0.03125 (=1/(1*32)) 99.4062
	best keyword for cluster 6762758 is PF02313 with Jaccard = 1.0000	[ 21	0	1100190	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF01127 ( PF01127   Succinate dehydrogenase cytochrome b subunit )
	B> PF02313 ( PF02313   Fumarate reductase subunit D )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	both PF01127 and PF02313 have PDB structures
		PF02313 	   f.21.2.2

	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 643 ) 6580212_PF00129_PF07654            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF07654  is 6546979 with Jaccard = 0.7900	|PF07654|=1493 [ 1189	12	1098706	304 ]
 parent [ 6546979 ] : 6580212 0.471989 (=428027/(1214*747)) 53.7949
  given [ 6546979 ] : 6546979 0.652078 (=2369/(3*1211)) 36.6568
	best keyword for cluster 6546979 is PF07654 with Jaccard = 0.7900	[ 1189	12	1098706	304 ]	0.9900 0.7964
sibling [ 6546979 ] : 6558890 0.573154 (=854/(2*745)) 45.9789
	best keyword for cluster 6558890 is PF00129 with Jaccard = 0.6092	[ 725	0	1099021	465 ]	1.0000 0.6092

	SUGGESTING RELATEDNESS OF:   
	A> PF07654 ( PF07654   Immunoglobulin C1-set domain )
	B> PF00129 ( PF00129   Class I Histocompatibility antigen, domains alpha 1 and 2 )

	Only A has a clan ( CL0011.18 ).

	the two keywords coincide on Uniref90 proteins: |PF00129| = 1190 ,	 |PF07654| = 1493 , 	|PF00129^PF07654| = 606	( 50.9% and 40.6% )

	both PF07654 and PF00129 have PDB structures
	PF07654 	   b.1.1.2
	
	SUPERFAM mapping significantly overlapping:
     1	PF00129 SSF54452 0.972 (average over 8055 mutual instances, PF00129 8055 appearances, SSF54452 25772 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 644 ) 6740496_PF05378_PF06032            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF06032  is 6509800 with Jaccard = 0.7872	|PF06032|=47 [ 37	0	1100164	10 ]
 parent [ 6509800 ] : 6740496 0.0219609 (=353/(38*423)) 97.8272
  given [ 6509800 ] : 6509800 0.847926 (=184/(7*31)) 15.2818
	best keyword for cluster 6509800 is PF06032 with Jaccard = 0.7872	[ 37	0	1100164	10 ]	1.0000 0.7872
sibling [ 6509800 ] : 6737708 0.0267786 (=67/(6*417)) 97.5561
	best keyword for cluster 6737708 is PF05378 with Jaccard = 0.6702	[ 254	122	1099832	3 ]	0.6755 0.9883

	SUGGESTING RELATEDNESS OF:   
	A> PF06032 ( PF06032   Protein of unknown function (DUF917) )
	B> PF05378 ( PF05378   Hydantoinase/oxoprolinase N-terminal region )

	Only B has a clan ( CL0108.10 ).

	the two keywords coincide on Uniref90 proteins: |PF05378| = 257 ,	 |PF06032| = 47 , 	|PF05378^PF06032| = 10	( 3.9% and 21.3% )

	Neither A nor B has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:
     1	PF05378 SSF53383 0.793 (average over 1 mutual instances, PF05378 1 appearances, SSF53383 34644 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 645 ) 6689861_PF00256_PF00828            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00256  is 6632191 with Jaccard = 0.7863	|PF00256|=325 [ 298	54	1099832	27 ]
 parent [ 6632191 ] : 6689861 0.11918 (=4331/(92*395)) 90.2474
  given [ 6632191 ] : 6632191 0.28181 (=8146/(97*298)) 75.1802
	best keyword for cluster 6632191 is PF00256 with Jaccard = 0.7863	[ 298	54	1099832	27 ]	0.8466 0.9169
sibling [ 6632191 ] : 6549048 0.644522 (=1106/(26*66)) 38.0505
	best keyword for cluster 6549048 is PF00828 with Jaccard = 0.7126	[ 62	25	1100124	0 ]	0.7126 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF00256 ( PF00256   Ribosomal protein L15 )
	B> PF00828 ( PF00828   Eukaryotic ribosomal protein L18 )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF00256 has a PDB structure (may not be up to date)
	PF00256 	   c.12.1.1
	
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 646 ) 6695975_PF00069_PF01453            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00069  is 6694181 with Jaccard = 0.7833	|PF00069|=11375 [ 10402	1905	1086931	973 ]
 parent [ 6694181 ] : 6695975 0.0868221 (=389625/(338*13277)) 91.541
  given [ 6694181 ] : 6694181 0.10485 (=11130/(8*13269)) 91.1396
	best keyword for cluster 6694181 is PF00069 with Jaccard = 0.7833	[ 10402	1905	1086931	973 ]	0.8452 0.9145
sibling [ 6694181 ] : 6676105 0.172108 (=4254/(231*107)) 87.1973
	best keyword for cluster 6676105 is PF01453 with Jaccard = 0.6419	[ 294	13	1099753	151 ]	0.9577 0.6607

	SUGGESTING RELATEDNESS OF:   
	A> PF00069 ( PF00069   Protein kinase domain )
	B> PF01453 ( PF01453   D-mannose binding lectin )

	Only A has a clan ( CL0016.14 ).

	the two keywords coincide on Uniref90 proteins: |PF00069| = 11375 ,	 |PF01453| = 445 , 	|PF00069^PF01453| = 162	( 1.4% and 36.4% )

	both PF00069 and PF01453 have PDB structures
		PF01453 	   b.78.1.1

	SUPERFAM mapping significantly overlapping:
     1	PF01453 SSF51110 0.759 (average over 1305 mutual instances, PF01453 2025 appearances, SSF51110 3868 appearances)
     2	PF00069 SSF56112 0.797 (average over 32363 mutual instances, PF00069 36405 appearances, SSF56112 66637 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 647 ) 6742870_PF05101_PF05245            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF05101  is 6702318 with Jaccard = 0.7818	|PF05101|=55 [ 43	0	1100156	12 ]
 parent [ 6702318 ] : 6742870 0.0311966 (=73/(45*52)) 98.0401
  given [ 6702318 ] : 6702318 0.0965909 (=34/(8*44)) 92.5928
	best keyword for cluster 6702318 is PF05101 with Jaccard = 0.7818	[ 43	0	1100156	12 ]	1.0000 0.7818
sibling [ 6702318 ] : 6552559 0.612903 (=266/(14*31)) 40.9073
	best keyword for cluster 6552559 is PF05245 with Jaccard = 0.7059	[ 12	4	1100194	1 ]	0.7500 0.9231

	SUGGESTING RELATEDNESS OF:   
	A> PF05101 ( PF05101   Type IV secretory pathway, VirB3-like protein )
	B> PF05245 ( PF05245   Conjugal transfer protein TrbD )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither A nor B has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 648 ) 6754528_PF01575_PF07977            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01575  is 6732221 with Jaccard = 0.7770	|PF01575|=570 [ 453	13	1099628	117 ]
 parent [ 6732221 ] : 6754528 0.015873 (=6144/(672*576)) 98.9393
  given [ 6732221 ] : 6732221 0.0309345 (=144/(7*665)) 96.9692
	best keyword for cluster 6732221 is PF01575 with Jaccard = 0.7770	[ 453	13	1099628	117 ]	0.9721 0.7947
sibling [ 6732221 ] : 6736699 0.0330435 (=19/(1*575)) 97.4489
	best keyword for cluster 6736699 is PF07977 with Jaccard = 0.6835	[ 324	123	1099737	27 ]	0.7248 0.9231

	SUGGESTING RELATEDNESS OF:   
	A> PF01575 ( PF01575   MaoC like domain )
	B> PF07977 ( PF07977   FabA-like domain )

	they come from the same clan: CL0050.7 : PF03061	PF01643	PF02551	PF07977	PF01575
	the two keywords do not coincide on UniRef90 proteins

	both PF01575 and PF07977 have PDB structures
		PF07977 	   d.38.1.2  d.38.1.6

	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 649 ) 6745191_PF04535_PF07911            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04535  is 6690734 with Jaccard = 0.7765	|PF04535|=85 [ 66	0	1100126	19 ]
 parent [ 6690734 ] : 6745191 0.0191053 (=41/(74*29)) 98.2489
  given [ 6690734 ] : 6690734 0.117457 (=109/(16*58)) 90.4268
	best keyword for cluster 6690734 is PF04535 with Jaccard = 0.7765	[ 66	0	1100126	19 ]	1.0000 0.7765
sibling [ 6690734 ] : 6704172 0.0714286 (=2/(1*28)) 92.9429
	best keyword for cluster 6704172 is PF07911 with Jaccard = 1.0000	[ 23	0	1100188	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF04535 ( PF04535   Domain of unknown function (DUF588) )
	B> PF07911 ( PF07911   Protein of unknown function (DUF1677) )

	Neither A nor B are assigned a clan.

	the two keywords coincide on Uniref90 proteins: |PF04535| = 85 ,	 |PF07911| = 23 , 	|PF04535^PF07911| = 1	( 1.2% and 4.3% )

	Neither A nor B has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 650 ) 6619387_PF00836_PF05672            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF05672  is 6606333 with Jaccard = 0.7742	|PF05672|=30 [ 24	1	1100180	6 ]
 parent [ 6606333 ] : 6619387 0.346915 (=1822/(101*52)) 70.2099
  given [ 6606333 ] : 6606333 0.402174 (=111/(6*46)) 64.7943
	best keyword for cluster 6606333 is PF05672 with Jaccard = 0.7742	[ 24	1	1100180	6 ]	0.9600 0.8000
sibling [ 6606333 ] : 6602233 0.427673 (=1088/(48*53)) 62.8107
	best keyword for cluster 6602233 is PF00836 with Jaccard = 0.8788	[ 29	4	1100178	0 ]	0.8788 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF05672 ( PF05672   MAP7 (E-MAP-115) family )
	B> PF00836 ( PF00836   Stathmin family )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF05672 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:
     1	PF00836 SSF101494 0.963 (average over 91 mutual instances, PF00836 91 appearances, SSF101494 91 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 651 ) 6762276_PF01381_PF02486            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01381  is 6760686 with Jaccard = 0.7684	|PF01381|=3353 [ 3006	559	1096299	347 ]
 parent [ 6760686 ] : 6762276 0.00821969 (=4470/(104*5229)) 99.3842
  given [ 6760686 ] : 6760686 0.0099152 (=4034/(79*5150)) 99.301
	best keyword for cluster 6760686 is PF01381 with Jaccard = 0.7684	[ 3006	559	1096299	347 ]	0.8432 0.8965
sibling [ 6760686 ] : 6755100 0.0134921 (=17/(14*90)) 98.9755
	best keyword for cluster 6755100 is PF02486 with Jaccard = 0.9506	[ 77	3	1100130	1 ]	0.9625 0.9872

	SUGGESTING RELATEDNESS OF:   
	A> PF01381 ( PF01381   Helix-turn-helix )
	B> PF02486 ( PF02486   Replication initiation factor )

	Only A has a clan ( CL0123.12 ).

	the two keywords coincide on Uniref90 proteins: |PF01381| = 3353 ,	 |PF02486| = 78 , 	|PF01381^PF02486| = 8	( 0.2% and 10.3% )

	only  PF01381 has a PDB structure (may not be up to date)
	PF01381 	   a.35.1.11  a.35.1.2  a.35.1.3
	
	SUPERFAM mapping significantly overlapping:
     1	PF01381 SSF47413 0.810 (average over 8999 mutual instances, PF01381 10797 appearances, SSF47413 20047 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 652 ) 6757628_PF05978_PF07690            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF05978  is 6736932 with Jaccard = 0.7670	|PF05978|=79 [ 79	24	1100108	0 ]
 parent [ 6736932 ] : 6757628 0.0123495 (=26074/(145*14561)) 99.1309
  given [ 6736932 ] : 6736932 0.0275689 (=66/(126*19)) 97.4776
	best keyword for cluster 6736932 is PF05978 with Jaccard = 0.7670	[ 79	24	1100108	0 ]	0.7670 1.0000
sibling [ 6736932 ] : 6757401 0.0111875 (=2441/(15*14546)) 99.1154
	best keyword for cluster 6757401 is PF07690 with Jaccard = 0.7945	[ 10339	2422	1087197	253 ]	0.8102 0.9761

	SUGGESTING RELATEDNESS OF:   
	A> PF05978 ( PF05978   Eukaryotic protein of unknown function (DUF895) )
	B> PF07690 ( PF07690   Major Facilitator Superfamily )

	they come from the same clan: CL0015.13 : PF00083	PF03209	PF00854	PF03137	PF03825	PF01733	PF06813	PF07672	PF07690	PF01306	PF01770	PF05978	PF05977	PF05631	PF04332	PF07673	PF06779	PF02487	PF03092	PF06609

	the two keywords coincide on Uniref90 proteins: |PF05978| = 79 ,	 |PF07690| = 10592 , 	|PF05978^PF07690| = 1	( 1.3% and 0.0% )

	only  PF05978 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:
     1	PF07690 SSF103473 0.840 (average over 31421 mutual instances, PF07690 31552 appearances, SSF103473 39293 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 653 ) 6701988_PF00406_PF01202            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01202  is 6659576 with Jaccard = 0.7660	|PF01202|=481 [ 370	2	1099728	111 ]
 parent [ 6659576 ] : 6701988 0.105209 (=52905/(531*947)) 92.5249
  given [ 6659576 ] : 6659576 0.230781 (=10543/(423*108)) 83.2649
	best keyword for cluster 6659576 is PF01202 with Jaccard = 0.7660	[ 370	2	1099728	111 ]	0.9946 0.7692
sibling [ 6659576 ] : 6693889 0.106285 (=301/(3*944)) 91.0758
	best keyword for cluster 6693889 is PF00406 with Jaccard = 0.6344	[ 479	274	1099456	2 ]	0.6361 0.9958

	SUGGESTING RELATEDNESS OF:   
	A> PF01202 ( PF01202   Shikimate kinase )
	B> PF00406 ( PF00406   Adenylate kinase )

	Only A has a clan ( CL0023.26 ).
	the two keywords do not coincide on UniRef90 proteins

	both PF01202 and PF00406 have PDB structures
		PF00406 	   c.37.1.1

	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 654 ) 6763647_PF04138_PF04794            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04138  is 6725663 with Jaccard = 0.7650	|PF04138|=232 [ 179	2	1099977	53 ]
 parent [ 6725663 ] : 6763647 0.0065849 (=182/(249*111)) 99.4482
  given [ 6725663 ] : 6725663 0.0493724 (=118/(10*239)) 96.1989
	best keyword for cluster 6725663 is PF04138 with Jaccard = 0.7650	[ 179	2	1099977	53 ]	0.9890 0.7716
sibling [ 6725663 ] : 6762612 0.00909091 (=1/(1*110)) 99.4
	best keyword for cluster 6762612 is PF04794 with Jaccard = 0.9773	[ 86	2	1100123	0 ]	0.9773 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF04138 ( PF04138   GtrA-like protein )
	B> PF04794 ( PF04794   YdjC-like protein )

	Neither A nor B are assigned a clan.

	the two keywords coincide on Uniref90 proteins: |PF04138| = 232 ,	 |PF04794| = 86 , 	|PF04138^PF04794| = 3	( 1.3% and 3.5% )

	Neither A nor B has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 655 ) 6453480_PF00005_PF00664            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00664  is 6450609 with Jaccard = 0.7598	|PF00664|=2604 [ 2420	581	1097026	184 ]
 parent [ 6450609 ] : 6453480 0.987075 (=43874842/(13969*3182)) 1.60299
  given [ 6450609 ] : 6450609 0.988648 (=176103/(3125*57)) 1.39078
	best keyword for cluster 6450609 is PF00664 with Jaccard = 0.7598	[ 2420	581	1097026	184 ]	0.8064 0.9293
sibling [ 6450609 ] : 6448285 0.991553 (=14455140/(1136*12833)) 1.19485
	best keyword for cluster 6448285 is PF00005 with Jaccard = 0.7048	[ 12863	1	1081961	5386 ]	0.9999 0.7049

	SUGGESTING RELATEDNESS OF:   
	A> PF00664 ( PF00664   ABC transporter transmembrane region )
	B> PF00005 ( PF00005   ABC transporter )

	A and B come from different clans ( CL0241.3 , CL0023.26 ).

	the two keywords coincide on Uniref90 proteins: |PF00005| = 18249 ,	 |PF00664| = 2604 , 	|PF00005^PF00664| = 2535	( 13.9% and 97.4% )

	both PF00664 and PF00005 have PDB structures
	PF00664 	   f.37.1.1
	PF00005 	   c.37.1.12  j.35.1.1

	SUPERFAM mapping significantly overlapping:
     1	PF00664 SSF90123 0.746 (average over 7613 mutual instances, PF00664 7751 appearances, SSF90123 18042 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 656 ) 6527357_PF01500_PF05287            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01500  is 6524428 with Jaccard = 0.7586	|PF01500|=52 [ 44	6	1100153	8 ]
 parent [ 6524428 ] : 6527357 0.792008 (=773/(16*61)) 24.5323
  given [ 6524428 ] : 6524428 0.808786 (=626/(18*43)) 22.6091
	best keyword for cluster 6524428 is PF01500 with Jaccard = 0.7586	[ 44	6	1100153	8 ]	0.8800 0.8462
sibling [ 6524428 ] : 6324943 1 (=60/(10*6)) 9.24524e-08
	best keyword for cluster 6324943 is PF05287 with Jaccard = 0.7273	[ 16	0	1100189	6 ]	1.0000 0.7273

	SUGGESTING RELATEDNESS OF:   
	A> PF01500 ( PF01500   Keratin, high sulfur B2 protein )
	B> PF05287 ( PF05287   PMG protein )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither A nor B has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 657 ) 6729673_PF03435_PF06408            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF03435  is 6665268 with Jaccard = 0.7562	|PF03435|=277 [ 214	6	1099928	63 ]
 parent [ 6665268 ] : 6729673 0.0421345 (=349/(251*33)) 96.6883
  given [ 6665268 ] : 6665268 0.175639 (=2747/(115*136)) 84.3269
	best keyword for cluster 6665268 is PF03435 with Jaccard = 0.7562	[ 214	6	1099928	63 ]	0.9727 0.7726
sibling [ 6665268 ] : 6687189 0.133333 (=12/(30*3)) 89.6812
	best keyword for cluster 6687189 is PF06408 with Jaccard = 1.0000	[ 25	0	1100186	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF03435 ( PF03435   Saccharopine dehydrogenase )
	B> PF06408 ( PF06408   Homospermidine synthase )

	Only A has a clan ( CL0063.17 ).
	the two keywords do not coincide on UniRef90 proteins

	only  PF03435 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 658 ) 6750141_PF00001_PF01748            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01748  is 6744778 with Jaccard = 0.7557	|PF01748|=317 [ 266	35	1099859	51 ]
 parent [ 6744778 ] : 6750141 0.018445 (=44072/(5689*420)) 98.6287
  given [ 6744778 ] : 6744778 0.0247355 (=886/(119*301)) 98.2175
	best keyword for cluster 6744778 is PF01748 with Jaccard = 0.7557	[ 266	35	1099859	51 ]	0.8837 0.8391
sibling [ 6744778 ] : 6744405 0.023841 (=2164/(16*5673)) 98.1833
	best keyword for cluster 6744405 is PF00001 with Jaccard = 0.9786	[ 5032	43	1095069	67 ]	0.9915 0.9869

	SUGGESTING RELATEDNESS OF:   
	A> PF01748 ( PF01748   Caenorhabditis serpentine receptor-like protein )
	B> PF00001 ( PF00001   7 transmembrane receptor (rhodopsin family) )

	they come from the same clan: CL0192.7 : PF05296	PF03383	PF01748	PF04789	PF06976	PF01036	PF01461	PF00001	PF03402
	the two keywords do not coincide on UniRef90 proteins

	only  PF01748 has a PDB structure (may not be up to date)
		PF00001 	   f.13.1.2  i.22.1.1  j.101.1.1  j.35.1.1  j.82.1.1  j.94.1.1

	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 659 ) 6674928_PF05899_PF06249            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF05899  is 6558535 with Jaccard = 0.7554	|PF05899|=139 [ 105	0	1100072	34 ]
 parent [ 6558535 ] : 6674928 0.161402 (=838/(118*44)) 86.8651
  given [ 6558535 ] : 6558535 0.580531 (=328/(5*113)) 45.5925
	best keyword for cluster 6558535 is PF05899 with Jaccard = 0.7554	[ 105	0	1100072	34 ]	1.0000 0.7554
sibling [ 6558535 ] : 6644741 0.254826 (=66/(37*7)) 78.3594
	best keyword for cluster 6644741 is PF06249 with Jaccard = 0.6129	[ 19	12	1100180	0 ]	0.6129 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF05899 ( PF05899   Protein of unknown function (DUF861) )
	B> PF06249 ( PF06249   Ethanolamine utilisation protein EutQ )

	they come from the same clan: CL0029.13 : PF01238	PF05726	PF02678	PF01050	PF02373	PF04209	PF06560	PF05523	PF06249	PF06339	PF04074	PF07385	PF00908	PF06172	PF08007	PF05899	PF07883	PF00190	PF05995	PF02041	PF05118	PF03079	PF02311	PF06052
	the two keywords do not coincide on UniRef90 proteins

	only  PF05899 has a PDB structure (may not be up to date)
	PF05899 	   b.82.1.11  b.82.1.8
	
	SUPERFAM mapping significantly overlapping:
     1	PF06249 SSF51182 0.695 (average over 75 mutual instances, PF06249 75 appearances, SSF51182 14255 appearances)
     2	PF05899 SSF51182 0.709 (average over 422 mutual instances, PF05899 437 appearances, SSF51182 14255 appearances)
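	To see at a glance whether A and B map to the same SUPERFAMILY entry, the numbered lines of this block can be parsed. The following is a sketch only: the regular expression is an assumption about the line layout as it appears in this report, and the names SSF_RE, parse_ssf_lines and shared_superfamilies are hypothetical.

```python
import re

# Assumed layout: "<n>\t<Pfam id> <SSF id> <agreement> (average over <k> mutual instances, ..."
SSF_RE = re.compile(
    r"^\s*\d+\s+(PF\d+)\s+(SSF\d+)\s+([\d.]+)\s+\(average over (\d+) mutual instances"
)


def parse_ssf_lines(lines):
    """Map each SSF id to {Pfam id: (agreement, mutual_instances)}."""
    hits = {}
    for line in lines:
        m = SSF_RE.match(line)
        if m:
            pfam, ssf, agreement, mutual = m.groups()
            hits.setdefault(ssf, {})[pfam] = (float(agreement), int(mutual))
    return hits


def shared_superfamilies(lines, fam_a, fam_b):
    """Return the SSF ids to which both Pfam families are mapped."""
    hits = parse_ssf_lines(lines)
    return sorted(ssf for ssf, fams in hits.items() if fam_a in fams and fam_b in fams)


block = [
    "     1\tPF06249 SSF51182 0.695 (average over 75 mutual instances, PF06249 75 appearances, SSF51182 14255 appearances)",
    "     2\tPF05899 SSF51182 0.709 (average over 422 mutual instances, PF05899 437 appearances, SSF51182 14255 appearances)",
]
print(shared_superfamilies(block, "PF05899", "PF06249"))  # prints "['SSF51182']"
```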


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 660 ) 6676374_PF00359_PF00874            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00874  is 6602247 with Jaccard = 0.7517	|PF00874|=275 [ 215	11	1099925	60 ]
 parent [ 6602247 ] : 6676374 0.136794 (=18645/(235*580)) 87.2803
  given [ 6602247 ] : 6602247 0.41523 (=289/(3*232)) 62.8244
	best keyword for cluster 6602247 is PF00874 with Jaccard = 0.7517	[ 215	11	1099925	60 ]	0.9513 0.7818
sibling [ 6602247 ] : 6668523 0.158868 (=275/(3*577)) 85.1586
	best keyword for cluster 6668523 is PF00359 with Jaccard = 0.6166	[ 386	137	1099585	103 ]	0.7380 0.7894
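	The %LINKAGE value on each parent/given/sibling line can be checked against the fraction printed next to it, following the legend (existing edges divided by |cluster1|*|cluster2|). The sketch below is illustrative: the regular expression is an assumption about the line layout, and parse_relation and linkage_is_consistent are hypothetical helper names.

```python
import re

# Assumed layout:
# "<relation> [ <best cluster> ] : <cluster> <linkage> (=<edges>/(<n1>*<n2>)) <ProtoLevel>"
RELATION_RE = re.compile(
    r"^\s*(parent|given|sibling)\s*\[\s*(\d+)\s*\]\s*:\s*(\d+)\s+(\S+)\s+"
    r"\(=(\d+)/\((\d+)\*(\d+)\)\)\s+(\S+)\s*$"
)


def parse_relation(line):
    """Split one relation line into named fields."""
    m = RELATION_RE.match(line)
    if m is None:
        raise ValueError(f"not a relation line: {line!r}")
    relation, best, cluster, linkage, edges, n1, n2, level = m.groups()
    return {
        "relation": relation,
        "best_cluster": int(best),
        "cluster": int(cluster),
        "linkage": float(linkage),
        "edges": int(edges),
        "size1": int(n1),
        "size2": int(n2),
        "proto_level": float(level),
    }


def linkage_is_consistent(rec, tol=1e-5):
    """True if the printed linkage matches edges / (size1 * size2)."""
    expected = rec["edges"] / (rec["size1"] * rec["size2"])
    return abs(expected - rec["linkage"]) <= tol


rec = parse_relation(" parent [ 6602247 ] : 6676374 0.136794 (=18645/(235*580)) 87.2803")
print(rec["size1"] + rec["size2"], linkage_is_consistent(rec))  # prints "815 True"
```

	The first printed number is the cluster size as the legend defines it, |cluster1| + |cluster2|.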

	SUGGESTING RELATEDNESS OF:   
	A> PF00874 ( PF00874   PRD domain )
	B> PF00359 ( PF00359   Phosphoenolpyruvate-dependent sugar phosphotransferase system, EIIA 2 )

	Only A has a clan ( CL0166.7 ).

	the two keywords coincide on Uniref90 proteins: |PF00359| = 489 ,	 |PF00874| = 275 , 	|PF00359^PF00874| = 76	( 15.5% and 27.6% )
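	The two percentages on the line above are the shared proteins as a fraction of each keyword's own protein set. A minimal sketch of that arithmetic follows; the function name cooccurrence is hypothetical, and the Jaccard value it also returns is an extra quantity that the report does not print.

```python
def cooccurrence(size_a, size_b, shared):
    """Return (% of A that is shared, % of B that is shared, Jaccard of A and B)."""
    pct_a = 100.0 * shared / size_a
    pct_b = 100.0 * shared / size_b
    jaccard = shared / (size_a + size_b - shared)  # not printed in the report
    return pct_a, pct_b, jaccard


# |PF00359| = 489, |PF00874| = 275, |PF00359^PF00874| = 76
pct_a, pct_b, jac = cooccurrence(489, 275, 76)
print(f"{pct_a:.1f}% {pct_b:.1f}% J={jac:.3f}")  # prints "15.5% 27.6% J=0.110"
```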

	both PF00874 and PF00359 have PDB structures
		
	SUPERFAM mapping significantly overlapping:
     1	PF00359 SSF55804 0.923 (average over 2008 mutual instances, PF00359 2875 appearances, SSF55804 4723 appearances)
     2	PF00874 SSF63520 0.839 (average over 960 mutual instances, PF00874 1775 appearances, SSF63520 2527 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 661 ) 6719639_PF00593_PF01640            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01640  is 6652465 with Jaccard = 0.7500	|PF01640|=7 [ 6	1	1100203	1 ]
 parent [ 6652465 ] : 6719639 0.0485463 (=2012/(15*2763)) 95.3643
  given [ 6652465 ] : 6652465 0.2 (=10/(10*5)) 80.82
	best keyword for cluster 6652465 is PF01640 with Jaccard = 0.7500	[ 6	1	1100203	1 ]	0.8571 0.8571
sibling [ 6652465 ] : 6718932 0.054407 (=900/(6*2757)) 95.2683
	best keyword for cluster 6718932 is PF00593 with Jaccard = 0.9224	[ 2188	104	1097839	80 ]	0.9546 0.9647

	SUGGESTING RELATEDNESS OF:   
	A> PF01640 ( PF01640   Peptidase C10 family )
	B> PF00593 ( PF00593   TonB dependent receptor )

	A and B come from different clans ( CL0125.9 , CL0193.8 ).
	the two keywords do not coincide on UniRef90 proteins

	both PF01640 and PF00593 have PDB structures
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 662 ) 6705148_PF00096_PF07400            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF07400  is 6507050 with Jaccard = 0.7500	|PF07400|=3 [ 3	1	1100207	0 ]
 parent [ 6507050 ] : 6705148 0.0702479 (=4386/(12*5203)) 93.1329
  given [ 6507050 ] : 6507050 0.9 (=18/(2*10)) 14.069
	best keyword for cluster 6507050 is PF07400 with Jaccard = 0.7500	[ 3	1	1100207	0 ]	0.7500 1.0000
sibling [ 6507050 ] : 6703989 0.0788276 (=4502/(11*5192)) 92.9129
	best keyword for cluster 6703989 is PF00096 with Jaccard = 0.8179	[ 4205	255	1095070	681 ]	0.9428 0.8606

	SUGGESTING RELATEDNESS OF:   
	A> PF07400 ( PF07400   Interleukin 11 )
	B> PF00096 ( PF00096   Zinc finger, C2H2 type )

	Only A has a clan ( CL0053.9 ).
	the two keywords do not coincide on UniRef90 proteins

	only  PF07400 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:
     1	PF07400 SSF47266 0.807 (average over 2 mutual instances, PF07400 2 appearances, SSF47266 2488 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 663 ) 6748487_PF01257_PF06999            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF06999  is 6605528 with Jaccard = 0.7326	|PF06999|=86 [ 63	0	1100125	23 ]
 parent [ 6605528 ] : 6748487 0.0197256 (=552/(66*424)) 98.5005
  given [ 6605528 ] : 6605528 0.425366 (=436/(25*41)) 64.39
	best keyword for cluster 6605528 is PF06999 with Jaccard = 0.7326	[ 63	0	1100125	23 ]	1.0000 0.7326
sibling [ 6605528 ] : 6694889 0.115498 (=4376/(296*128)) 91.2943
	best keyword for cluster 6694889 is PF01257 with Jaccard = 0.8487	[ 258	22	1099907	24 ]	0.9214 0.9149

	SUGGESTING RELATEDNESS OF:   
	A> PF06999 ( PF06999   Sucrase/ferredoxin-like )
	B> PF01257 ( PF01257   Respiratory-chain NADH dehydrogenase 24 Kd subunit )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF06999 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:
     1	PF01257 SSF52833 0.515 (average over 774 mutual instances, PF01257 782 appearances, SSF52833 34965 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 664 ) 6696382_PF01585_PF07713            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01585  is 6690768 with Jaccard = 0.7322	|PF01585|=338 [ 257	13	1099860	81 ]
 parent [ 6690768 ] : 6696382 0.119826 (=1297/(33*328)) 91.6347
  given [ 6690768 ] : 6690768 0.125045 (=1378/(38*290)) 90.435
	best keyword for cluster 6690768 is PF01585 with Jaccard = 0.7322	[ 257	13	1099860	81 ]	0.9519 0.7604
sibling [ 6690768 ] : 6591104 0.439655 (=51/(29*4)) 57.791
	best keyword for cluster 6591104 is PF07713 with Jaccard = 1.0000	[ 26	0	1100185	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF01585 ( PF01585   G-patch domain )
	B> PF07713 ( PF07713   Protein of unknown function (DUF1604) )

	Neither A nor B are assigned a clan.

	the two keywords coincide on Uniref90 proteins: |PF01585| = 338 ,	 |PF07713| = 26 , 	|PF01585^PF07713| = 10	( 3.0% and 38.5% )

	Neither PF01585 nor PF07713 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 665 ) 6708570_PF00073_PF00915            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00073  is 6701144 with Jaccard = 0.7265	|PF00073|=433 [ 433	163	1099615	0 ]
 parent [ 6701144 ] : 6708570 0.0678761 (=11112/(214*765)) 93.7389
  given [ 6701144 ] : 6701144 0.0871217 (=9908/(202*563)) 92.3862
	best keyword for cluster 6701144 is PF00073 with Jaccard = 0.7265	[ 433	163	1099615	0 ]	0.7265 1.0000
sibling [ 6701144 ] : 6682948 0.116904 (=74/(3*211)) 88.9262
	best keyword for cluster 6682948 is PF00915 with Jaccard = 0.9675	[ 149	2	1100057	3 ]	0.9868 0.9803

	SUGGESTING RELATEDNESS OF:   
	A> PF00073 ( PF00073   picornavirus capsid protein )
	B> PF00915 ( PF00915   Calicivirus coat protein )

	they come from the same clan: CL0055.7 : PF01318	PF00915	PF00760	PF01829	PF00073	PF00983	PF00729
	the two keywords do not coincide on UniRef90 proteins

	both PF00073 and PF00915 have PDB structures
		PF00915 	   b.121.4.3

	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 666 ) 6651087_PF00390_PF01515            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01515  is 6545808 with Jaccard = 0.7197	|PF01515|=314 [ 226	0	1099897	88 ]
 parent [ 6545808 ] : 6651087 0.197697 (=22161/(248*452)) 80.351
  given [ 6545808 ] : 6545808 0.643725 (=159/(1*247)) 35.919
	best keyword for cluster 6545808 is PF01515 with Jaccard = 0.7197	[ 226	0	1099897	88 ]	1.0000 0.7197
sibling [ 6545808 ] : 6565475 0.505556 (=455/(2*450)) 50.0974
	best keyword for cluster 6565475 is PF00390 with Jaccard = 0.9739	[ 410	11	1099790	0 ]	0.9739 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF01515 ( PF01515   Phosphate acetyl/butaryl transferase )
	B> PF00390 ( PF00390   Malic enzyme, N-terminal domain )

	Only A has a clan ( CL0270.2 ).

	the two keywords coincide on Uniref90 proteins: |PF00390| = 410 ,	 |PF01515| = 314 , 	|PF00390^PF01515| = 88	( 21.5% and 28.0% )

	both PF01515 and PF00390 have PDB structures
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 667 ) 6765896_PF05051_PF06747            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF06747  is 6761481 with Jaccard = 0.7155	|PF06747|=213 [ 166	19	1099979	47 ]
 parent [ 6761481 ] : 6765896 0.00650878 (=119/(47*389)) 99.5518
  given [ 6761481 ] : 6761481 0.0109244 (=351/(119*270)) 99.3437
	best keyword for cluster 6761481 is PF06747 with Jaccard = 0.7155	[ 166	19	1099979	47 ]	0.8973 0.7793
sibling [ 6761481 ] : 6756568 0.0217391 (=1/(1*46)) 99.0652
	best keyword for cluster 6756568 is PF05051 with Jaccard = 1.0000	[ 35	0	1100176	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF06747 ( PF06747   CHCH domain )
	B> PF05051 ( PF05051   Cytochrome C oxidase copper chaperone (COX17) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF06747 has a PDB structure (may not be up to date)
		PF05051 	   a.17.1.2

	SUPERFAM mapping significantly overlapping:
     1	PF06747 SSF47072 0.856 (average over 4 mutual instances, PF06747 4 appearances, SSF47072 24 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 668 ) 6664467_PF01391_PF07212            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01391  is 6664282 with Jaccard = 0.7152	|PF01391|=1159 [ 924	133	1098919	235 ]
 parent [ 6664282 ] : 6664467 0.170602 (=3360/(15*1313)) 84.1743
  given [ 6664282 ] : 6664282 0.186005 (=731/(3*1310)) 84.1281
	best keyword for cluster 6664282 is PF01391 with Jaccard = 0.7152	[ 924	133	1098919	235 ]	0.8742 0.7972
sibling [ 6664282 ] : 6550373 0.611111 (=22/(3*12)) 39.063
	best keyword for cluster 6550373 is PF07212 with Jaccard = 1.0000	[ 9	0	1100202	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF01391 ( PF01391   Collagen triple helix repeat (20 copies) )
	B> PF07212 ( PF07212   Hyaluronidase protein (HylP) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF01391 has a PDB structure (may not be up to date)
	PF01391 	   d.169.1.5  h.1.1.1
	
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 669 ) 6748873_PF04569_PF05754            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF05754  is 6736150 with Jaccard = 0.7147	|PF05754|=384 [ 278	5	1099822	106 ]
 parent [ 6736150 ] : 6748873 0.0204209 (=15965/(619*1263)) 98.5341
  given [ 6736150 ] : 6736150 0.0357762 (=3425/(302*317)) 97.3905
	best keyword for cluster 6736150 is PF05754 with Jaccard = 0.7147	[ 278	5	1099822	106 ]	0.9823 0.7240
sibling [ 6736150 ] : 6747727 0.0224261 (=4729/(198*1065)) 98.4455
	best keyword for cluster 6747727 is PF04569 with Jaccard = 0.7317	[ 150	51	1100006	4 ]	0.7463 0.9740

	SUGGESTING RELATEDNESS OF:   
	A> PF05754 ( PF05754   Domain of unknown function (DUF834) )
	B> PF04569 ( PF04569   Protein of unknown function )

	Neither A nor B are assigned a clan.

	the two keywords coincide on Uniref90 proteins: |PF04569| = 154 ,	 |PF05754| = 384 , 	|PF04569^PF05754| = 1	( 0.6% and 0.3% )

	Neither PF05754 nor PF04569 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 670 ) 6678846_PF00090_PF00200            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00090  is 6658050 with Jaccard = 0.7099	|PF00090|=489 [ 389	59	1099663	100 ]
 parent [ 6658050 ] : 6678846 0.127163 (=24693/(522*372)) 87.8929
  given [ 6658050 ] : 6658050 0.218269 (=2086/(503*19)) 82.7142
	best keyword for cluster 6658050 is PF00090 with Jaccard = 0.7099	[ 389	59	1099663	100 ]	0.8683 0.7955
sibling [ 6658050 ] : 6668806 0.159687 (=408/(7*365)) 85.1875
	best keyword for cluster 6668806 is PF00200 with Jaccard = 0.7820	[ 269	75	1099867	0 ]	0.7820 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF00090 ( PF00090   Thrombospondin type 1 domain )
	B> PF00200 ( PF00200   Disintegrin )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	both PF00090 and PF00200 have PDB structures
		PF00200 	   g.20.1.1

	SUPERFAM mapping significantly overlapping:
     1	PF00200 SSF57552 0.970 (average over 566 mutual instances, PF00200 567 appearances, SSF57552 1832 appearances)
     2	PF00090 SSF82895 0.795 (average over 1393 mutual instances, PF00090 1737 appearances, SSF82895 3242 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 671 ) 6780465_PF01659_PF08015            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF08015  is 6773994 with Jaccard = 0.7045	|PF08015|=44 [ 31	0	1100167	13 ]
 parent [ 6773994 ] : 6780465 0.000589449 (=8/(87*156)) 99.9608
  given [ 6773994 ] : 6773994 0.0021164 (=4/(42*45)) 99.8306
	best keyword for cluster 6773994 is PF08015 with Jaccard = 0.7045	[ 31	0	1100167	13 ]	1.0000 0.7045
sibling [ 6773994 ] : 6778177 0.000988142 (=5/(46*110)) 99.9232
	best keyword for cluster 6778177 is PF01659 with Jaccard = 0.8621	[ 25	4	1100182	0 ]	0.8621 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF08015 ( PF08015   Fungal mating-type pheromone )
	B> PF01659 ( PF01659   Luteovirus putative VPg genome linked protein )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF08015 nor PF01659 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 672 ) 6752562_PF00096_PF05485            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF05485  is 6723534 with Jaccard = 0.7033	|PF05485|=91 [ 64	0	1100120	27 ]
 parent [ 6723534 ] : 6752562 0.0131453 (=11888/(118*7664)) 98.8076
  given [ 6723534 ] : 6723534 0.0451632 (=155/(66*52)) 95.9325
	best keyword for cluster 6723534 is PF05485 with Jaccard = 0.7033	[ 64	0	1100120	27 ]	1.0000 0.7033
sibling [ 6723534 ] : 6752225 0.0171924 (=5634/(43*7621)) 98.7845
	best keyword for cluster 6752225 is PF00096 with Jaccard = 0.6418	[ 4375	1931	1093394	511 ]	0.6938 0.8954

	SUGGESTING RELATEDNESS OF:   
	A> PF05485 ( PF05485   THAP domain )
	B> PF00096 ( PF00096   Zinc finger, C2H2 type )

	Neither A nor B are assigned a clan.

	the two keywords coincide on Uniref90 proteins: |PF00096| = 4886 ,	 |PF05485| = 91 , 	|PF00096^PF05485| = 8	( 0.2% and 8.8% )

	only  PF05485 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 673 ) 6771366_PF00702_PF00982            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00702  is 6769089 with Jaccard = 0.7019	|PF00702|=4884 [ 4772	1915	1093412	112 ]
 parent [ 6769089 ] : 6771366 0.00313522 (=10159/(8245*393)) 99.7556
  given [ 6769089 ] : 6769089 0.00472453 (=78112/(3443*4802)) 99.6794
	best keyword for cluster 6769089 is PF00702 with Jaccard = 0.7019	[ 4772	1915	1093412	112 ]	0.7136 0.9771
sibling [ 6769089 ] : 6769131 0.00510204 (=2/(1*392)) 99.6811
	best keyword for cluster 6769131 is PF00982 with Jaccard = 0.6836	[ 229	105	1099876	1 ]	0.6856 0.9957

	SUGGESTING RELATEDNESS OF:   
	A> PF00702 ( PF00702   haloacid dehalogenase-like hydrolase )
	B> PF00982 ( PF00982   Glycosyltransferase family 20 )

	A and B come from different clans ( CL0137.9 , CL0113.8 ).
	the two keywords do not coincide on UniRef90 proteins

	both PF00702 and PF00982 have PDB structures
	PF00702 	   c.108.1.1  c.108.1.10  c.108.1.11  c.108.1.14  c.108.1.2  c.108.1.22  c.108.1.3  c.108.1.4  c.108.1.5  c.108.1.6  d.220.1.1  i.18.1.1
	PF00982 	   c.87.1.6

	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 674 ) 6672121_PF07018_PF07201            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF07201  is 6650337 with Jaccard = 0.7000	|PF07201|=23 [ 21	7	1100181	2 ]
 parent [ 6650337 ] : 6672121 0.15505 (=109/(19*37)) 86.0511
  given [ 6650337 ] : 6650337 0.211538 (=66/(13*24)) 80.1255
	best keyword for cluster 6650337 is PF07201 with Jaccard = 0.7000	[ 21	7	1100181	2 ]	0.7500 0.9130
sibling [ 6650337 ] : 6603328 0.465909 (=41/(8*11)) 63.3105
	best keyword for cluster 6603328 is PF07018 with Jaccard = 0.8000	[ 8	2	1100201	0 ]	0.8000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF07201 ( PF07201   Hypersensitivity response secretion protein HrpJ )
	B> PF07018 ( PF07018   SepL/SsaL protein )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF07201 nor PF07018 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 675 ) 6773436_PF00631_PF01990            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00631  is 6747899 with Jaccard = 0.6947	|PF00631|=94 [ 66	1	1100116	28 ]
 parent [ 6747899 ] : 6773436 0.0028909 (=23/(78*102)) 99.817
  given [ 6747899 ] : 6747899 0.0267241 (=31/(58*20)) 98.4594
	best keyword for cluster 6747899 is PF00631 with Jaccard = 0.6947	[ 66	1	1100116	28 ]	0.9851 0.7021
sibling [ 6747899 ] : 6763173 0.00990099 (=1/(1*101)) 99.4257
	best keyword for cluster 6763173 is PF01990 with Jaccard = 1.0000	[ 92	0	1100119	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF00631 ( PF00631   GGL domain )
	B> PF01990 ( PF01990   ATP synthase (F/14-kDa) subunit )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	both PF00631 and PF01990 have PDB structures
	PF00631 	   a.137.3.1  j.103.1.1
	
	SUPERFAM mapping significantly overlapping:
     1	PF00631 SSF48670 0.846 (average over 252 mutual instances, PF00631 331 appearances, SSF48670 404 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 676 ) 6558673_PF02875_PF08353            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01225  is 6530772 with Jaccard = 0.6944	|PF01225|=774 [ 768	332	1099105	6 ]
 parent [ 6530772 ] : 6558673 0.597316 (=45218/(62*1221)) 45.7388
  given [ 6530772 ] : 6530772 0.762885 (=284045/(591*630)) 26.1032
	best keyword for cluster 6530772 is PF02875 with Jaccard = 0.7965	[ 1057	43	1098884	227 ]	0.9609 0.8232
sibling [ 6530772 ] : 6493015 0.916667 (=110/(2*60)) 9.11813
	best keyword for cluster 6493015 is PF08353 with Jaccard = 0.8947	[ 51	6	1100154	0 ]	0.8947 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF02875 ( PF02875   Mur ligase family, glutamate ligase domain )
	B> PF08353 ( PF08353   Domain of unknown function (DUF1727) )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF02875 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 677 ) 6695940_PF01537_PF02400            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01537  is 6647811 with Jaccard = 0.6944	|PF01537|=36 [ 25	0	1100175	11 ]
 parent [ 6647811 ] : 6695940 0.112795 (=67/(27*22)) 91.5255
  given [ 6647811 ] : 6647811 0.22 (=11/(2*25)) 79.2617
	best keyword for cluster 6647811 is PF01537 with Jaccard = 0.6944	[ 25	0	1100175	11 ]	1.0000 0.6944
sibling [ 6647811 ] : 6668407 0.15 (=6/(2*20)) 85.08
	best keyword for cluster 6668407 is PF02400 with Jaccard = 0.8636	[ 19	0	1100189	3 ]	1.0000 0.8636

	SUGGESTING RELATEDNESS OF:   
	A> PF01537 ( PF01537   Herpesvirus glycoprotein D )
	B> PF02400 ( PF02400   Glycoprotein GG/GX )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF01537 has a PDB structure (may not be up to date)
	PF01537 	   b.1.1.1
	
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 678 ) 6643964_PF02403_PF03129            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF03129  is 6626433 with Jaccard = 0.6929	|PF03129|=1189 [ 846	32	1098990	343 ]
 parent [ 6626433 ] : 6643964 0.240807 (=89125/(390*949)) 78.2006
  given [ 6626433 ] : 6626433 0.320898 (=40308/(159*790)) 73.0202
	best keyword for cluster 6626433 is PF03129 with Jaccard = 0.6929	[ 846	32	1098990	343 ]	0.9636 0.7115
sibling [ 6626433 ] : 6608037 0.413882 (=161/(1*389)) 65.9262
	best keyword for cluster 6608037 is PF02403 with Jaccard = 0.9190	[ 329	27	1099853	2 ]	0.9242 0.9940

	SUGGESTING RELATEDNESS OF:   
	A> PF03129 ( PF03129   Anticodon binding domain )
	B> PF02403 ( PF02403   Seryl-tRNA synthetase N-terminal domain )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	both PF03129 and PF02403 have PDB structures
	PF03129 	   c.51.1.1
	
	SUPERFAM mapping significantly overlapping:
     1	PF02403 SSF46589 0.950 (average over 1002 mutual instances, PF02403 1005 appearances, SSF46589 5268 appearances)
     2	PF03129 SSF52954 0.857 (average over 3463 mutual instances, PF03129 5178 appearances, SSF52954 9421 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 679 ) 6574812_PF04968_PF05002            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF05002  is 6383908 with Jaccard = 0.6901	|PF05002|=71 [ 49	0	1100140	22 ]
 parent [ 6383908 ] : 6574812 0.494505 (=945/(49*39)) 52.1661
  given [ 6383908 ] : 6383908 1 (=138/(46*3)) 0.000849054
	best keyword for cluster 6383908 is PF05002 with Jaccard = 0.6901	[ 49	0	1100140	22 ]	1.0000 0.6901
sibling [ 6383908 ] : 6530945 0.77027 (=57/(2*37)) 26.2604
	best keyword for cluster 6530945 is PF04968 with Jaccard = 0.9730	[ 36	0	1100174	1 ]	1.0000 0.9730

	SUGGESTING RELATEDNESS OF:   
	A> PF05002 ( PF05002   SGS domain )
	B> PF04968 ( PF04968   CHORD )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF05002 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 680 ) 6646364_PF00059_PF00193            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00193  is 6527604 with Jaccard = 0.6882	|PF00193|=93 [ 64	0	1100118	29 ]
 parent [ 6527604 ] : 6646364 0.218952 (=13787/(68*926)) 78.8773
  given [ 6527604 ] : 6527604 0.778605 (=837/(43*25)) 24.845
	best keyword for cluster 6527604 is PF00193 with Jaccard = 0.6882	[ 64	0	1100118	29 ]	1.0000 0.6882
sibling [ 6527604 ] : 6643806 0.24105 (=4794/(22*904)) 78.0925
	best keyword for cluster 6643806 is PF00059 with Jaccard = 0.7384	[ 830	18	1099087	276 ]	0.9788 0.7505

	SUGGESTING RELATEDNESS OF:   
	A> PF00193 ( PF00193   Extracellular link domain )
	B> PF00059 ( PF00059   Lectin C-type domain )

	they come from the same clan: CL0056.7 : PF03440	PF01413	PF07979	PF00059	PF00193

	the two keywords coincide on Uniref90 proteins: |PF00059| = 1106 ,	 |PF00193| = 93 , 	|PF00059^PF00193| = 27	( 2.4% and 29.0% )

	both PF00193 and PF00059 have PDB structures
		
	SUPERFAM mapping significantly overlapping:
     1	PF00193 SSF56436 0.607 (average over 209 mutual instances, PF00193 295 appearances, SSF56436 4895 appearances)
     2	PF00059 SSF56436 0.797 (average over 2340 mutual instances, PF00059 2804 appearances, SSF56436 4895 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 681 ) 6524585_PF00019_PF04709            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04709  is 6391041 with Jaccard = 0.6875	|PF04709|=12 [ 11	4	1100195	1 ]
 parent [ 6391041 ] : 6524585 0.798955 (=4129/(16*323)) 22.7655
  given [ 6391041 ] : 6391041 1 (=63/(7*9)) 0.00236828
	best keyword for cluster 6391041 is PF04709 with Jaccard = 0.6875	[ 11	4	1100195	1 ]	0.7333 0.9167
sibling [ 6391041 ] : 6516282 0.84345 (=2640/(10*313)) 18.1837
	best keyword for cluster 6516282 is PF00019 with Jaccard = 0.8119	[ 315	0	1099823	73 ]	1.0000 0.8119

	SUGGESTING RELATEDNESS OF:   
	A> PF04709 ( PF04709   Anti-Mullerian hormone, N terminal region )
	B> PF00019 ( PF00019   Transforming growth factor beta like domain )

	Only B has a clan ( CL0079.7 ).

	the two keywords coincide on Uniref90 proteins: |PF00019| = 388 ,	 |PF04709| = 12 , 	|PF00019^PF04709| = 11	( 2.8% and 91.7% )

	only  PF04709 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 682 ) 6755081_PF00615_PF00787            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00615  is 6750739 with Jaccard = 0.6855	|PF00615|=250 [ 194	33	1099928	56 ]
 parent [ 6750739 ] : 6755081 0.0116959 (=2024/(342*506)) 98.9747
  given [ 6750739 ] : 6750739 0.0166147 (=112/(21*321)) 98.6738
	best keyword for cluster 6750739 is PF00615 with Jaccard = 0.6855	[ 194	33	1099928	56 ]	0.8546 0.7760
sibling [ 6750739 ] : 6752634 0.0118812 (=6/(1*505)) 98.8126
	best keyword for cluster 6752634 is PF00787 with Jaccard = 0.6625	[ 424	21	1099571	195 ]	0.9528 0.6850

	SUGGESTING RELATEDNESS OF:   
	A> PF00615 ( PF00615   Regulator of G protein signaling domain )
	B> PF00787 ( PF00787   PX domain )

	Only A has a clan ( CL0272.2 ).

	the two keywords coincide on Uniref90 proteins: |PF00615| = 250 ,	 |PF00787| = 619 , 	|PF00615^PF00787| = 7	( 2.8% and 1.1% )

	both PF00615 and PF00787 have PDB structures
		
	SUPERFAM mapping significantly overlapping:
     1	PF00787 SSF64268 0.897 (average over 1424 mutual instances, PF00787 1915 appearances, SSF64268 2671 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:
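
The linkage values on the parent / given / sibling lines can be checked against the formula in the legend, %LINKAGE = |existing_edges| / (|cluster1| * |cluster2|), with cluster size |cluster1| + |cluster2|. The following is a minimal sketch that parses the "(=edges/(n1*n2))" fragment of such a line. The regular expression and function name are assumptions made for illustration, not the original tooling.

```python
import re

# Matches the "(=2024/(342*506))" part of a relation line.
LINKAGE_RE = re.compile(r"\(=(\d+)/\((\d+)\*(\d+)\)\)")


def linkage(line):
    """Return (linkage ratio, merged cluster size) parsed from a relation line."""
    match = LINKAGE_RE.search(line)
    if match is None:
        raise ValueError("no '(=edges/(n1*n2))' fragment found")
    edges, size1, size2 = (int(group) for group in match.groups())
    return edges / (size1 * size2), size1 + size2


# Parent line of record ( 682 ).
ratio, size = linkage(" parent [ 6750739 ] : 6755081 0.0116959 (=2024/(342*506)) 98.9747")
print(f"{ratio:.7f} {size}")
```

In this record the sizes are self-consistent: the given cluster (21 + 321 = 342) and the sibling cluster (1 + 505 = 506) are exactly the two factors on the parent line.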






-------------------====== ( 683 ) 6443664_PF03131_PF08383            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF08383  is 6209650 with Jaccard = 0.6818	|PF08383|=22 [ 15	0	1100189	7 ]
 parent [ 6209650 ] : 6443664 0.991398 (=461/(15*31)) 0.861333
  given [ 6209650 ] : 6209650 1 (=56/(7*8)) 1.80004e-16
	best keyword for cluster 6209650 is PF08383 with Jaccard = 0.6818	[ 15	0	1100189	7 ]	1.0000 0.6818
sibling [ 6209650 ] : 6427635 1 (=30/(1*30)) 0.227124
	best keyword for cluster 6427635 is PF03131 with Jaccard = 0.6200	[ 31	0	1100161	19 ]	1.0000 0.6200

	SUGGESTING RELATEDNESS OF:   
	A> PF08383 ( PF08383   Maf N-terminal region )
	B> PF03131 ( PF03131   bZIP Maf transcription factor )

	Only B has a clan ( CL0018.10 ).

	the two keywords coincide on Uniref90 proteins: |PF03131| = 50 ,	 |PF08383| = 22 , 	|PF03131^PF08383| = 22	( 44.0% and 100.0% )

	only  PF08383 has a PDB structure (may not be up to date)
		PF03131 	   a.37.1.1

	SUPERFAM mapping significantly overlapping:
     1	PF03131 SSF47454 0.500 (average over 115 mutual instances, PF03131 115 appearances, SSF47454 529 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:
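
The two percentages on the "keywords coincide" line appear to be the shared protein count divided by each keyword's own count, in the order the keywords are listed on that line. A short sketch under that assumption (the function name is hypothetical):

```python
def coincidence(size_first, size_second, shared):
    """Return the shared count as a percentage of each keyword's protein set."""
    if size_first <= 0 or size_second <= 0:
        raise ValueError("keyword sets must be non-empty")
    return 100.0 * shared / size_first, 100.0 * shared / size_second


# Record ( 683 ): |PF03131| = 50, |PF08383| = 22, |PF03131^PF08383| = 22.
pct_first, pct_second = coincidence(50, 22, 22)
print(f"( {pct_first:.1f}% and {pct_second:.1f}% )")
```

Here every UniRef90 protein carrying PF08383 also carries PF03131, which is why the second percentage is 100.0%.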






-------------------====== ( 684 ) 6764539_PF00428_PF00466            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00428  is 6753376 with Jaccard = 0.6791	|PF00428|=296 [ 201	0	1099915	95 ]
 parent [ 6753376 ] : 6764539 0.00800582 (=781/(458*213)) 99.4906
  given [ 6753376 ] : 6753376 0.0141509 (=3/(1*212)) 98.8632
	best keyword for cluster 6753376 is PF00428 with Jaccard = 0.6791	[ 201	0	1099915	95 ]	1.0000 0.6791
sibling [ 6753376 ] : 6731084 0.0371991 (=17/(1*457)) 96.8418
	best keyword for cluster 6731084 is PF00466 with Jaccard = 0.9791	[ 374	6	1099829	2 ]	0.9842 0.9947

	SUGGESTING RELATEDNESS OF:   
	A> PF00428 ( PF00428   60s Acidic ribosomal protein )
	B> PF00466 ( PF00466   Ribosomal protein L10 )

	Neither A nor B are assigned a clan.

	the two keywords coincide on Uniref90 proteins: |PF00428| = 296 ,	 |PF00466| = 376 , 	|PF00428^PF00466| = 73	( 24.7% and 19.4% )

	both PF00428 and PF00466 have PDB structures
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 685 ) 6745345_PF00590_PF01890            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01890  is 6712665 with Jaccard = 0.6765	|PF01890|=135 [ 92	1	1100075	43 ]
 parent [ 6712665 ] : 6745345 0.0177431 (=3430/(115*1681)) 98.2598
  given [ 6712665 ] : 6712665 0.0582524 (=72/(12*103)) 94.3492
	best keyword for cluster 6712665 is PF01890 with Jaccard = 0.6765	[ 92	1	1100075	43 ]	0.9892 0.6815
sibling [ 6712665 ] : 6727857 0.0459337 (=1072/(14*1667)) 96.4642
	best keyword for cluster 6727857 is PF00590 with Jaccard = 0.7680	[ 1152	230	1098711	118 ]	0.8336 0.9071

	SUGGESTING RELATEDNESS OF:   
	A> PF01890 ( PF01890   CbiG )
	B> PF00590 ( PF00590   Tetrapyrrole (Corrin/Porphyrin) Methylases )

	Neither A nor B are assigned a clan.

	the two keywords coincide on Uniref90 proteins: |PF00590| = 1270 ,	 |PF01890| = 135 , 	|PF00590^PF01890| = 43	( 3.4% and 31.9% )

	only  PF01890 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 686 ) 6604579_PF05048_PF07602            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF07602  is 6559946 with Jaccard = 0.6765	|PF07602|=32 [ 23	2	1100177	9 ]
 parent [ 6559946 ] : 6604579 0.400558 (=3015/(39*193)) 63.9963
  given [ 6559946 ] : 6559946 0.591667 (=213/(15*24)) 46.7929
	best keyword for cluster 6559946 is PF07602 with Jaccard = 0.6765	[ 23	2	1100177	9 ]	0.9200 0.7188
sibling [ 6559946 ] : 6560014 0.581081 (=3870/(148*45)) 46.8474
	best keyword for cluster 6560014 is PF05048 with Jaccard = 0.6259	[ 92	13	1100064	42 ]	0.8762 0.6866

	SUGGESTING RELATEDNESS OF:   
	A> PF07602 ( PF07602   Protein of unknown function (DUF1565) )
	B> PF05048 ( PF05048   Periplasmic copper-binding protein (NosD) )

	Only A has a clan ( CL0268.2 ).
	the two keywords do not coincide on UniRef90 proteins

	Neither PF07602 nor PF05048 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 687 ) 6543609_PF00516_PF00517            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00517  is 6535611 with Jaccard = 0.6737	|PF00517|=2096 [ 1412	0	1098115	684 ]
 parent [ 6535611 ] : 6543609 0.65931 (=13143489/(14029*1421)) 34.3182
  given [ 6535611 ] : 6535611 0.745948 (=2117/(2*1419)) 29.169
	best keyword for cluster 6535611 is PF00517 with Jaccard = 0.6737	[ 1412	0	1098115	684 ]	1.0000 0.6737
sibling [ 6535611 ] : 6532858 0.758554 (=10641/(1*14028)) 27.5467
	best keyword for cluster 6532858 is PF00516 with Jaccard = 0.8922	[ 13759	0	1084789	1663 ]	1.0000 0.8922

	SUGGESTING RELATEDNESS OF:   
	A> PF00517 ( PF00517   Envelope Polyprotein GP41 )
	B> PF00516 ( PF00516   Envelope glycoprotein GP120 )

	Neither A nor B are assigned a clan.

	the two keywords coincide on Uniref90 proteins: |PF00516| = 15422 ,	 |PF00517| = 2096 , 	|PF00516^PF00517| = 1568	( 10.2% and 74.8% )

	both PF00517 and PF00516 have PDB structures
	PF00517 	   h.3.2.1  j.85.1.1
	PF00516 	   d.172.1.1  j.53.1.1

	SUPERFAM mapping significantly overlapping:
     1	PF00516 SSF56502 0.967 (average over 73444 mutual instances, PF00516 73449 appearances, SSF56502 79042 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 688 ) 6756749_PF00587_PF04073            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF04073  is 6689420 with Jaccard = 0.6721	|PF04073|=427 [ 287	0	1099784	140 ]
 parent [ 6689420 ] : 6756749 0.0106988 (=8799/(335*2455)) 99.0757
  given [ 6689420 ] : 6689420 0.113965 (=1105/(32*303)) 90.1349
	best keyword for cluster 6689420 is PF04073 with Jaccard = 0.6721	[ 287	0	1099784	140 ]	1.0000 0.6721
sibling [ 6689420 ] : 6752593 0.0126175 (=247/(8*2447)) 98.81
	best keyword for cluster 6752593 is PF00587 with Jaccard = 0.7547	[ 1652	525	1098022	12 ]	0.7588 0.9928

	SUGGESTING RELATEDNESS OF:   
	A> PF04073 ( PF04073   YbaK / prolyl-tRNA synthetases associated domain )
	B> PF00587 ( PF00587   tRNA synthetase class II core domain (G, H, P, S and T) )

	Only B has a clan ( CL0040.10 ).

	the two keywords coincide on Uniref90 proteins: |PF00587| = 1664 ,	 |PF04073| = 427 , 	|PF00587^PF04073| = 136	( 8.2% and 31.9% )

	both PF04073 and PF00587 have PDB structures
		
	SUPERFAM mapping significantly overlapping:
     1	PF04073 SSF55826 0.873 (average over 1486 mutual instances, PF04073 1998 appearances, SSF55826 2765 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 689 ) 6719656_PF03029_PF06807            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF03029  is 6581374 with Jaccard = 0.6699	|PF03029|=206 [ 138	0	1100005	68 ]
 parent [ 6581374 ] : 6719656 0.0655866 (=1008/(141*109)) 95.3684
  given [ 6581374 ] : 6581374 0.482014 (=134/(2*139)) 54.1992
	best keyword for cluster 6581374 is PF03029 with Jaccard = 0.6699	[ 138	0	1100005	68 ]	1.0000 0.6699
sibling [ 6581374 ] : 6693227 0.107143 (=45/(4*105)) 90.937
	best keyword for cluster 6693227 is PF06807 with Jaccard = 0.7083	[ 34	13	1100163	1 ]	0.7234 0.9714

	SUGGESTING RELATEDNESS OF:   
	A> PF03029 ( PF03029   Conserved hypothetical ATP binding protein )
	B> PF06807 ( PF06807   Pre-mRNA cleavage complex II protein Clp1 )

	Only A has a clan ( CL0017.14 ).
	the two keywords do not coincide on UniRef90 proteins

	Neither PF03029 nor PF06807 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 690 ) 6666877_PF01291_PF06875            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01291  is 5775091 with Jaccard = 0.6667	|PF01291|=12 [ 8	0	1100199	4 ]
 parent [ 5775091 ] : 6666877 0.181818 (=32/(8*22)) 84.6952
  given [ 5775091 ] : 5775091 1 (=7/(1*7)) 4.29043e-57
	best keyword for cluster 5775091 is PF01291 with Jaccard = 0.6667	[ 8	0	1100199	4 ]	1.0000 0.6667
sibling [ 5775091 ] : 6628228 0.302083 (=29/(6*16)) 73.7835
	best keyword for cluster 6628228 is PF06875 with Jaccard = 0.6667	[ 12	5	1100193	1 ]	0.7059 0.9231

	SUGGESTING RELATEDNESS OF:   
	A> PF01291 ( PF01291   LIF / OSM family )
	B> PF06875 ( PF06875   Plethodontid receptivity factor PRF )

	they come from the same clan: CL0053.9 : PF06875	PF01291	PF02024	PF00143	PF00489	PF02025	PF00727	PF02059	PF00715	PF03487	PF03039	PF07400	PF00726	PF00714	PF00103	PF01109	PF02947	PF00758	PF01110	PF02404
	the two keywords do not coincide on UniRef90 proteins

	only  PF01291 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:
     1	PF06875 SSF47266 0.873 (average over 209 mutual instances, PF06875 212 appearances, SSF47266 2488 appearances)
     2	PF01291 SSF47266 0.910 (average over 29 mutual instances, PF01291 29 appearances, SSF47266 2488 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 691 ) 6726719_PF05747_PF06225            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF05747  is 6677373 with Jaccard = 0.6667	|PF05747|=9 [ 6	0	1100202	3 ]
 parent [ 6677373 ] : 6726719 0.0395349 (=17/(10*43)) 96.3268
  given [ 6677373 ] : 6677373 0.125 (=3/(6*4)) 87.5379
	best keyword for cluster 6677373 is PF05747 with Jaccard = 0.6667	[ 6	0	1100202	3 ]	1.0000 0.6667
sibling [ 6677373 ] : 6696267 0.101307 (=31/(34*9)) 91.6104
	best keyword for cluster 6696267 is PF06225 with Jaccard = 0.8235	[ 28	6	1100177	0 ]	0.8235 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF05747 ( PF05747   Poxvirus N2L protein )
	B> PF06225 ( PF06225   Poxvirus A4/B15 family )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	Neither PF05747 nor PF06225 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 692 ) 6636414_PF05318_PF06692            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF06692  is 6563892 with Jaccard = 0.6667	|PF06692|=2 [ 2	1	1100208	0 ]
 parent [ 6563892 ] : 6636414 0.282051 (=11/(3*13)) 76.0387
  given [ 6563892 ] : 6563892 0.5 (=1/(1*2)) 50
	best keyword for cluster 6563892 is PF06692 with Jaccard = 0.6667	[ 2	1	1100208	0 ]	0.6667 1.0000
sibling [ 6563892 ] : 6590933 0.454545 (=10/(2*11)) 57.6162
	best keyword for cluster 6590933 is PF05318 with Jaccard = 0.8000	[ 12	0	1100196	3 ]	1.0000 0.8000

	SUGGESTING RELATEDNESS OF:   
	A> PF06692 ( PF06692   Melon necrotic spot virus P7B protein )
	B> PF05318 ( PF05318   Tombusvirus movement protein )

	Neither A nor B are assigned a clan.

	the two keywords coincide on Uniref90 proteins: |PF05318| = 15 ,	 |PF06692| = 2 , 	|PF05318^PF06692| = 1	( 6.7% and 50.0% )

	Neither PF06692 nor PF05318 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 693 ) 6750681_PF04239_PF07870            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF07870  is 6661366 with Jaccard = 0.6667	|PF07870|=21 [ 14	0	1100190	7 ]
 parent [ 6661366 ] : 6750681 0.0154044 (=48/(164*19)) 98.6673
  given [ 6661366 ] : 6661366 0.166667 (=3/(1*18)) 83.5733
	best keyword for cluster 6661366 is PF07870 with Jaccard = 0.6667	[ 14	0	1100190	7 ]	1.0000 0.6667
sibling [ 6661366 ] : 6560505 0.565217 (=273/(3*161)) 47.1195
	best keyword for cluster 6560505 is PF04239 with Jaccard = 1.0000	[ 135	0	1100076	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF07870 ( PF07870   Protein of unknown function (DUF1657) )
	B> PF04239 ( PF04239   Protein of unknown function (DUF421) )

	Neither A nor B are assigned a clan.

	the two keywords coincide on Uniref90 proteins: |PF04239| = 135 ,	 |PF07870| = 21 , 	|PF04239^PF07870| = 7	( 5.2% and 33.3% )

	Neither PF07870 nor PF04239 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 694 ) 6561542_PF02955_PF08443            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF08443  is 6526079 with Jaccard = 0.6667	|PF08443|=207 [ 142	6	1099998	65 ]
 parent [ 6526079 ] : 6561542 0.53425 (=11738/(127*173)) 48.0237
  given [ 6526079 ] : 6526079 0.777273 (=1026/(8*165)) 23.7269
	best keyword for cluster 6526079 is PF08443 with Jaccard = 0.6667	[ 142	6	1099998	65 ]	0.9595 0.6860
sibling [ 6526079 ] : 6486085 0.935484 (=348/(3*124)) 7.1594
	best keyword for cluster 6486085 is PF02955 with Jaccard = 1.0000	[ 112	0	1100099	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF08443 ( PF08443   RimK-like ATP-grasp domain )
	B> PF02955 ( PF02955   Prokaryotic glutathione synthetase, ATP-grasp domain )

	they come from the same clan: CL0179.8 : PF01740	PF08443	PF05770	PF02955	PF01071	PF04174	PF07478	PF02786	PF02655	PF08442	PF02222	PF02750	PF03133
	the two keywords do not coincide on UniRef90 proteins

	both PF08443 and PF02955 have PDB structures
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 695 ) 6756225_PF02810_PF05118            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02810  is 6743318 with Jaccard = 0.6580	|PF02810|=409 [ 379	167	1099635	30 ]
 parent [ 6743318 ] : 6756225 0.0100039 (=820/(752*109)) 99.046
  given [ 6743318 ] : 6743318 0.0272359 (=222/(11*741)) 98.0826
	best keyword for cluster 6743318 is PF02810 with Jaccard = 0.6580	[ 379	167	1099635	30 ]	0.6941 0.9267
sibling [ 6743318 ] : 6744839 0.037037 (=4/(1*108)) 98.2222
	best keyword for cluster 6744839 is PF05118 with Jaccard = 0.9425	[ 82	3	1100124	2 ]	0.9647 0.9762

	SUGGESTING RELATEDNESS OF:   
	A> PF02810 ( PF02810   SEC-C motif )
	B> PF05118 ( PF05118   Aspartyl/Asparaginyl beta-hydroxylase )

	Only B has a clan ( CL0029.13 ).

	the two keywords coincide on Uniref90 proteins: |PF02810| = 409 ,	 |PF05118| = 84 , 	|PF02810^PF05118| = 2	( 0.5% and 2.4% )

	both PF02810 and PF05118 have PDB structures
		PF05118 	   b.82.2.4

	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 696 ) 6731963_PF03446_PF03807            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF03807  is 6581897 with Jaccard = 0.6541	|PF03807|=554 [ 363	1	1099656	191 ]
 parent [ 6581897 ] : 6731963 0.0407252 (=22407/(393*1400)) 96.9353
  given [ 6581897 ] : 6581897 0.479381 (=930/(5*388)) 54.3751
	best keyword for cluster 6581897 is PF03807 with Jaccard = 0.6541	[ 363	1	1099656	191 ]	0.9973 0.6552
sibling [ 6581897 ] : 6722456 0.0544 (=17952/(300*1100)) 95.7783
	best keyword for cluster 6722456 is PF03446 with Jaccard = 0.6202	[ 769	453	1098971	18 ]	0.6293 0.9771

	SUGGESTING RELATEDNESS OF:   
	A> PF03807 ( PF03807   NADP oxidoreductase coenzyme F420-dependent )
	B> PF03446 ( PF03446   NAD binding domain of 6-phosphogluconate dehydrogenase )

	they come from the same clan: CL0063.17 : PF03721	PF04820	PF02254	PF00899	PF01946	PF02882	PF01488	PF01118	PF08491	PF03435	PF04321	PF07992	PF00070	PF02719	PF02153	PF02423	PF05368	PF01210	PF07994	PF07993	PF03447	PF03446	PF01225	PF06039	PF01232	PF03949	PF05834	PF00056	PF08659	PF07991	PF03486	PF00044	PF00732	PF01134	PF01408	PF00996	PF00479	PF00743	PF01494	PF00890	PF03807	PF01370	PF00208	PF02670	PF01113	PF01266	PF02629	PF02558	PF01593	PF01262	PF00670	PF00107	PF00106	PF02737	PF01073	PF02826

	the two keywords coincide on Uniref90 proteins: |PF03446| = 787 ,	 |PF03807| = 554 , 	|PF03446^PF03807| = 2	( 0.3% and 0.4% )

	both PF03807 and PF03446 have PDB structures
	PF03807 	   c.2.1.6
	
	SUPERFAM mapping significantly overlapping:
     1	PF03807 SSF51735 0.687 (average over 1471 mutual instances, PF03807 1503 appearances, SSF51735 164772 appearances)
     2	PF03446 SSF51735 0.944 (average over 2816 mutual instances, PF03446 5512 appearances, SSF51735 164772 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 697 ) 6774609_PF01402_PF03681            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01402  is 6772743 with Jaccard = 0.6460	|PF01402|=408 [ 312	75	1099728	96 ]
 parent [ 6772743 ] : 6774609 0.0023806 (=1361/(581*984)) 99.8461
  given [ 6772743 ] : 6772743 0.00334821 (=624/(256*728)) 99.7975
	best keyword for cluster 6772743 is PF01402 with Jaccard = 0.6460	[ 312	75	1099728	96 ]	0.8062 0.7647
sibling [ 6772743 ] : 6766847 0.00561542 (=345/(442*139)) 99.5924
	best keyword for cluster 6766847 is PF03681 with Jaccard = 0.7405	[ 234	81	1099895	1 ]	0.7429 0.9957

	SUGGESTING RELATEDNESS OF:   
	A> PF01402 ( PF01402   Ribbon-helix-helix protein, copG family )
	B> PF03681 ( PF03681   Uncharacterised protein family (UPF0150) )

	Only A has a clan ( CL0057.9 ).

	the two keywords coincide on Uniref90 proteins: |PF01402| = 408 ,	 |PF03681| = 235 , 	|PF01402^PF03681| = 14	( 3.4% and 6.0% )

	only  PF01402 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:
     1	PF01402 SSF47598 0.835 (average over 310 mutual instances, PF01402 324 appearances, SSF47598 883 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 698 ) 6740759_PF01822_PF03659            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01822  is 6732267 with Jaccard = 0.6441	|PF01822|=114 [ 76	4	1100093	38 ]
 parent [ 6732267 ] : 6740759 0.0221798 (=151/(46*148)) 97.8541
  given [ 6732267 ] : 6732267 0.0326087 (=45/(138*10)) 96.9757
	best keyword for cluster 6732267 is PF01822 with Jaccard = 0.6441	[ 76	4	1100093	38 ]	0.9500 0.6667
sibling [ 6732267 ] : 6625911 0.278049 (=57/(5*41)) 72.9853
	best keyword for cluster 6625911 is PF03659 with Jaccard = 0.9268	[ 38	1	1100170	2 ]	0.9744 0.9500

	SUGGESTING RELATEDNESS OF:   
	A> PF01822 ( PF01822   WSC domain )
	B> PF03659 ( PF03659   Glycosyl hydrolase family 71 )

	Neither A nor B are assigned a clan.

	the two keywords coincide on Uniref90 proteins: |PF01822| = 114 ,	 |PF03659| = 40 , 	|PF01822^PF03659| = 2	( 1.8% and 5.0% )

	Neither PF01822 nor PF03659 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 699 ) 6509980_PF00906_PF08290            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF08290  is 6379878 with Jaccard = 0.6429	|PF08290|=28 [ 18	0	1100183	10 ]
 parent [ 6379878 ] : 6509980 0.84639 (=551/(21*31)) 15.4148
  given [ 6379878 ] : 6379878 1 (=20/(1*20)) 0.00049646
	best keyword for cluster 6379878 is PF08290 with Jaccard = 0.6429	[ 18	0	1100183	10 ]	1.0000 0.6429
sibling [ 6379878 ] : 6452822 1 (=30/(1*30)) 1.56867
	best keyword for cluster 6452822 is PF00906 with Jaccard = 0.6444	[ 29	0	1100166	16 ]	1.0000 0.6444

	SUGGESTING RELATEDNESS OF:   
	A> PF08290 ( PF08290   Hepatitis core protein, putative zinc finger )
	B> PF00906 ( PF00906   Hepatitis core antigen )

	Neither A nor B are assigned a clan.

	the two keywords coincide on Uniref90 proteins: |PF00906| = 45 ,	 |PF08290| = 28 , 	|PF00906^PF08290| = 22	( 48.9% and 78.6% )

	only  PF08290 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:
     1	PF00906 SSF47852 0.837 (average over 2080 mutual instances, PF00906 2081 appearances, SSF47852 3170 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 700 ) 6737655_PF00037_PF00384            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00037  is 6720702 with Jaccard = 0.6357	|PF00037|=4327 [ 3387	1001	1094883	940 ]
 parent [ 6720702 ] : 6737655 0.0304517 (=283631/(5218*1785)) 97.5548
  given [ 6720702 ] : 6720702 0.0707303 (=369/(1*5217)) 95.5031
	best keyword for cluster 6720702 is PF00037 with Jaccard = 0.6357	[ 3387	1001	1094883	940 ]	0.7719 0.7828
sibling [ 6720702 ] : 6730037 0.0391315 (=3064/(45*1740)) 96.7255
	best keyword for cluster 6730037 is PF00384 with Jaccard = 0.8944	[ 1381	150	1098667	13 ]	0.9020 0.9907

	SUGGESTING RELATEDNESS OF:   
	A> PF00037 ( PF00037   4Fe-4S binding domain )
	B> PF00384 ( PF00384   Molybdopterin oxidoreductase )

	Neither A nor B are assigned a clan.

	the two keywords coincide on Uniref90 proteins: |PF00037| = 4327 ,	 |PF00384| = 1394 , 	|PF00037^PF00384| = 103	( 2.4% and 7.4% )

	both PF00037 and PF00384 have PDB structures
	PF00037 	   d.58.1.1  d.58.1.2  d.58.1.5  i.4.1.1
	
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 701 ) 6770483_PF00235_PF03259            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF03259  is 6764885 with Jaccard = 0.6301	|PF03259|=170 [ 109	3	1100038	61 ]
 parent [ 6764885 ] : 6770483 0.00354773 (=116/(173*189)) 99.7271
  given [ 6764885 ] : 6764885 0.00632911 (=31/(31*158)) 99.5073
	best keyword for cluster 6764885 is PF03259 with Jaccard = 0.6301	[ 109	3	1100038	61 ]	0.9732 0.6412
sibling [ 6764885 ] : 6759556 0.0144928 (=28/(12*161)) 99.2412
	best keyword for cluster 6759556 is PF00235 with Jaccard = 1.0000	[ 119	0	1100092	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF03259 ( PF03259   Roadblock/LC7 domain )
	B> PF00235 ( PF00235   Profilin )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	both PF03259 and PF00235 have PDB structures
		
	SUPERFAM mapping significantly overlapping:
     1	PF00235 SSF55770 0.955 (average over 395 mutual instances, PF00235 395 appearances, SSF55770 397 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 702 ) 6735807_PF00165_PF01965            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01965  is 6672831 with Jaccard = 0.6287	|PF01965|=1072 [ 674	0	1099139	398 ]
 parent [ 6672831 ] : 6735807 0.0287198 (=79647/(760*3649)) 97.3532
  given [ 6672831 ] : 6672831 0.146631 (=333/(3*757)) 86.3036
	best keyword for cluster 6672831 is PF01965 with Jaccard = 0.6287	[ 674	0	1099139	398 ]	1.0000 0.6287
sibling [ 6672831 ] : 6732735 0.0314039 (=1939/(17*3632)) 97.0212
	best keyword for cluster 6732735 is PF00165 with Jaccard = 0.8007	[ 2559	492	1097015	145 ]	0.8387 0.9464

	SUGGESTING RELATEDNESS OF:   
	A> PF01965 ( PF01965   DJ-1/PfpI family )
	B> PF00165 ( PF00165   Bacterial regulatory helix-turn-helix proteins, AraC family )

	A and B come from different clans ( CL0014.17 , CL0123.12 ).

	the two keywords coincide on Uniref90 proteins: |PF00165| = 2704 ,	 |PF01965| = 1072 , 	|PF00165^PF01965| = 271	( 10.0% and 25.3% )

	both PF01965 and PF00165 have PDB structures
	PF01965 	   c.23.16.2
	PF00165 	   a.4.1.8  i.11.1.1

	SUPERFAM mapping significantly overlapping:
     1	PF00165 SSF46689 0.817 (average over 10023 mutual instances, PF00165 14372 appearances, SSF46689 68153 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 703 ) 6621544_PF02189_PF07213            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02189  is 6488687 with Jaccard = 0.6200	|PF02189|=49 [ 31	1	1100161	18 ]
 parent [ 6488687 ] : 6621544 0.313636 (=138/(40*11)) 71.0381
  given [ 6488687 ] : 6488687 0.944862 (=377/(19*21)) 7.88283
	best keyword for cluster 6488687 is PF02189 with Jaccard = 0.6200	[ 31	1	1100161	18 ]	0.9688 0.6327
sibling [ 6488687 ] : 6538677 0.833333 (=25/(5*6)) 31.3245
	best keyword for cluster 6538677 is PF07213 with Jaccard = 1.0000	[ 4	0	1100207	0 ]	1.0000 1.0000

	SUGGESTING RELATEDNESS OF:   
	A> PF02189 ( PF02189   Immunoreceptor tyrosine-based activation motif )
	B> PF07213 ( PF07213   DAP10 membrane protein )

	Neither A nor B are assigned a clan.
	the two keywords do not coincide on UniRef90 proteins

	only  PF02189 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 704 ) 6626512_PF00095_PF02822            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF02822  is 6517172 with Jaccard = 0.6154	|PF02822|=26 [ 16	0	1100185	10 ]
 parent [ 6517172 ] : 6626512 0.279123 (=484/(17*102)) 73.0889
  given [ 6517172 ] : 6517172 0.865385 (=45/(4*13)) 18.8895
	best keyword for cluster 6517172 is PF02822 with Jaccard = 0.6154	[ 16	0	1100185	10 ]	1.0000 0.6154
sibling [ 6517172 ] : 6625748 0.33 (=66/(2*100)) 72.8359
	best keyword for cluster 6625748 is PF00095 with Jaccard = 0.6715	[ 92	0	1100074	45 ]	1.0000 0.6715

	SUGGESTING RELATEDNESS OF:   
	A> PF02822 ( PF02822   Antistasin family )
	B> PF00095 ( PF00095   WAP-type (Whey Acidic Protein) 'four-disulfide core' )

	Neither A nor B is assigned a clan.

	the two keywords coincide on UniRef90 proteins: |PF00095| = 137 ,	 |PF02822| = 26 , 	|PF00095^PF02822| = 6	( 4.4% and 23.1% )

	both PF02822 and PF00095 have PDB structures
	PF02822 	   g.3.15.1
	
	SUPERFAM mapping significantly overlapping:
     1	PF00095 SSF57256 0.887 (average over 257 mutual instances, PF00095 361 appearances, SSF57256 386 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 705 ) 6719519_PF00488_PF01713            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01713  is 6700202 with Jaccard = 0.6142	|PF01713|=316 [ 199	8	1099887	117 ]
 parent [ 6700202 ] : 6719519 0.0521141 (=9058/(687*253)) 95.3499
  given [ 6700202 ] : 6700202 0.1002 (=1504/(95*158)) 92.214
	best keyword for cluster 6700202 is PF01713 with Jaccard = 0.6142	[ 199	8	1099887	117 ]	0.9614 0.6297
sibling [ 6700202 ] : 6694179 0.0912873 (=373/(6*681)) 91.1387
	best keyword for cluster 6694179 is PF00488 with Jaccard = 0.9400	[ 580	29	1099594	8 ]	0.9524 0.9864

	SUGGESTING RELATEDNESS OF:   
	A> PF01713 ( PF01713   Smr domain )
	B> PF00488 ( PF00488   MutS domain V )

	Only B has a clan ( CL0023.26 ).

	the two keywords coincide on UniRef90 proteins: |PF00488| = 588 ,	 |PF01713| = 316 , 	|PF00488^PF01713| = 94	( 16.0% and 29.7% )

	only  PF01713 has a PDB structure (may not be up to date)
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 706 ) 6771123_PF00579_PF01479            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01479  is 6750128 with Jaccard = 0.6071	|PF01479|=2079 [ 1817	914	1097218	262 ]
 parent [ 6750128 ] : 6771123 0.00392019 (=10137/(3123*828)) 99.7475
  given [ 6750128 ] : 6750128 0.0192498 (=46916/(1594*1529)) 98.6269
	best keyword for cluster 6750128 is PF01479 with Jaccard = 0.6071	[ 1817	914	1097218	262 ]	0.6653 0.8740
sibling [ 6750128 ] : 6769686 0.00483676 (=4/(1*827)) 99.7001
	best keyword for cluster 6769686 is PF00579 with Jaccard = 0.9934	[ 755	1	1099451	4 ]	0.9987 0.9947

	SUGGESTING RELATEDNESS OF:   
	A> PF01479 ( PF01479   S4 domain )
	B> PF00579 ( PF00579   tRNA synthetases class I (W and Y) )

	Only B has a clan ( CL0038.9 ).

	the two keywords coincide on UniRef90 proteins: |PF00579| = 759 ,	 |PF01479| = 2079 , 	|PF00579^PF01479| = 198	( 26.1% and 9.5% )

	both PF01479 and PF00579 have PDB structures
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 707 ) 6704263_PF01436_PF08450            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF01436  is 6692435 with Jaccard = 0.6070	|PF01436|=306 [ 190	7	1099898	116 ]
 parent [ 6692435 ] : 6704263 0.090432 (=9857/(367*297)) 92.9729
  given [ 6692435 ] : 6692435 0.110837 (=225/(7*290)) 90.7586
	best keyword for cluster 6692435 is PF01436 with Jaccard = 0.6070	[ 190	7	1099898	116 ]	0.9645 0.6209
sibling [ 6692435 ] : 6634810 0.273158 (=7201/(269*98)) 75.6875
	best keyword for cluster 6634810 is PF08450 with Jaccard = 0.6741	[ 211	94	1099898	8 ]	0.6918 0.9635

	SUGGESTING RELATEDNESS OF:   
	A> PF01436 ( PF01436   NHL repeat )
	B> PF08450 ( PF08450   SMP-30/Gluconolaconase/LRE-like region )

	they come from the same clan: CL0186.8 : PF03088	PF08450	PF06739	PF07494	PF01011	PF02897	PF07676	PF08801	PF01436	PF06433	PF00058	PF01839	PF00930	PF02239	PF01731	PF00400

	the two keywords coincide on UniRef90 proteins: |PF01436| = 306 ,	 |PF08450| = 219 , 	|PF01436^PF08450| = 1	( 0.3% and 0.5% )

	both PF01436 and PF08450 have PDB structures
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 708 ) 6701752_PF00069_PF00560            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00560  is 6701479 with Jaccard = 0.6048	|PF00560|=5445 [ 3346	87	1094679	2099 ]
 parent [ 6701479 ] : 6701752 0.0780373 (=4048390/(13652*3800)) 92.4987
  given [ 6701479 ] : 6701479 0.0982917 (=4839/(13*3787)) 92.4359
	best keyword for cluster 6701479 is PF00560 with Jaccard = 0.6048	[ 3346	87	1094679	2099 ]	0.9747 0.6145
sibling [ 6701479 ] : 6699288 0.0817025 (=27834/(25*13627)) 92.0576
	best keyword for cluster 6699288 is PF00069 with Jaccard = 0.7678	[ 10431	2211	1086625	944 ]	0.8251 0.9170

	SUGGESTING RELATEDNESS OF:   
	A> PF00560 ( PF00560   Leucine Rich Repeat )
	B> PF00069 ( PF00069   Protein kinase domain )

	A and B come from different clans ( CL0022.25 , CL0016.14 ).

	the two keywords coincide on UniRef90 proteins: |PF00069| = 11375 ,	 |PF00560| = 5445 , 	|PF00069^PF00560| = 530	( 4.7% and 9.7% )

	both PF00560 and PF00069 have PDB structures
		
	SUPERFAM mapping significantly overlapping:
     1	PF00069 SSF56112 0.797 (average over 32363 mutual instances, PF00069 36405 appearances, SSF56112 66637 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 709 ) 6728229_PF00313_PF00545            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF00545  is 6584449 with Jaccard = 0.6032	|PF00545|=63 [ 38	0	1100148	25 ]
 parent [ 6584449 ] : 6728229 0.0355311 (=1455/(45*910)) 96.502
  given [ 6584449 ] : 6584449 0.497696 (=216/(31*14)) 55.0605
	best keyword for cluster 6584449 is PF00545 with Jaccard = 0.6032	[ 38	0	1100148	25 ]	1.0000 0.6032
sibling [ 6584449 ] : 6690491 0.116722 (=423/(4*906)) 90.3619
	best keyword for cluster 6690491 is PF00313 with Jaccard = 0.9095	[ 724	65	1099415	7 ]	0.9176 0.9904

	SUGGESTING RELATEDNESS OF:   
	A> PF00545 ( PF00545   ribonuclease )
	B> PF00313 ( PF00313   'Cold-shock' DNA-binding domain )

	Only B has a clan ( CL0021.12 ).

	the two keywords coincide on UniRef90 proteins: |PF00313| = 731 ,	 |PF00545| = 63 , 	|PF00313^PF00545| = 1	( 0.1% and 1.6% )

	both PF00545 and PF00313 have PDB structures
	PF00313 	   b.40.4.5

	SUPERFAM mapping significantly overlapping:
     1	PF00313 SSF50249 0.971 (average over 2759 mutual instances, PF00313 2782 appearances, SSF50249 52669 appearances)
     2	PF00545 SSF53933 0.914 (average over 162 mutual instances, PF00545 165 appearances, SSF53933 193 appearances)


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes:






-------------------====== ( 710 ) 6673211_PF02568_PF03054            ==========------------------
====== the next two brothers (given, sibling) merge two good clusters      ======================
======  (given is best cluster, sibling has J > 0.6 threshold)             ======================
======  for two separate keywords - are the keywords related?              ======================
best cluster for keyword PF03054  is 6536179 with Jaccard = 0.6004	|PF03054|=457 [ 275	1	1099753	182 ]
 parent [ 6536179 ] : 6673211 0.170046 (=9469/(301*185)) 86.3916
  given [ 6536179 ] : 6536179 0.764214 (=457/(2*299)) 29.6917
	best keyword for cluster 6536179 is PF03054 with Jaccard = 0.6004	[ 275	1	1099753	182 ]	0.9964 0.6018
sibling [ 6536179 ] : 6625216 0.292549 (=746/(15*170)) 72.55
	best keyword for cluster 6625216 is PF02568 with Jaccard = 0.7976	[ 134	32	1100043	2 ]	0.8072 0.9853

	SUGGESTING RELATEDNESS OF:   
	A> PF03054 ( PF03054   tRNA methyl transferase )
	B> PF02568 ( PF02568   Thiamine biosynthesis protein (ThiI) )

	they come from the same clan: CL0039.7 : PF00764	PF00733	PF01171	PF01902	PF06508	PF02540	PF01507	PF02568	PF03054
	the two keywords do not coincide on UniRef90 proteins

	both PF03054 and PF02568 have PDB structures
		
	SUPERFAM mapping significantly overlapping:


	HMM-Logos (old pfam site):A , B  MSA-full-fasta: A,B

manual inspection notes: