cocci.inria.fr archive mirror
 help / color / mirror / Atom feed
* Re: [Cocci] Determination of an usage statistic for macro calls “for_each…node…”
       [not found] <42184243.d196fk2T0j@sonne>
@ 2020-10-17  5:38 ` Markus Elfring
  2020-10-18 11:16   ` Markus Elfring
  0 siblings, 1 reply; 2+ messages in thread
From: Markus Elfring @ 2020-10-17  5:38 UTC (permalink / raw)
  To: Sumera Priyadarsini, Coccinelle
  Cc: Michal Marek, Gilles Muller, kernel-janitors, Nicolas Palix,
	linux-kernel, Julia Lawall

> …
> > +(
> > +for_each_node_by_name(n,e1) S
> > +|
> …
> > +|
> > +for_each_node_with_property(n,e1) S
> > +)
> …
> 
> 
> Do you indicate any occurrence frequencies or probabilities for the mentioned macro calls
> by the ordering in this disjunction for the semantic patch language?

I would like to share another source code analysis approach.
I hope that this contribution can trigger further helpful software development ideas.


@initialize:python@
@@
import sys, sqlalchemy
sys.stderr.write("\n".join( ("Using SQLAlchemy version:", sqlalchemy.__version__) ))
sys.stderr.write("\n")
from sqlalchemy import Column, Integer, String, create_engine
from sqlalchemy.ext.declarative import declarative_base
from sqlalchemy.orm import sessionmaker
engine = create_engine("sqlite:///:memory:", echo=False)
base = declarative_base()

class action(base):
   __tablename__ = "macros"
   name = Column(String, primary_key=True)
   source_file = Column(String, primary_key=True)
   macro = Column(String, primary_key=True)
   line = Column(Integer, primary_key=True)
   column = Column(Integer, primary_key=True)
   
   def __repr__(self):
      return """<action(name='%s',
source_file='%s',
macro='%s',
line='%s',
column='%s')>""" % (self.name,
                    self.source_file,
                    self.macro,
                    self.line,
                    self.column)

configured_session = sessionmaker(bind=engine)
session = configured_session()
base.metadata.create_all(engine)

def store_position(source, loop):
    """Add data to an internal table."""
    for place in source:
       entry = action(name = place.current_element,
                      source_file = place.file,
                      macro = loop,
                      line = place.line,
                      column = int(place.column) + 1)
       session.add(entry)

@find@
identifier for_loop, work;
iterator name for_each_node_by_name,
              for_each_node_by_type,
              for_each_node_with_property,
              for_each_matching_node,
              for_each_matching_node_and_match,
              for_each_compatible_node,
              for_each_child_of_node,
              for_each_available_child_of_node;
position pos;
statement s;
type t;
@@
 t work(...)
 {
 <+...
(for_each_node_by_name@for_loop@pos(...) s
|for_each_node_by_type@for_loop@pos(...) s
|for_each_matching_node@for_loop@pos(...) s
|for_each_node_with_property@for_loop@pos(...) s
|for_each_compatible_node@for_loop@pos(...) s
|for_each_matching_node_and_match@for_loop@pos(...) s
|for_each_child_of_node@for_loop@pos(...) s
|for_each_available_child_of_node@for_loop@pos(...) s
)
 ...+>
 }

@script:python collection@
fl << find.for_loop;
place << find.pos;
@@
store_position(place, fl)

@finalize:python@
@@
session.commit()
from sqlalchemy import func
entries = session.query(func.count()).select_from(action).scalar()

if entries > 0:
   from sqlalchemy.sql import literal_column
   delimiter = "|"
   sys.stdout.write(delimiter.join(['"source file"', 'macro', 'incidence']))
   sys.stdout.write("\r\n")
   mark = ['"', '', '"']
   
   for file, \
       macro, \
       incidence in session.query(action.source_file,
                                  action.macro,
                                  func.count(literal_column("*"))
                                 ) \
                           .group_by(action.source_file, action.macro) \
                           .order_by(action.source_file, func.count(literal_column("*")).desc()):
      mark[1] = file
      sys.stdout.write(delimiter.join([''.join(mark), macro, str(incidence)]))
      sys.stdout.write("\r\n")

   sys.stdout.write("=====\r\n")
   sys.stdout.write(delimiter.join(['macro', 'incidence']))
   sys.stdout.write("\r\n")

   for macro, \
       incidence in session.query(action.macro, func.count(literal_column("*"))) \
                           .group_by(action.macro) \
                           .order_by(func.count(literal_column("*")).desc()):
      sys.stdout.write(macro + delimiter + str(incidence))
      sys.stdout.write("\r\n")
else:
   sys.stderr.write("No result for this analysis!\n")


Test result:
elfring@Sonne:~/Projekte/Linux/next-patched> git checkout next-20201016 && XX=$(date) && time spatch --python $(which python3) --dir . ~/Projekte/Coccinelle/janitor/report_for_each_node_macro_calls.cocci; YY=$(date) && echo "$XX | $YY"
…
Using SQLAlchemy version:
1.3.19
518 files match
…
=====
macro|incidence
for_each_child_of_node|357
for_each_available_child_of_node|157
for_each_compatible_node|79
for_each_node_by_name|55
for_each_node_by_type|53
for_each_matching_node|22
for_each_matching_node_and_match|16
for_each_node_with_property|6

real	3m26,039s
user	2m3,453s
sys	0m5,041s
Sa 17. Okt 07:00:42 CEST 2020 | Sa 17. Okt 07:04:08 CEST 2020


Can such facts influence the specification of efficient SmPL disjunctions any more?


Would you like to look for software configuration alternatives for better parallel data processing?

Regards,
Markus


_______________________________________________
Cocci mailing list
Cocci@systeme.lip6.fr
https://systeme.lip6.fr/mailman/listinfo/cocci

^ permalink raw reply	[flat|nested] 2+ messages in thread

* Re: [Cocci] Determination of an usage statistic for macro calls “for_each…node…”
  2020-10-17  5:38 ` [Cocci] Determination of an usage statistic for macro calls “for_each…node…” Markus Elfring
@ 2020-10-18 11:16   ` Markus Elfring
  0 siblings, 0 replies; 2+ messages in thread
From: Markus Elfring @ 2020-10-18 11:16 UTC (permalink / raw)
  To: Sumera Priyadarsini, Coccinelle
  Cc: Michal Marek, Gilles Muller, kernel-janitors, Nicolas Palix,
	linux-kernel, Julia Lawall

> Would you like to look for software configuration alternatives for better parallel data processing?

I would like to share another source code analysis approach.
I hope that this contribution can trigger further helpful software development ideas.


@initialize:python@
@@
import sys

def write_identifier(source, loop):
    names = []
    for x in source:
       names.append(loop)

    sys.stdout.write("\n".join(names) + "\n")

@find@
identifier fe, x;
iterator name for_each_node_by_name,
              for_each_node_by_type,
              for_each_node_with_property,
              for_each_matching_node,
              for_each_matching_node_and_match,
              for_each_compatible_node,
              for_each_child_of_node,
              for_each_available_child_of_node;
position pos;
statement s;
type t;
@@
 t x(...)
 {
 <+...
(for_each_child_of_node@fe@pos(...) s
|for_each_available_child_of_node@fe@pos(...) s
|for_each_compatible_node@fe@pos(...) s
|for_each_node_by_name@fe@pos(...) s
|for_each_node_by_type@fe@pos(...) s
|for_each_matching_node@fe@pos(...) s
|for_each_matching_node_and_match@fe@pos(...) s
|for_each_node_with_property@fe@pos(...) s
)
 ...+>
 }

@script:python collection@
fe << find.fe;
place << find.pos;
@@
write_identifier(place, fe)


Test result:
elfring@Sonne:~/Projekte/Linux/next-patched> git checkout next-20201016 && XX=$(date) && time spatch --python python3 --jobs 4 --include-headers --no-includes --dir . ~/Projekte/Coccinelle/janitor/report_for_each_node_macro_calls5.cocci | echo "$(echo 'call' && cat)" | csvsql --query 'select call, count(*) from stdin group by call'; YY=$(date) && echo "$XX | $YY"
…
523 files match
…
call,count(*)
for_each_available_child_of_node,158
for_each_child_of_node,359
for_each_compatible_node,80
for_each_matching_node,22
for_each_matching_node_and_match,16
for_each_node_by_name,59
for_each_node_by_type,53
for_each_node_with_property,6

real	0m47,779s
user	2m19,285s
sys	0m1,541s
So 18. Okt 13:13:02 CEST 2020 | So 18. Okt 13:13:50 CEST 2020


Can such facts influence the specification of efficient SmPL disjunctions any more?

Regards,
Markus


_______________________________________________
Cocci mailing list
Cocci@systeme.lip6.fr
https://systeme.lip6.fr/mailman/listinfo/cocci

^ permalink raw reply	[flat|nested] 2+ messages in thread

end of thread, other threads:[~2020-10-18 11:38 UTC | newest]

Thread overview: 2+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
     [not found] <42184243.d196fk2T0j@sonne>
2020-10-17  5:38 ` [Cocci] Determination of an usage statistic for macro calls “for_each…node…” Markus Elfring
2020-10-18 11:16   ` Markus Elfring

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).